July 3, 2024

By Arturo Collazo

June 19, 2024

By Ignacio Aristimuño

Exploring RetNet: The Evolution of Transformers

Since 2017, transformers have demonstrated their superiority in performance and computational efficiency, surpassing recurrent neural networks (RNNs). The attention mechanism introduced in the paper ‘Attention is All You Need’ and their ability to parallelize training—a feat traditional RNNs struggled with—attribute this superiority. However, transformers come with a challenge: the memory and inference costs associated with … Continued

Difference between Gemma and Gemini

Introduction Google -particularly its DeepMind squad- has launched a set of lightweights models, called Gemma, the same that were involved in Gemini creation. It is available in two sizes, 2B and 7B. It comes with an outstanding Responsible Generative AI Toolkit, an SFT for several frameworks and ready-to-use libs and colabs. Pre-trained models or customized … Continued

Model Merging: Combining Different Fine-Tuned LLMs

Model Merging 1. Introduction Model composition is a well-known problem in the Machine Learning community. Its aim is to extend the capabilities of a model, without forgetting what it already knows. Let’s consider a situation in which we have a good performing model in a certain task (e.g. Text-to-SQL) but it was fine-tuned for that … Continued

AI in Space: Combining Machine Learning with Satellite Imagery for boosting Agriculture

Introduction In the area of agricultural research, utilizing the power of satellite imagery has become a game-changer. These technologies provide valuable information that enables farmers, researchers, and industry professionals to make informed decisions, optimize resources, and maximize crop yields. However, working with satellite data has its own set of challenges and intricacies that we must … Continued

Prototype your ML project without a single line of code with Azure

Introduction Making a prototype of a Machine Learning solution is crucial, it acts as a cornerstone for effective problem-solving. Because it lets you quickly test and validate ideas, which is key in a field such as Machine Learning, characterized by complex algorithms and unpredictable data patterns. Through this process, teams can quickly assess the viability … Continued

Teaching Diffusion Models Specific Concepts

1. Introduction 1.1 Motivation Have you ever found yourself tirelessly scouring the internet for that one image that perfectly conveys your creative vision, only to come up short? Perhaps you’re a content creator on the quest for visuals that align seamlessly with your ideas. But hours of web surfing yield little more than frustration. Imagine … Continued

Diffusion models for video generation

Introduction Diffusion models have earned a special place in the AI visual Content Generation landscape, dethroning GANs and positioning themselves as the go-to approach when creating realistic content. As technologies like LoRAs and Latent Consistency Models arrived, these models started to be less restrictive in terms of time and computing resources, and new possibilities and … Continued

Adapting Agile Methodologies in Machine Learning: A good match or a fine-tune operation?

Machine learning and data projects, in contrast with traditional software engineering projects, are governed by a resource characterized by unpredictable patterns, distributions, and biases – namely, the data itself. For a successful implementation of projects that give value through inference over datasets, we often consider: Does the collaboration between Agile methodologies and Machine Learning projects … Continued

Edge computing: deploying AI models into multiple edge devices

Imagine you have to develop a computer vision application that must run in an industrial or agricultural environment with limited connectivity. Think, for example, of an application that detects plagues using a camera mounted on agricultural machinery.  Now imagine an application that monitors some machine in an industrial plant and needs to raise a real-time … Continued

Finetuning LLMs: Enhancing Product Descriptions Efficiently

Going on with our blog series on Large Language Models (LLMs) today we will talk about a successful finetuning case. While a well-crafted prompt can get the job done in many scenarios, there are times where it might not be enough. Finetuning steps in when we need results of higher quality compared to using prompts … Continued

Next Page »

Document