GPU for LLMs: What Is a GPU and Why Does It Matter?

Picture of a GPU

If you want to understand GPU for LLMs – why every AI company is spending billions on graphics cards to run large language models – the answer comes down to two things: parallel math and memory. This post explains both, without the handwaving. What is a CPU, and what’s it good at? A CPU – […]

Building a reliable RAG system

Array of floating point numbers representing a vector database

If you arrived here from LinkedIn, this is the longer version of the story. If you have used a RAG system, you have witnessed the “it works sometimes” problem. A query returns the right answer on Tuesday and an unrelated paragraph on Wednesday. The fix is rarely a single change. It is a sequence of […]

Data privacy in the AI era

Man taking photo of bathroom mirror

Moving fast and breaking privacy Let’s be honest – the AI boom has blurred a lot of lines. The barrier to entry has never been lower, and people are building and shipping at incredible speeds. That’s not a bad thing in itself. But somewhere between the excitement and the hustle, privacy has quietly taken a […]

Databricks Mosaic AI – Frequently Asked Questions

Databricks. Mosaic AI

Motivation If you’re part of any enterprise that wants to use Large Language Models (LLMs) like ChatGPT, Mistral, Gemini etc., you know there is always some concern around exposing sensitive organizational data to these LLMs.  Databricks acquired Mosaic AI in 2023 which helps tackle this exact problems – allows users to use any LLM without […]

Running LLM-generated code

cat-sleeping-on-keyboard

How seamless is running a LLM-generated code? These days, everyone and their grandparents are talking about AI taking over human jobs. You cannot scroll through social media without seeing some CEO of a global conglomerate talking about how by 202x traditional programming jobs will be eaten up by AI agents. A new concept called Vibe […]

Is Vibe Coding Right For You?

Vibe Coding by a Cat

In this post, we will go over Vibe Coding and by the time we are done, we hope you will have a clear idea if Vibe Coding is right for your project.  So what is Vibe Coding? Andrej Carpathy, one of the co-founders at OpenAI, came up with this phrase in one of his tweets, […]

Prompt Engineering 101

In this post from our Byte Sized series, we will explore the subtle art of Prompt Engineering that has been gaining a lot of prominence in the realm of AI. The common analogy to describe Prompt Engineering is like teaching a child through asking questions. How effective your Large Language Model’s response is depends on the […]

Navigating the Landscape of Large Language Models

Cloud migration

In the ever-evolving landscape of artificial intelligence, Large Language Models (LLMs) stand out for their unprecedented language mastery, primarily attributed to their large parameter scale. However, LLMs are not without their challenges. This article delves into the intricacies of LLMs, exploring the pain points associated with their deployment and highlighting the role of data, ethical […]