We're Doing RAG All Wrong—and How We Can Do So Much Better

December 08, 2024 5 min Free

MLOps World - MLOps World & Generative AI World 2024

rag llms embeddings vector-database retrieval-augmented-generation ai machine-learning prompt-engineering natural-language-processing feature-stores

Description

This talk challenges the conventional approach to Retrieval-Augmented Generation (RAG), which typically involves embedding the user's query, running a nearest-neighbor search over chunked text in a vector database, and fitting the results into a prompt. The speaker argues that this method is suboptimal and misses opportunities to maximize the relevant information within the context window. The session examines why the current RAG strategy is limiting and explores smarter techniques for optimizing context, enhancing retrieval, and unlocking the full potential of RAG systems. The speaker, Simba Khadder, Founder & CEO of Featureform, draws on his experience with recommender systems and ML infrastructure.
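
For readers unfamiliar with the pipeline the talk critiques, here is a minimal sketch of the conventional flow (chunk, embed, nearest-neighbor search, fit into a prompt). It is an illustration only, not the speaker's code: the bag-of-words `embed` function stands in for a real embedding model, the in-memory sort stands in for a vector database, and every function name and parameter here is hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def chunk(text, size=12):
    """Split text into fixed-size chunks of `size` whitespace-separated words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def retrieve(query, chunks, k=2):
    """Brute-force nearest-neighbor search: rank chunks by similarity to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query, retrieved, max_chars=400):
    """Fit retrieved chunks into the prompt until a character budget is used up."""
    context, used = [], 0
    for c in retrieved:
        if used + len(c) > max_chars:
            break
        context.append(c)
        used += len(c)
    return "Context:\n" + "\n".join(context) + "\n\nQuestion: " + query


if __name__ == "__main__":
    docs = [
        "feature stores serve features to models",
        "vector databases index embeddings for nearest neighbor search",
        "bananas are yellow",
    ]
    question = "how do vector databases search embeddings"
    print(build_prompt(question, retrieve(question, docs, k=1)))
```

Note that the only signal used to choose context in this sketch is query-to-chunk similarity, and the budget is filled greedily in rank order; those are the kinds of choices the talk argues leave relevant information out of the context window.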
