Function Calling for LLMs: RAG without a Vector Database

May 16, 2024 40 min Free

MLOps World - MLOps World & Generative AI World 2024

feature-store llm rag function-calling vector-database ai data-modeling natural-language-processing retrieval-augmented-generation api machine-learning

Description

In this talk, Jim Dowling explores extending Retrieval Augmented Generation (RAG) with Function Calling to access structured/tabular data without relying on a vector database. The presentation covers how to enrich tables with metadata and the expressivity of queries that can perform well. It examines function calling in the context of queries to the Hopsworks feature store, which supports extensive metadata and statistics for columns and tables (feature groups) to improve function calling performance. The talk discusses both cloud-hosted LLMs (like GPT-4) and private LLMs, such as Hermes-2 (a fine-tuned mistral 7b LLM), demonstrating practical code examples and architectural patterns.

Up Next

1h 12m

Building Agentic and Multi-Agent Systems with LangGraph

MLOps World - MLOps World & Generative AI World 2024

Greg Loughnane Chris Alexiuk

llm langchain langgraph agentic-systems multi-agent-systems rag function-calling python ai-engineering llm-applications tool-use state-machines

35 min

RAG Hyperparameter Optimization: Translating a Traditional ML Design Pattern to RAG Applications

MLOps World - MLOps World & Generative AI World 2024

Niels Bantilan

hyperparameter-optimization pipelines traditional-ml rag llm mlops ai generative-ai inference orchestration data-quality machine-learning

39 min

From Idea to Production: AI Infra for Scaling LLM Apps

MLOps World - MLOps World & Generative AI World 2024

Guy Eshet

llm ai ai-infrastructure llm-ops prompt-engineering model-deployment gpu data-pipelines rag cost-optimization generative-ai llm-applications

40 min

Customizable RAG Workflows with your Own Data

MLOps World - MLOps World & Generative AI World 2024

Christy Bergman

milvus zilliz semantic-similarity rag generative-ai llm vector-databases langchain hugging-face embeddings retrieval-augmented-generation data-modeling

47 min

LLM Fine-Tuning for Modern AI Teams: How One E-Commerce Unicorn Cut Inference Cost by 90%

MLOps World - MLOps World & Generative AI World 2024

Emmanuel Turlay

inference-cost data-preparation mistral-7b gpt-3.5 cost-reduction llm fine-tuning ai machine-learning e-commerce natural-language-processing model-evaluation

1h 35m

Building GenAI Powered Apps: A Workshop for Software Engineers

MLOps World - MLOps World & Generative AI World 2024

Stefan Krawczyk Hugo Bowne-Anderson

generative-ai genai ai-development llm large-language-models prompt-engineering rag retrieval-augmented-generation python application-development mlops

Back to Home