Function Calling for LLMs: RAG without a Vector Database
May 16, 2024
40 min
Free
feature-store
llm
rag
function-calling
vector-database
ai
data-modeling
natural-language-processing
retrieval-augmented-generation
api
machine-learning
Description
In this talk, Jim Dowling explores extending Retrieval Augmented Generation (RAG) with Function Calling to access structured/tabular data without relying on a vector database. The presentation covers how to enrich tables with metadata and the expressivity of queries that can perform well. It examines function calling in the context of queries to the Hopsworks feature store, which supports extensive metadata and statistics for columns and tables (feature groups) to improve function calling performance. The talk discusses both cloud-hosted LLMs (like GPT-4) and private LLMs, such as Hermes-2 (a fine-tuned mistral 7b LLM), demonstrating practical code examples and architectural patterns.