What It Actually Takes to Deploy GenAI Applications to Enterprises: Custom Evaluation Models

May 16, 2024 49 min Free

Description

Alexander Kvamme (Echo AI) and Arjun Bansal (Log10) discuss the challenges and solutions for deploying Generative AI applications, particularly Large Language Models (LLMs), in enterprise environments. They highlight Echo AI's experience deploying their conversational intelligence platform to large retail brands, addressing issues with LLM accuracy and the need for scalable solutions. The talk covers iterative prompt engineering, collaborative workflows for prompt optimization, and the importance of an end-to-end LLMOps workflow. They emphasize how platforms like Log10 enable efficient resolution of accuracy issues, scaling enterprise customers, and leveraging AI-powered assistance. The discussion also touches upon infrastructure requirements for enterprise-scale deployment, including logging, debugging, prompt optimization, and seamless integration with existing AI tech stacks. Log10's role in automating LLM evaluation and improving model accuracy through prompt optimization and fine-tuning is also detailed.

What It Actually Takes to Deploy GenAI Applications to Enterprises: Custom Evaluation Models

Description

Up Next

LLMidas' Touch; Safely Adopting GenAI for Production Use Cases

Optimizing LLM Apps Through Usage: Implicit Feedback, Given Explicitly

Running prompts at CI does not make your GenAI app enterprise ready

10x Faster AI Evaluations to Ship AI Apps at Lightning Speed

From Idea to Production: AI Infra for Scaling LLM Apps

Lessons Learned Productionizing LLMs for Stripe Support