tech talks

Enhance Cost Efficiency in Domain Adaptation with PruneMe
17 min

MLOps World - MLOps World & Generative AI World 2024
Shamane Siri
domain-adaptation continual-pretraining ai-research llm large-language-models pruning cost-efficiency model-optimization transformer nlp
A Practical Guide to Efficient AI
30 min

MLOps World - MLOps World & Generative AI World 2024
Shelby Heinecke
ai artificial-intelligence machine-learning llm large-language-models model-optimization quantization small-language-models function-calling prompt-engineering inference model-efficiency
Mastering Enterprise-Grade LLM Deployment: Overcoming Production Challenges
5 min

MLOps World - MLOps World & Generative AI World 2024
Jaeman An
llm deployment enterprise-ai machine-learning-operations mlops gpu-management model-optimization data-security compliance ai-infrastructure latency-reduction
On-Device ML for LLMs: Post-Training Optimization Techniques with T5 and Beyond
29 min

MLOps World - MLOps World & Generative AI World 2024
Sri Raghu Malireddi
on-device-ml llms t5 model-optimization quantization pruning layer-fusion inference-optimization latency-reduction edge-devices mlops grammarly

