tech talks
Sign in
Register
Open main menu
Sign in
Register
Filters
1
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
Filters
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
17 min
Enhance Cost Efficiency in Domain Adaptation with PruneMe
MLOps World - MLOps World & Generative AI World 2024
Shamane Siri
domain-adaptation
continual-pretraining
ai-research
llm
large-language-models
pruning
cost-efficiency
model-optimization
transformer
nlp
30 min
A Practical Guide to Efficient AI
MLOps World - MLOps World & Generative AI World 2024
Shelby Heinecke
ai
artificial-intelligence
machine-learning
llm
large-language-models
model-optimization
quantization
small-language-models
function-calling
prompt-engineering
inference
model-efficiency
5 min
Mastering Enterprise-Grade LLM Deployment: Overcoming Production Challenges
MLOps World - MLOps World & Generative AI World 2024
Jaeman An
llm
deployment
enterprise-ai
machine-learning-operations
mlops
gpu-management
model-optimization
data-security
compliance
ai-infrastructure
latency-reduction
29 min
On-Device ML for LLMs: Post-Training Optimization Techniques with T5 and Beyond
MLOps World - MLOps World & Generative AI World 2024
Sri Raghu Malireddi
on-device-ml
llms
t5
model-optimization
quantization
pruning
layer-fusion
inference-optimization
latency-reduction
edge-devices
mlops
grammarly