tech talks
Sign in
Register
Open main menu
Sign in
Register
Filters
1
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
Filters
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
41 min
Lessons learned from scaling large language models in production
MLOps World - MLOps World & Generative AI World 2024
Matt Squire
ray-serve
large-language-models
llm
rag
mlops
gpu
performance-optimization
inference
scaling
python
fastapi
kubernetes
vm
vector-database
25 min
Autoscaling Can Be Reliable: Running Cluster Autoscaler in Prod
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Maciej Pytel
kubernetes
cluster-autoscaler
autoscaling
monitoring
debugging
cloud-provider
gke
vm
nodes
pods