tech talks
Sign in Register
  • Sign in
  • Register

Tags

Speakers

Events

Sort By

Clear All Filters

Filters

Tags

Speakers

Events

Sort By

Clear All Filters
Lessons learned from scaling large language models in production
41 min

Lessons learned from scaling large language models in production

MLOps World - MLOps World & Generative AI World 2024
Matt Squire
ray-serve large-language-models llm rag mlops gpu performance-optimization inference scaling python fastapi kubernetes vm vector-database
Autoscaling Can Be Reliable: Running Cluster Autoscaler in Prod
25 min

Autoscaling Can Be Reliable: Running Cluster Autoscaler in Prod

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Maciej Pytel
kubernetes cluster-autoscaler autoscaling monitoring debugging cloud-provider gke vm nodes pods

© 2025 Tech Talks. All rights reserved.

Privacy Policy Terms of Service Contact