tech talks
Sign in Register
  • Sign in
  • Register

Tags

Speakers

Events

Sort By

Clear All Filters

Filters

Tags

Speakers

Events

Sort By

Clear All Filters
Running Multiple Models on the Same GPU, on Spot Instances
33 min

Running Multiple Models on the Same GPU, on Spot Instances

MLOps World - MLOps World & Generative AI World 2024
Oscar Rovira
ml-inference spot-instances gpu-fractionalization gpu cost-optimization generative-ai llm cloud-computing aws gcp azure mlops
Data Versioning in Generative AI: A Pathway to Cost-effective ML
36 min

Data Versioning in Generative AI: A Pathway to Cost-effective ML

MLOps World - MLOps World & Generative AI World 2024
Dmitry Petrov
dataset-sharing annotations generative-ai ml data-versioning machine-learning dvc cost-optimization collaboration embeddings
Finding training inefficiencies with CentML DeepView
41 min

Finding training inefficiencies with CentML DeepView

MLOps World - MLOps World & Generative AI World 2024
Yubo Gao
visual-profiler training-throughput batch-size mixed-precision tensor-cores flash-attention deep-learning machine-learning performance-optimization profiling gpu-utilization pytorch tensorflow model-deployment cost-optimization energy-efficiency
From Idea to Production: AI Infra for Scaling LLM Apps
39 min

From Idea to Production: AI Infra for Scaling LLM Apps

MLOps World - MLOps World & Generative AI World 2024
Guy Eshet
llm ai ai-infrastructure llm-ops prompt-engineering model-deployment gpu data-pipelines rag cost-optimization generative-ai llm-applications
Kubernetes Infra SIG: Intro and Updates
29 min

Kubernetes Infra SIG: Intro and Updates

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Arnaud Meukam Davanum Srinivas
kubernetes kubernetes-infra sig cloud-native infrastructure aws gcp cncf cost-optimization ci-cd supply-chain registry
Cloud Computing’s First Economic Recession? Let’s Talk Platform Efficiency
37 min

Cloud Computing’s First Economic Recession? Let’s Talk Platform Efficiency

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Aparna Subramanian Todd Ekenstam Phillip Wittrock Nagarajan Chinnakaveti Thulasiraman
cloud-computing platform-efficiency cost-optimization kubernetes autoscaling finops cloud-costs design culture observability ci-cd
Use Knative When You Can, and Kubernetes When You Must
37 min

Use Knative When You Can, and Kubernetes When You Must

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
David Hadas Michael Maximilien
knative kubernetes serverless microservices cloud-native automation auto-scaling security cost-optimization developer-experience
Multi-Arch Infrastructure from the Ground up
23 min

Multi-Arch Infrastructure from the Ground up

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Cheryl Hung
multi-arch infrastructure kubernetes arm x86 ci-cd cloud-native aws-graviton containers devops performance cost-optimization
Colocate Hadoop YARN with Kubernetes to Save Massive Costs on Big Data
43 min

Colocate Hadoop YARN with Kubernetes to Save Massive Costs on Big Data

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Irvin Lim Hailin Xiang
kubernetes hadoop yarn big-data cost-optimization resource-management cgroups kernel container-runtimes scheduler kubelet data-infrastructure
From SBOMs to IBOMs - Know What's Happening in Your Clusters
31 min

From SBOMs to IBOMs - Know What's Happening in Your Clusters

KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Cindy Blake Ido Neeman
sbom ibom cloud-native infrastructure-management security supply-chain-security asset-management kubernetes compliance cost-optimization attack-surface-management drift-detection

© 2025 Tech Talks. All rights reserved.

Privacy Policy Terms of Service Contact