tech talks
33 min
Running Multiple Models on the Same GPU, on Spot Instances
MLOps World - MLOps World & Generative AI World 2024
Oscar Rovira
ml-inference
spot-instances
gpu-fractionalization
gpu
cost-optimization
generative-ai
llm
cloud-computing
aws
gcp
azure
mlops
36 min
Data Versioning in Generative AI: A Pathway to Cost-effective ML
MLOps World - MLOps World & Generative AI World 2024
Dmitry Petrov
dataset-sharing
annotations
generative-ai
ml
data-versioning
machine-learning
dvc
cost-optimization
collaboration
embeddings
41 min
Finding training inefficiencies with CentML DeepView
MLOps World - MLOps World & Generative AI World 2024
Yubo Gao
visual-profiler
training-throughput
batch-size
mixed-precision
tensor-cores
flash-attention
deep-learning
machine-learning
performance-optimization
profiling
gpu-utilization
pytorch
tensorflow
model-deployment
cost-optimization
energy-efficiency
39 min
From Idea to Production: AI Infra for Scaling LLM Apps
MLOps World - MLOps World & Generative AI World 2024
Guy Eshet
llm
ai
ai-infrastructure
llm-ops
prompt-engineering
model-deployment
gpu
data-pipelines
rag
cost-optimization
generative-ai
llm-applications
29 min
Kubernetes Infra SIG: Intro and Updates
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Arnaud Meukam
Davanum Srinivas
kubernetes
kubernetes-infra
sig
cloud-native
infrastructure
aws
gcp
cncf
cost-optimization
ci-cd
supply-chain
registry
37 min
Cloud Computing’s First Economic Recession? Let’s Talk Platform Efficiency
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Aparna Subramanian
Todd Ekenstam
Phillip Wittrock
Nagarajan Chinnakaveti Thulasiraman
cloud-computing
platform-efficiency
cost-optimization
kubernetes
autoscaling
finops
cloud-costs
design
culture
observability
ci-cd
37 min
Use Knative When You Can, and Kubernetes When You Must
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
David Hadas
Michael Maximilien
knative
kubernetes
serverless
microservices
cloud-native
automation
auto-scaling
security
cost-optimization
developer-experience
23 min
Multi-Arch Infrastructure from the Ground up
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Cheryl Hung
multi-arch
infrastructure
kubernetes
arm
x86
ci-cd
cloud-native
aws-graviton
containers
devops
performance
cost-optimization
43 min
Colocate Hadoop YARN with Kubernetes to Save Massive Costs on Big Data
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Irvin Lim
Hailin Xiang
kubernetes
hadoop
yarn
big-data
cost-optimization
resource-management
cgroups
kernel
container-runtimes
scheduler
kubelet
data-infrastructure
31 min
From SBOMs to IBOMs - Know What's Happening in Your Clusters
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Cindy Blake
Ido Neeman
sbom
ibom
cloud-native
infrastructure-management
security
supply-chain-security
asset-management
kubernetes
compliance
cost-optimization
attack-surface-management
drift-detection