tech talks
8 min
Streamlining AI Deployments
MLOps World - MLOps World & Generative AI World 2024
Vasilis Vagias
ai
llm
mlops
deployment
optimization
inference
compiler
pytorch
docker
gpu
api
40 min
The State and Future of Cloud-Native Model Serving
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Dan Sun
Theofilos Papapanagiotou
mlops
cloud-native
kubernetes
model-serving
kserve
cncf
knative
istio
serverless
scalability
observability
inference
35 min
Efficient Access to Shared GPU Resources: Mechanisms and Use Cases
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Diogo Guerra
Diana Gaponcic
gpu-scheduling
kubernetes
nvidia-mig
time-slicing
resource-management
high-energy-physics
machine-learning
inference
ci-cd
benchmarking