tech talks
Sign in
Register
Open main menu
Sign in
Register
Filters
1
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
Filters
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
41 min
Bring AI to your existing databases without complex MLOps
MLOps World - MLOps World & Generative AI World 2024
Duncan Blythe
vector-search
data-layer
data-integration
ai
databases
mlops
llms
python
api
rag
kubernetes
enterprise-ai
31 min
Low-latency Model Inference in Finance: A Close Look at Seldon V2
MLOps World - MLOps World & Generative AI World 2024
Vincent David
Michael Meredith
model-inference
low-latency
service-oriented-architecture
dag
seldon
machine-learning
fintech
kubernetes
custom-resource-definitions
mlops
api-gateway
grpc
kafka
36 min
From Model T to Machine Learning: A Glimpse into Ford's MLOps and Hybrid Infrastructure Strategy
MLOps World - MLOps World & Generative AI World 2024
Naiel Samaan
Valmir Bucaj
on-premise
google-cloud-platform
airflow
mlops
machine-learning
ai
hybrid-cloud
gcp
docker
kubernetes
data-modeling
devops
18 min
Secure Open Source MLOps - Ubuntu Principles
MLOps World - MLOps World & Generative AI World 2024
Andreea Munteanu
Maciej Mazur
ubuntu
data-tokenization
privacy-enhancing-technologies
mlops
kubernetes
kubeflow
confidential-computing
security
compliance
open-source
machine-learning
ai
29 min
LLMs From Dream to Deployed
MLOps World - MLOps World & Generative AI World 2024
Josh Goldstein
chatbots
seldon
llm
large-language-models
machine-learning
mlops
deployment
retrieval-augmented-generation
rag
kubernetes
openai
hugging-face
gpu
41 min
Lessons learned from scaling large language models in production
MLOps World - MLOps World & Generative AI World 2024
Matt Squire
ray-serve
large-language-models
llm
rag
mlops
gpu
performance-optimization
inference
scaling
python
fastapi
kubernetes
vm
vector-database
36 min
From ML Repository to ML Production Pipeline
MLOps World - MLOps World & Generative AI World 2024
Jakub Witkowski
Dariusz Adamczyk
production-pipelines
ml-repository
mlops
machine-learning
devops
docker
kubernetes
ci-cd
kubeflow
data-science
gpu
automation
23 min
Leverage Kubernetes To Optimize the Utilization of Your AI Accelerators
MLOps World - MLOps World & Generative AI World 2024
Nathan Beach
accelerators
kubernetes
kubernetes-engine
ai
gpu
optimization
training
inference
workloads
resource-utilization
cloud-computing
52 min
MLOps for Time Series in Production
MLOps World - MLOps World & Generative AI World 2024
Eddie Mattia
mlops
time-series
machine-learning
data-science
python
xgboost
metaflow
outerbounds
batch-inference
data-modeling
postgresql
kubernetes
ci-cd
29 min
Creating our own Private OpenAI API
MLOps World - MLOps World & Generative AI World 2024
Meryem Arik
Hannes Hapke
large-language-models
llms
private-api
openai-api
self-hosting
mlops
generative-ai
inference-optimization
quantization
gpu-utilization
api-gateway
kubernetes
24 min
Large Language Model Training and Serving at LinkedIn
MLOps World - MLOps World & Generative AI World 2024
Dre Olgiati
llm
large-language-models
ai
machine-learning
mlops
training
gpu
kubernetes
python
tensorflow
pytorch
kernels
optimization
memory-management
transformer
35 min
Challenges of Modern Application Delivery: A Retrospection of KubeVela Highlight Technologies
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Jianbo Sun
Da Yin
kubevela
application-delivery
cloud-native
kubernetes
cncf
platform-engineering
ci-cd
observability
resource-management
multi-cloud
orchestration
devops
1
2
3
4
5
…
Next ›
Last »