tech talks
Sign in
Register
Open main menu
Sign in
Register
Filters
1
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
Filters
Tags
Speakers
Events
Sort By
Newest First
Oldest First
Title A-Z
Title Z-A
Clear All Filters
51 min
Avoid ML OOps with ML Ops: A modular approach to scaling Forethought’s E2E ML Platform
MLOps World - MLOps World & Generative AI World 2024
Salina Wu
sagemaker
spark
dagster
feature-engineering
data-drift
mlops
machine-learning
data-engineering
devops
ci-cd
model-serving
ml-training
39 min
Lessons Learned: The Journey to Real-Time Machine Learning at Instacart
MLOps World - MLOps World & Generative AI World 2024
Guanghua Shu
real-time-ml
instacart
online-inference
streaming-infrastructure
flink
recommendation-systems
machine-learning
ml-platform
feature-store
model-serving
kafka
data-modeling
42 min
Efficiently Fine-Tune And Serve Your Own LLMs
MLOps World - MLOps World & Generative AI World 2024
Alex Sherstinsky
llm-fine-tuning
predibase
ludwig
lorax
large-language-models
lora
parameter-efficient-fine-tuning
peft
transformer-models
mistral-7b
model-serving
inference
31 min
How to Run Your Own LLMs, From Silicon to Service
MLOps World - MLOps World & Generative AI World 2024
Charles Frye
llms
large-language-models
mlops
machine-learning-operations
inference
gpu
quantization
tensorrt-llm
vllm
modal-labs
model-serving
ai-engineering
30 min
Generative AI Infrastructure at Lyft
MLOps World - MLOps World & Generative AI World 2024
Konstantin Gizdarski
generative-ai
ai-infrastructure
mlops
ml-platform
llms
model-serving
model-training
ai-agents
pii-preservation
customer-support-ai
rag
40 min
The State and Future of Cloud-Native Model Serving
KubeCon + CloudNativeCon - KubeCon + CloudNativeCon Europe 2023
Dan Sun
Theofilos Papapanagiotou
mlops
cloud-native
kubernetes
model-serving
kserve
cncf
knative
istio
serverless
scalability
observability
inference
1h 46m
Building a Model Prediction Server
PyCon - PyCon US 2023
Ethan Swan
python
fastapi
scikit-learn
machine-learning
api-development
model-serving
predictive-modeling
data-science
production-deployment