Podcast

Deploying LLMs in Production: Fine-Tuning, Retrieval & Open-Source vs API Tradeoffs

S15E3

Open original DataTalks.Club episode

LLMs MLOps open-source production retrieval-augmented generation

Deploying LLMs in Production: Fine-Tuning, Retrieval & Open-Source vs API Tradeoffs

Original Episode

Use these links for the canonical episode and media sources.

Episode Overview

How do you take large language models from experiment to reliable production—balancing fine-tuning, retrieval strategies, and the tradeoffs between open-source models and API services? In this episode, Meryem Arik, a recovering physicist and co-founder of TitanML, walks through practical choices for LLM deployment based on her pivot from computer vision to building tools that make models smaller, cheaper, and easier to run in production.

People

Use these links to connect the episode to guest notes.

Chapter Summary

Use these checkpoints to decide whether to open the source transcript.