Podcast

Build Open-Source NLP Tools: Weak Supervision, LLM Heuristics & Enterprise ML Product Strategy

S13E9

Open original DataTalks.Club episode

NLP machine learning strategy entrepreneurship founder

Build Open-Source NLP Tools: Weak Supervision, LLM Heuristics & Enterprise ML Product Strategy

Original Episode

Use these links for the canonical episode and media sources.

Episode Overview

How can teams scale high-quality NLP labeling without hand-labeling every example? In this episode, Johannes Hötter, data scientist, engineer, and co-founder of kern, explains practical approaches to that problem using weak supervision, heuristics, and open-source tooling. We walk through demos of Refinery and Bricks, with a close look at Refinery’s weak supervision and labeling workflows, and why Jupyter widgets leave a gap for NLP tooling.

People

Use these links to connect the episode to guest notes.

Chapter Summary

Use these checkpoints to decide whether to open the source transcript.