Podcast
DataTalks.Club Behind the Scenes: Alexey Grigorev on Scaling and Growing the Community
Open original DataTalks.Club episode
DataTalks.Club Behind the Scenes: Alexey Grigorev on Scaling and Growing the Community
Original Episode
Use these links for the canonical episode and media sources.
- Open the original DataTalks.Club podcast page
- Watch on YouTube
- Listen on Spotify
- Listen on Apple Podcasts
Episode Overview
How do you scale a grassroots machine learning community from a few forum posts to thousands of active members? In this episode, Alexey Grigorev — founder of DataTalks.Club — sits down with Eugene Yan to walk through the real-world steps behind scaling and growing a machine learning community. Alexey shares his origins (forums, landing page, early events), the growth inflection that led to ~9k members, and practical event formats that work: Open Source Spotlight, Minis, Book of the Week, live coding and office.
People
Use these links to connect the episode to guest notes.
Chapter Summary
Use these checkpoints to decide whether to open the source transcript.
- 0:00 - Podcast Introduction
- 0:09 - Career Transition: Java to Machine Learning (Coursera, Andrew Ng)
- 1:26 - Freelancing, Master’s, and first data-science roles; building data pipelines
- 5:06 - Career Lessons: step outside comfort zone; product mindset; prefer simple
- 6:27 - Principal Data Scientist Role: internal consulting, architecture, mentoring
- 9:36 - Motivation to Start the Community: early interactions and LinkedIn outreach
- 10:05 - Community Origins: forums, landing page, first events and format inspiration
- 16:54 - Community Growth & Events: conference boost and scaling to ~9k members
- 20:22 - Content Production & Automation: planning, scheduling, Zapier, Eventbrite
- 24:38 - Event Formats: Open Source Spotlight, Minis, Book of the Week
- 27:51 - Notable Guests & Popular Episodes: Martin Kleppmann, Elena Samuylova, Santiago
- 31:37 - Monetization & Sponsorship: costs, TopCoder, Toloka crowdsourcing workshop
- 38:22 - ML Bookcamp & Machine Learning Zoomcamp: project-based, end-to-end learning
- 39:06 - Deployment Focus in the Book/Course: Flask, AWS Lambda, Kubernetes, Kubeflow
- 42:49 - Career Advice: join communities, answer questions, find mentors
- 43:55 - Motivation & Persistence: handling frustration and sustaining interest
- 45:40 - Tool Evaluation Strategy: avoid tool churn, follow lasting trends, Kedro
- 48:56 - Productivity & Workflow: public deadlines, accountability, batching work
- 50:31 - Learning by Projects & Notes: just-in-time learning, Notion, READMEs, GitHub
- 53:04 - Community Inspiration & Format Ideas: borrowing from ML Ops and JavaRanch
- 55:07 - Interactive Formats: live coding, office hours, ML Zoomcamp sessions
- 55:56 - Community Thanks & Future Plans
- 56:50 - Podcast Closing