Podcast
Bioinformatics Workflows in Practice: Sequencing, Metagenomics, and Open-Source Tools
Open original DataTalks.Club episode
Bioinformatics Workflows in Practice: Sequencing, Metagenomics, and Open-Source Tools
Original Episode
Use these links for the canonical episode and media sources.
- Open the original DataTalks.Club podcast page
- Watch on YouTube
- Listen on Spotify
- Listen on Apple Podcasts
Episode Overview
How do you build reproducible, scalable bioinformatics workflows for sequencing and metagenomics using open-source tools? In this episode we explore practical answers with Sebastian Ayala Ruano, a bioinformatics software developer and Master’s student in Systems Biology at Maastricht University. Sebastian has contributed to open-source projects such as MicW2Graph, VueGen, and VueCore to simplify multi-omics data analysis and has a background in cheminformatics, peptide discovery, and network-based analysis.
People
Use these links to connect the episode to guest notes.
Chapter Summary
Use these checkpoints to decide whether to open the source transcript.
- 0:00 - Podcast Introduction
- 1:09 - Career Transition: Biotechnology to Bioinformatics Software
- 3:41 - Master’s Thesis Overview: Wastewater Microbiome Knowledge Graph
- 6:27 - Bioinformatics Role: Reducing Lab Experiments with Computational Analysis
- 8:23 - Wet Lab vs Dry Lab: Experimental Work vs Computational Pipelines
- 11:21 - Bioinformatics as Data Science: From Sequencing to Analysis
- 12:35 - Genomic Data Basics: Nucleotides and DNA Sequences
- 15:30 - DNA Sequencing Workflow and Reference Genomes
- 17:56 - Metagenomics: Environmental Sampling and Abundance Tables
- 19:41 - Building Microbial Networks: Co-abundance and Association Inference
- 24:31 - Network Inference Methodology: CC Lasso, Correlations, and Thresholding
- 27:06 - Molecular Simulations: Protein–Ligand Dynamics and Water Boxes
- 29:58 - Protein Folding Revolution: AlphaFold Impact on Structure Prediction
- 36:20 - Open-Source Projects Overview: MCW2 Graph, VueGen, and VueCore
- 38:31 - Knowledge Graph Exploration: Neo4j, Streamlit, and Graph Algorithms
- 40:00 - Report Automation with VueGen: Quarto, Streamlit, and Export Formats
- 42:29 - Package Ecosystem: Bioconda, Bioconductor, and Bioinformatics Libraries
- 43:56 - Omics Visualization: VueCore for Genomics, Proteomics, and Metabolomics
- 45:08 - Portfolio Advice: Beginner Bioinformatics Projects and Tools to Showcase
- 47:50 - AI & LLMs in Bioinformatics: Documentation, MLOps, and Coding Assistants
- 50:25 - Language Tradeoffs: R vs Python and Scaling Scientific Tools
- 51:53 - Visualization Workflows: Viewer and Supporting Plotting Libraries
- 53:17 - Remote Work & Field Life: Working from Ecuador and Nature Notes
- 54:10 - Episode Wrap-up: Open-Source Encouragement and Closing Remarks