DataTalks.Club

From Biotechnology to Bioinformatics Software - Sebastian Ayala Ruano

In this talk, Sebastian, a bioinformatics researcher and software engineer, shares his inspiring journey from wet lab biotechnology to computational bioinformatics. Hosted by DataTalks.Club, this session explores how data science, AI, and open-source tools are transforming modern biological research, from DNA sequencing to metagenomics and protein structure prediction.

You’ll learn about:

- The difference between wet lab and dry lab workflows in biotechnology

- How bioinformatics enables faster insights through data-driven modeling

- The MicW2Graph project and its role in studying wastewater microbiomes

- Using co-abundance networks and the CCLasso algorithm to map microbial interactions

- How AlphaFold revolutionized protein structure prediction

- Building scientific knowledge graphs to integrate biological metadata

- Open-source tools like VueGen and VueCore for automating reports and visualizations

- The growing impact of AI and large language models (LLMs) in research and documentation

- Key differences between R (Bioconductor) and Python ecosystems for bioinformatics

This talk is ideal for data scientists, bioinformaticians, biotech researchers, and AI enthusiasts who want to understand how data science, AI, and biology intersect. Whether you work in genomics, computational biology, or scientific software, you’ll gain insights into real-world tools and workflows shaping the future of bioinformatics.

Links:

- MicW2Graph: https://zenodo.org/records/12507444

- VueGen: https://github.com/Multiomics-Analytics-Group/vuegen

- Awesome-Bioinformatics: https://github.com/danielecook/Awesome-Bioinformatics

Timecodes:

- 00:00 Sebastian’s Journey into Bioinformatics

- 06:02 From Wet Lab to Computational Biology

- 08:23 Wet Lab vs Dry Lab Explained

- 12:35 Bioinformatics as Data Science for Biology

- 15:30 How DNA Sequencing Works

- 19:29 MicW2Graph and Wastewater Microbiomes

- 23:10 Building Microbial Networks with CCLasso

- 26:54 Protein–Ligand Simulation Basics

- 29:58 Predicting Protein Folding in 3D

- 33:30 AlphaFold Revolution in Protein Prediction

- 36:45 Inside the MicW2Graph Knowledge Graph

- 39:54 VueGen: Automating Scientific Reports

- 43:56 VueCore: Visualizing Omics Data

- 47:50 Using AI and LLMs in Bioinformatics

- 50:25 R vs Python in Bioinformatics Tools

- 53:17 Closing Thoughts from Ecuador

Connect with Sebastian:

  • Twitter - https://twitter.com/sayalaruano
  • LinkedIn - https://linkedin.com/in/sayalaruano
  • GitHub - https://github.com/sayalaruano
  • Website - https://sayalaruano.github.io/

Connect with DataTalks.Club:

  • Join the community - https://datatalks.club/slack.html
  • Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
  • Check other upcoming events - https://lu.ma/dtc-events
  • GitHub - https://github.com/DataTalksClub
  • LinkedIn - https://www.linkedin.com/company/datatalks-club/
  • Twitter - https://twitter.com/DataTalksClub
  • Website - https://datatalks.club/