Flush to Data

Kris Villez and Jörg Rieckermann

We mix Water, Poo, and Data.The Flush to Data podcast invites conversations about the data, models, and science used in wastewater engineering. Your hosts are Jörg Rieckermann (Eawag, Dübendorf, Switzerland) and Kris Villez (Oak Ridge National Laboratory, TN, USA). The contributions of Kris Villez to this podcast are a reflection of personal opinion only and are not related to any project, study, or opinion at the Oak Ridge National Laboratory or the U.S. Department of Energy. ---- Credits: ---- In our Trailer and intro we use the beautiful music from the "Obliterator" Amiga game by the incredible David Whittaker https://en.wikipedia.org/wiki/David_Whittaker_(video_game_composer) we also use creative common artwork distributed under a CC BY-ND 2.0 license. Check out our homepage for more info --- https://flush2data.gitlab.io

Episodes

  1. 05/16/2023

    [Episode 07] Linda Åmand

    We are back! This is the seventh episode of the Flush to Data podcast. We started with an introduction to the IVL institute, discussed all sorts of data and ended with notes on the unique features of the wastewater industry. In the extras, you'll here about redundancy and data needs. Thanks Linda! Episode guide: [00:00:00] Intro [00:01:40] Who is Linda Åmand? [00:03:40] IVL Swedish Environmental Research Institute - What is it? [00:05:35] Wastewater data [00:08:30] What does a production manager do at a wastewater treatment plant? [00:09:55] Are Nordic countries leading the pack in process automation? and why? [00:15:50] Never trust a sensor? [00:19:00] The lab measurement rules them all [00:24:25] The information hidden in our data [00:29:30] Meta-data and trust in data [00:34:00] Intermezzo! [00:42:15] Why measure? Integrating data into day-to-day decisions [00:48:55] Cost of digital maintenance [00:55:30] Data scientists: Get stuck in wastewater! [01:01:10] Is the wastewater business unique? [01:04:30] Goodbye and see you soon! Extras: [01:05:20] Machine-to-machine communication, cybersecurity [01:08:15] What kind of measurements are missing? [01:11:40] How will work on the plant change? [01:14:30] Integrated management of sewers, treatment plants, and receiving water? [01:19:30] Redundancy and resolution [01:24:30] What needs to be better? [01:28:30] Thank you! Links: Käppalaförbundet: www.kappala.se IVL: www.ivl.se Smart utilities book: https://www.iwapublishing.com/books/9781780407579/smart-water-utilities-complexity-made-simple

    1h 30m
  2. 11/23/2020

    Episode 04 - Open Science

    This is the fourth episode of the Flush to Data podcast. We start of discussing open science. We venture into many related aspects such as scientific reproducibility, scientific career evaluation, software versioning, and the distinction between the scientific process and academic institutes. Hosts: Jörg Rieckermann and Kris Villez Guest: Juan Pablo Carbajal (Hochschule für Technik Rapperswil, Rapperswil, Switzerland) Links: * Juan Pablo's web page: https://sites.google.com/site/juanpicarbajal/ * OSF - Open Science Framework: https://www.cos.io/our-products/osf * A publication on the scientific mission: https://dx.doi.org/10.4161%2Fcib.1.1.6285 * Open peer review: https://peercommunityin.org/ * Book "On Fact and Fraud: Cautionary Tales from the Front Lines of Science ": https://www.jstor.org/stable/j.ctt7s7j4 * Goodhart's law: https://en.wikipedia.org/wiki/Goodhart's_law Episode guide: [00:00:19] Typical approach to sharing of scientific results today vs. new approaches [00:07:20] Is the product of science a pdf document? [00:11:15] Juan Pablo's approach to producing and sharing results [00:19:50] Pre-register your study and methods to mitigate the risk of (inadvertent) p-hacking [00:27:10] Utility of open data, FAIR principles, copyright, licenses, sensitive data [00:45:50] Open peer review [00:54:10] Evaluating scientific careers [01:04:55] Practical measures to achieve the goals of open science [01:07:40] Science is ... [01:08:35] Closing main session [01:09:10] Start bonus session [01:09:15] Is open science a sad story? [01:11:50] Publish or perish [01:15:25] Creating knowledge is easier than ever - Can open science be weaponized? [01:24:04] Why do we publish? Why do we stay in the system?

    1h 32m
  3. 07/10/2020

    Episode 01 - Mechanistic Machine Learning

    This is the first episode of the Flush to Data podcast. We start with a discussion on mechanistic modelling and machine learning and venture into models for emulation, uncertainty quantification, and data quality. Bonus material includes a discussion on aspects of current scientific practice, including the lack of hypothesis testing, the evaluation of novelty, and the challenges with a generalist approach. Hosts: Jörg Rieckermann and Kris Villez Guest: Juan Pablo Carbjal Links: * Juan Pablo's web page: https://sites.google.com/site/juanpicarbajal/ * Article relating Gaussian processes and Kalman filter: www.jstor.org/stable/2984861  * BBC podcast on Gauss: https://www.bbc.co.uk/programmes/b09gbnfj * Using Lake Zurich as a heat sink: Unfortunately, we could not back-track the original source, despite considerable effort. If anyone of the listeners happens to know how to access the original source we would be grateful for a notice. The best we could find was documentation of related projects by Eawag: https://thermdis.eawag.ch/ and [1]. These show that ecological consequences have indeed been assessed in detail.  * Goodhart's law: https://en.wikipedia.org/wiki/Goodhart's_law * An invitation to reproducible computational research: https://doi.org/10.1093/biostatistics/kxq028 * Science in the age of selfies: https://doi.org/10.1073/pnas.1609793113  References: [1] Wüest, A. (2012). Potential zur Wärmeenergienutzung aus dem Zürichsee. Machbarkeit. Wärmeentzug (Heizen) und Einleitung von Kühlwasser. Kastanienbaum: Eawag. DORA-Link  Episode guide: [0:00:00] Who is Juan Pablo Carbajal? [0:03:10] Mechanistic modelling versus artificial intelligence [0:07:08] Who is Juan Pablo Carbajal? (ctd.) [0:09:26] Cross-fertilization between robotics and wastewater engineering [0:15:05] Emulation: using models to approximate other models [0:21:22] Incorporating common sense and prior knowledge into data-driven models [0:31:31] Equivalence between Gaussian processes and Kalman filter [0:33:50] Utility of emulation [0:40:15] Utility of quantified uncertainty [0:44:50] Intermezzo [0:49:04] What can models say about data quality  [1:02:15] How to communicate about data quality? [1:10:10] Preparing engineers for the future [1:15:23] Thank you and goodbye! Bonus material: [1:16:40] Interpretable machine learning models [1:22:33] Hypothesis testing [1:26:14] Critical assessment of novelty [1:30:50] Barriers to the generalist approach  [1:35:48] Thank you and goodbye!

    1h 36m

About

We mix Water, Poo, and Data.The Flush to Data podcast invites conversations about the data, models, and science used in wastewater engineering. Your hosts are Jörg Rieckermann (Eawag, Dübendorf, Switzerland) and Kris Villez (Oak Ridge National Laboratory, TN, USA). The contributions of Kris Villez to this podcast are a reflection of personal opinion only and are not related to any project, study, or opinion at the Oak Ridge National Laboratory or the U.S. Department of Energy. ---- Credits: ---- In our Trailer and intro we use the beautiful music from the "Obliterator" Amiga game by the incredible David Whittaker https://en.wikipedia.org/wiki/David_Whittaker_(video_game_composer) we also use creative common artwork distributed under a CC BY-ND 2.0 license. Check out our homepage for more info --- https://flush2data.gitlab.io