Super Data Science: ML & AI Podcast with Jon Krohn
Episodes
681: XGBoost: The Ultimate Classifier, with Matt Harrison
23 May 2023
Contributed by Lukas
Unlock the power of XGBoost by learning how to fine-tune its hyperparameters and discover its optimal modeling situations. This and more, when best-se...
680: Automating Industrial Machines with Data Science and the Internet of Things (IoT)
19 May 2023
Contributed by Lukas
Industrial machinery’s dependence on data science, tech stacks to build IoT platforms, and transitioning from data science to product: This week’s...
679: The A.I. and Machine Learning Landscape, with investor George Mathew
16 May 2023
Contributed by Lukas
Generative AI, MLOps, and making smart investments in AI: This week’s episode is critical listening for AI investors and generative AI creators. AI ...
678: StableLM: Open-source "ChatGPT"-like LLMs you can fit on one GPU
12 May 2023
Contributed by Lukas
StableLM, the new family of open-source language models from the brilliant minds behind Stable Diffusion is out! Small, but mighty, these models have ...
677: Digital Analytics with Avinash Kaushik
09 May 2023
Contributed by Lukas
How does one use marketing analytics to drive business success? Avinash Kaushik, Chief Strategy Officer at Croud and former Sr. Director of Global Str...
676: The Chinchilla Scaling Laws
05 May 2023
Contributed by Lukas
Chinchilla AI, and fine-tuning proprietary tasks with large language models: On this week’s Five-Minute Friday, host Jon Krohn outlines the principl...
675: Pandas for Data Analysis and Visualization
02 May 2023
Contributed by Lukas
Wrangling data in Pandas, when to use Pandas, Matplotlib or Seaborn, and why you should learn to create Python packages: Jon Krohn speaks with guest S...
674: Parameter-Efficient Fine-Tuning of LLMs using LoRA (Low-Rank Adaptation)
28 Apr 2023
Contributed by Lukas
Models like Alpaca, Vicuña, GPT4All-J and Dolly 2.0 have relatively small model architectures, but they're prohibitively expensive to train even on a...
673: Taipy, the open-source Python application builder
25 Apr 2023
Contributed by Lukas
Vincent Gosselin, CEO and co-founder of Taipy, an open-source Python library, joins Jon Krohn to discuss how to accelerate productivity in Python and ...
672: Open-source "ChatGPT": Alpaca, Vicuña, GPT4All-J, and Dolly 2.0
21 Apr 2023
Contributed by Lukas
Get started with language models: Learn about the commercial-use options available for your business in this week’s Five-Minute Friday, where host J...
671: Cloud Machine Learning
18 Apr 2023
Contributed by Lukas
Get to grips with AWS, Azure, Google Cloud Platform on this week’s episode. Host Jon Krohn speaks with Kirill Eremenko and Hadelin de Ponteves about...
670: LLaMA: GPT-3 performance, 10x smaller
14 Apr 2023
Contributed by Lukas
How does Meta AI's natural language model, LLaMa compare to the rest? Based on the Chinchilla scaling laws, LLaMa is designed to be smaller but more p...
669: Streaming, reactive, real-time machine learning
11 Apr 2023
Contributed by Lukas
In this episode, Jon Krohn welcomes Adrian Kosowski, Co-Founder and Chief Product Officer at Pathway, who shares insights on streaming data processing...
668: GPT-4: Apocalyptic stepping stone?
07 Apr 2023
Contributed by Lukas
AI risks, RLHF, and inner alignment: GPT stands to give the business world a major boost. But with everyone racing either to develop products that inc...
667: Harnessing GPT-4 for your Commercial Advantage
04 Apr 2023
Contributed by Lukas
GPT-4, augmenting human tasks with AI, and using GPT-4 commercially: Vin Vashishta speaks to host Jon Krohn about how to leverage GPT-4 and outperform...
666: GPT-4
31 Mar 2023
Contributed by Lukas
GPT-4 has landed! But how well does it compare to GPT-3.5? Tune in to hear Jon stack its performance against its predecessor–the results might just ...
665: How to be both socially impactful and financially successful in your data career
28 Mar 2023
Contributed by Lukas
Angel investor and data science consultant Josh Wills sits down with Jon Krohn to discuss his former roles (Google, Slack, and Cloudera) and the essen...
664: MIT Study: ChatGPT Dramatically Increases Productivity
24 Mar 2023
Contributed by Lukas
Can ChatGPT make us better and faster in our work, and is it the future or just another fad? In this episode, Jon Krohn delves into a new study from M...
663: Astonishing CICERO negotiates and builds trust with humans using natural language
21 Mar 2023
Contributed by Lukas
NLP, transformer architectures, and machines beating humans at their own game: Jon Krohn talks to Alexander H. Miller about his work in building a mac...
662: The Most Popular SuperDataScience Podcast Episodes of 2022
17 Mar 2023
Contributed by Lukas
Our list of the top 10 SuperDataScience podcast episodes for 2022 is here. From Pandas to causality, AI breakthroughs and data storytelling, these wer...
661: Designing Machine Learning Systems
14 Mar 2023
Contributed by Lukas
Chip Huyen, co-founder of Claypot AI and author of O'Reilly's best-selling "Designing Machine Learning Systems" is here to share her expertise on desi...
660: Five Ways to Use ChatGPT for Data Science
10 Mar 2023
Contributed by Lukas
ChatGPT is well-known for its potential to disrupt the writing industry, but in what other, perhaps less explored, ways can we use the tool? In this e...
659: Open-Source Tools for Natural Language Processing
07 Mar 2023
Contributed by Lukas
NLP practitioners: this episode is for you. From the awareness of linguistic elements and annotation to getting the necessary people in the room, Vinc...
658: How to Build Data and ML Products Users Love
03 Mar 2023
Contributed by Lukas
What makes data products popular? Brian T. O'Neill, Founder and Principal of Designing for Analytics, returns to the podcast to help us crack the code...
657: How to Learn Data Engineering
28 Feb 2023
Contributed by Lukas
Data engineering educator Andreas Kretz joins Jon Krohn for a 1-hour primer that covers everything you need to know about the most in-demand role in d...
656: A.I. Talent and the Red-Hot A.I. Skills
24 Feb 2023
Contributed by Lukas
How to attract an AI recruiter’s attention: In this episode, Jon Krohn and Tribe AI CEO Jaclyn Rice Nelson break down the key ingredients needed to ...
655: AI ROI: How to get a profitable return on an AI-project investment
21 Feb 2023
Contributed by Lukas
Transparent data science, profitable AI, and what’s missing from a data science education: Pandata’s Data Scientist in Residence Keith McCormick a...
654: Mike Wimmer: The 14-Year-Old A.I. Entrepreneur
17 Feb 2023
Contributed by Lukas
14-year-old AI prodigy Mike Wimmer joins Jon Krohn to discuss his latest projects. Whether he's using AI to help conserve the world's coral reefs or l...
653: Efficiently Glean-ing Insights from Vast Data Warehouses
14 Feb 2023
Contributed by Lukas
Carlos Aguilar, the founder and CEO of Glean, a data exploration and visualization platform, knows a thing or two about starting and growing a tech st...
652: A.I. Speech for the Speechless
10 Feb 2023
Contributed by Lukas
MedTech, communications technology and computer vision: In this Five-Minute Friday, Jon Krohn investigates the technology that allows patients who hav...
651: The Intentional Use of Color in Data Communication
07 Feb 2023
Contributed by Lukas
Data visualizations, color theories and color inclusivity: In this episode, Kate Strachnyi and host Jon Krohn discuss how color can make or break your...
650: SparseGPT: Remove 100 Billion Parameters but Retain 100% Accuracy
03 Feb 2023
Contributed by Lukas
SparseGPT is a noteworthy one-shot pruning technique that can halve the size of large language models like GPT-3 without adversely affecting accuracy....
649: Introduction to Machine Learning
31 Jan 2023
Contributed by Lukas
Looking for a short primer on Machine Learning concepts? SDS Founder Kirill Eremenko and AI expert Hadelin de Ponteves are back, joining Jon Krohn to ...
648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip
27 Jan 2023
Contributed by Lukas
Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team model...
647: Is Data Science Still Sexy?
24 Jan 2023
Contributed by Lukas
Knowledge management, trust of AI, and job automation: Tom Davenport speaks with Jon Krohn about the organizational obstacles to adopting AI, and why ...
646: ChatGPT: How to Extract Commercial Value Today
20 Jan 2023
Contributed by Lukas
Are you still wondering how to get the most out of ChatGPT's game-changing technology? In this week's Five-Minute Friday guest episode, Jon Krohn sits...
645: Machine Learning for Video Games
17 Jan 2023
Contributed by Lukas
Machine learning, security and Call of Duty collide this week as Jon Krohn sits down with Carly Taylor, Lead Machine Learning Engineer for Activision'...
644: A Framework for Big Life Decisions
13 Jan 2023
Contributed by Lukas
Love and money matter in this week’s Five-Minute Friday, as Stanford University’s Myra Strober sits down with Jon Krohn to talk about her latest b...
643: A.I. for Medicine
10 Jan 2023
Contributed by Lukas
AI prediction tools for antibodies and using statistics to prepare healthcare systems for pandemics: host Jon Krohn speaks with Chief Scientist of Bio...
642: Continuous Calendar for 2023
06 Jan 2023
Contributed by Lukas
Looking to shake up your data science productivity in 2023? Switching to a continuous calendar can make all the difference. Jon Krohn shares his new c...
641: Data Science Trends for 2023
03 Jan 2023
Contributed by Lukas
The top data science trends of 2023 are here. Sadie St. Lawrence joins Jon Krohn to share annual predictions on the future of AI. From the data mesh t...
640: What I Learned in 2022
30 Dec 2022
Contributed by Lukas
From AI trends to rediscovering how fun it is to work with colleagues ‘in person’, host Jon Krohn wraps up the year’s best SuperDataScience cont...
639: Simplifying Machine Learning
27 Dec 2022
Contributed by Lukas
Learning Python for beginners is made fun on Mariya Sha’s YouTube and Discord channels, on which she posts hacks, breakdowns and tutorials on everyt...
638: ChatGPT Holiday Greeting
23 Dec 2022
Contributed by Lukas
OpenAI's ChatGPT helps us generate a special holiday greeting this week. Tune in to hear the festive message that this impressive natural language gen...
637: How to Influence Others with Your Data
20 Dec 2022
Contributed by Lukas
It's all about data visualization this week as Jon Krohn welcomes Ann K. Emery, data visualization designer and owner of Depict Data Studio, to the sh...
636: The Equality Machine
16 Dec 2022
Contributed by Lukas
Digital literacy and data bias: Can one reduce or even eradicate the other? Law professor Orly Lobel speaks with SDS host Jon Krohn about Orly’s lat...
635: The Perils of Manually Labeling Data for Machine Learning Models
13 Dec 2022
Contributed by Lukas
Hand labeling data and information bias: Jon Krohn speaks with Watchful CEO Shayan Mohanty about the pitfalls of data analysis when bias comes into th...
634: Model Error Analysis
09 Dec 2022
Contributed by Lukas
Data scientist and author Serg Masís joins Jon Krohn for a Five-Minute Friday episode that touches on model error analysis. Learn how this process ca...
633: Responsible Decentralized Intelligence
06 Dec 2022
Contributed by Lukas
This week's episode is all about Responsible Decentralized Intelligence as award-winning professor and tech entrepreneur, Dawn Song, joins Jon Krohn t...
632: Liquid Neural Networks
02 Dec 2022
Contributed by Lukas
Liquid neural networks are a type of bio-inspired machine learning set to make a huge impact in the field of data analytics. On this week’s Five-Min...
631: Data Analytics Career Orientation
29 Nov 2022
Contributed by Lukas
Interview success, funny memes about data, and stakeholder management: Jon Krohn speaks with Luke Barousse, a full-time YouTuber who produces content ...
630: Resilient Machine Learning
25 Nov 2022
Contributed by Lukas
Jon Krohn sits with Dr. Dan Shiebler at the Open Data Science Conference (ODSC) to dive into the critical components of building resilient machine lea...
629: Software for Efficient Data Science
22 Nov 2022
Contributed by Lukas
Has the term developer advocacy ever left you scratching your head? This week data science developer advocate for JetBrains, Dr. Jodie Burchell, joins...
628: The Critical Human Element of Successful A.I. Deployments
18 Nov 2022
Contributed by Lukas
On this episode of Five-Minute Friday, Jon Krohn speaks from the Open Data Science Conference (ODSC). There, he sits down with author and data scienti...
627: AutoML: Automated Machine Learning
15 Nov 2022
Contributed by Lukas
Jon Krohn speaks with Erin LeDell, H2O.ai’s Chief Machine Learning Scientist. They investigate how AutoML supercharges the data science process, the...
626: Subword Tokenization with Byte-Pair Encoding
11 Nov 2022
Contributed by Lukas
Word tokenization, character tokenization and subword tokenization go head-to-head this week as Jon Krohn delivers a mini-bootcamp on the NLP-related ...
625: Analyzing Blockchain Data and Cryptocurrencies
08 Nov 2022
Contributed by Lukas
Chainalysis' Director of Research, Kim Grauer joins Jon Krohn to explore the state of economic-data analysis on the blockchain. This episode is broug...
624: Imagen Video: Incredible Text-to-Video Generation
04 Nov 2022
Contributed by Lukas
On this week’s Five-Minute Friday, Jon Krohn investigates Imagen Video, Google’s latest model for making video art out of text prompts. Recently p...
623: Data Analyst, Data Scientist, and Data Engineer Career Paths
01 Nov 2022
Contributed by Lukas
Jon Krohn speaks with Shashank Kalanithi, the man who makes a sport out of YouTube and data analytics out of sports. Listen in as he talks about how h...
622: Burnout: Causes and Solutions
28 Oct 2022
Contributed by Lukas
Is burnout on the horizon for you and your team? Christina Maslach, author of the new book "The Burnout Challenge," joins Jon Krohn to help us identif...
621: Blockchains and Cryptocurrencies: Analytics and Data Applications
25 Oct 2022
Contributed by Lukas
Cryptocurrency and blockchain take center stage this week as we welcome Chief Economist at Chainalysis, Philip Gradwell, to discuss the data science a...
620: OpenAI Whisper: General-Purpose Speech Recognition
21 Oct 2022
Contributed by Lukas
What’s your secret to superb audio recognition? Whisper it. We mean that literally—Whisper is the latest in OpenAI’s growing suite of models aim...
619: Tools for Deploying Data Models into Production
18 Oct 2022
Contributed by Lukas
Jon Krohn speaks with Erik Bernhardsson, the man who invented Spotify’s original music recommendation system. They address the different ways to int...
618: The Joy of Atelic Activities
14 Oct 2022
Contributed by Lukas
Telic and atelic activities take center stage this week as Jon Krohn contemplates how our daily actions contribute to our overall sense of fulfillment...
617: Causal Modeling and Sequence Data
11 Oct 2022
Contributed by Lukas
Dr. Sean Taylor, Co-Founder and Chief Scientist of Motif Analytics, joins Jon Krohn this week for yet another perspective on causal modeling. Tune in ...
616: The Four Requirements for Expertise (beyond the 10,000 Hours)
07 Oct 2022
Contributed by Lukas
10,000 hours of study: Will it make you an expert? On this episode of Five-Minute Friday, host Jon Krohn explores whether increasing your skills is ju...
615: How to Ace Your Data Science Interview
04 Oct 2022
Contributed by Lukas
“Being a great data scientist” and “being great at a data science interview” are not one and the same. Jon Krohn speaks with Nick Singh about ...
614: Thriving on Information Overload
30 Sep 2022
Contributed by Lukas
World-leading futurist, author and entrepreneur, Ross Dawson joins us for the first of our extended Five-Minute Friday episodes. As information overwh...
613: Causal Machine Learning
27 Sep 2022
Contributed by Lukas
Dr. Emre Kiciman, Senior Principal Researcher at Microsoft Research joins the podcast to share his world-leading knowledge on causal machine learning....
612: More Guests on Fridays
23 Sep 2022
Contributed by Lukas
Some exciting changes are coming to our popular Five-Minute Friday series! From longer episodes to new guests, tune in to hear what's next. Additional...
611: Open-Ended A.I.: Practical Applications for Humans and Machines
20 Sep 2022
Contributed by Lukas
Dr. Ken Stanley, a world-leading expert on Open-Ended AI and author of the genre-bending book "Why Greatness Cannot be Planned," joins Jon Krohn for a...
610: Who Dares Wins
16 Sep 2022
Contributed by Lukas
On this episode of Five-Minute Friday, host Jon Krohn shares his life motto, “Who dares, wins”, and the sentiment behind it: that to get anywhere ...
609: Data Mesh
13 Sep 2022
Contributed by Lukas
Jon Krohn speaks with Zhamak Dehghani, the empathetic technologist who coined the term “data mesh”. They explore what a data mesh is, and how its ...
608: Daily Habit #11: Assigning Deliverables
09 Sep 2022
Contributed by Lukas
Company meetings should be held to solve problems. So, why do we often feel like the weekly stand-ups and check-ins are a waste of everyone’s time? ...
607: Inferring Causality
06 Sep 2022
Contributed by Lukas
We welcome Dr. Jennifer Hill, Professor of Applied Statistics at New York University, to the podcast this week for a discussion that covers causality,...
606: Four Thousand Weeks
02 Sep 2022
Contributed by Lukas
Four thousand weeks equate to roughly 80 years—a lifetime for those of us lucky enough to get there. What do we choose to do with this time? How can...
605: Upskilling in Data Science and Machine Learning
30 Aug 2022
Contributed by Lukas
Kian Katanforoosh, CEO of Workera and Lecturer at Stanford University, joins Jon Krohn to reveal the tools, frameworks, and machine learning models th...
604: Ignition: A Landmark Nuclear Fusion Milestone is Achieved
26 Aug 2022
Contributed by Lukas
During this week's Five-Minute Friday episode features, Jon explores recent groundbreaking developments in nuclear fusion –ignition–and what that ...
603: Geospatial Data and Unconventional Routes into Data Careers
23 Aug 2022
Contributed by Lukas
Christina Stathopoulos, Analytical Lead for Waze and Adjunct Professor at IE Business School, joins the podcast to shed light on her work with geospat...
602: We Are Living in Ancient Times
19 Aug 2022
Contributed by Lukas
Inspired by a quote from by science fiction writer, Teresa Nielsen Hayden, Jon Krohn reflects on the notion of living in ancient times and the machine...
601: Venture Capital for Data Science
16 Aug 2022
Contributed by Lukas
This week, Sarah Catanzaro, General Partner at Amplify Partners joins Jon for an episode that dives into the venture capital side of data science. Lea...
600: Yoga Nidra Practice with Steve Fazzari
12 Aug 2022
Contributed by Lukas
Rest and relaxation await as Steve Fazzari joins us this week for a special edition of the podcast! Tune in for a rejuvenating session of Yoga Nidra l...
599: MLOps: Machine Learning Operations
09 Aug 2022
Contributed by Lukas
This week, Mikiko Bazeley, Senior Software Engineer at Mailchimp joins the podcast to share her in-depth knowledge of MLOps: Machine Learning Operatio...
598: Getting Kids Excited about STEM Subjects
05 Aug 2022
Contributed by Lukas
Ben Taylor makes a fourth appearance on Five-Minute Friday to discuss the best ways to introduce STEM to children. Tune in to hear the many ways in wh...
597: A.I. Policy at OpenAI
02 Aug 2022
Contributed by Lukas
Dr. Miles Brundage, Head of Policy Research at OpenAI, joins Jon Krohn this week to discuss AI model production, policy, safety, and alignment. Tune i...
596: The A.I. Platforms of the Future
29 Jul 2022
Contributed by Lukas
Ben Taylor returns for a third Five-Minute Friday episode! This week, he looks ahead and digs into what we can expect from the A.I. platforms of the f...
595: Data Engineering 101
26 Jul 2022
Contributed by Lukas
Tune in as Joe Reis and Matt Housley, co-founders of Ternary Data and co-authors of the book “Fundamentals of Data Engineering” join Jon Krohn to ...
594: Why CEOs Care About A.I. More than Other Technologies
22 Jul 2022
Contributed by Lukas
This week, Jon Krohn and A.I. industry veteran Ben Taylor discuss the driving factors that push CEOs to prioritize A.I. over other technologies. Addi...
593: The Real-World Impact of Cross-Disciplinary Data Science Collaboration
19 Jul 2022
Contributed by Lukas
Jon welcomes Professor Philip Bourne, Founding Dean of the School of Data Science at the University of Virginia to discuss his biomedical data science...
592: How to Sell a Multimillion Dollar A.I. Contract
15 Jul 2022
Contributed by Lukas
In this episode, Jon Krohn welcomes A.I. industry veteran Ben Taylor to discuss how to sell multimillion dollar A.I. contracts. Tune in to hear why tr...
591: Simulations and Synthetic Data for Machine Learning
12 Jul 2022
Contributed by Lukas
Mars Buttfield-Addison, PhD Candidate at the University of Tasmania, joins Jon Krohn for a high-energy episode covering everything from Machine Learni...
590: Artificial General Intelligence is Not Nigh (Part 2 of 2)
08 Jul 2022
Contributed by Lukas
In this episode, Jon continues his two-part series on artificial general intelligence (AGI) and why we are unlikely to realize it anytime soon. Listen...
589: Narrative A.I. with Hilary Mason
05 Jul 2022
Contributed by Lukas
Hilary Mason, Co-Founder and CEO of Hidden Door, joins Jon Krohn for a live discussion that explores narrative A.I., emerging ML techniques, and how h...
588: Artificial General Intelligence is Not Nigh
01 Jul 2022
Contributed by Lukas
In this episode, Jon kicks off a two-part series that sees him explore the popular topic of artificial general intelligence and why it might–or migh...
587: Data Engineering for Data Scientists
28 Jun 2022
Contributed by Lukas
Mark Freeman, Senior Data Scientist at Humu, joins Jon Krohn to talk about all things data engineering and offers listeners some critical tips for the...
586: Daily Habit #10: Limit Social Media Use
24 Jun 2022
Contributed by Lukas
In this episode, Jon dives into the popular topic of social media and its impact on his productivity. Tune in to hear how minimizing the use of social...
585: PyMC for Bayesian Statistics in Python
21 Jun 2022
Contributed by Lukas
In this episode, Dr. Thomas Wiecki, Core Developer of the PyMC Library and CEO of PyMC Labs, joins Jon for a masterclass in Bayesian statistics. Tune ...
584: OpenAI Codex
17 Jun 2022
Contributed by Lukas
In this episode, Jon reviews the remarkable natural language model Codex by OpenAI. Learn why it has amassed a waitlist and how you can leverage its p...
583: The State of Natural Language Processing
14 Jun 2022
Contributed by Lukas
In this episode, natural language processing (NLP) expert and Lead Data Scientist at CB Insights, Rongyao Huang, joins Jon Krohn to discuss NLP. Liste...
582: Model Speed vs Model Accuracy
10 Jun 2022
Contributed by Lukas
In this episode, Jon wraps up his three-part series on business value and machine learning. Listen in as he explains why starting with simple models i...