The Heavybit Library
The Heavybit Library is an extensive catalog of educational content featuring hundreds of hours of expert presentations, insightful podcasts, and articles focused on helping technical founders achieve breakout success.
Best Practices for Developing Data Pipelines in Regulated Spaces
How to Think About Data Pipelines in Regulated Spaces: Tech teams standing up new AI programs, or scaling existing programs, need...
How to Properly Scope and Evolve Data Pipelines
For Data Pipelines, Planning Matters. So Does Evolution. A data pipeline is a set of processes that extracts, transforms, and...
Data Council 2025: The Data Science & Algorithms Track with Sean Taylor and Jesse Robbins
Heavybit is thrilled to be sponsoring Data Council 2025, and we invite you to join us in Oakland from Apr 22-24 to experience 3...

Platform Builders Ep. #1, The Future of CRM is No CRM with Justin Belobaba
In this inaugural episode of Platform Builders, hosts Christine Spang and Isaac Nassimi of Nylas welcome Justin Belobaba, Founder...
LLM Fine-Tuning: A Guide for Engineering Teams in 2025
General-purpose large language models (LLMs) are built for broad artificial intelligence (AI) applications. The most popular...
Platform Builders Ep. #3, Building Platforms in the AI Era with Ben Rubin of Verify
In this episode of Platform Builders, Christine Spang and Isaac Nassimi chat with Ben Rubin about the evolution of software...
How to Create Data Pipelines
Introduction to Data Pipelines: In today’s data-driven world, developers and product managers rely...
Generationship Ep. #32, Structuring Data with Marcel Kornacker
In episode 32 of Generationship, Rachel speaks with Marcel Kornacker, creator of Pixeltable and a pioneer in database technology....
Platform Builders Ep. #8, The Beauty of Vertical SaaS with John Melas-Kyriazi of Standard Metrics
In episode 8 of Platform Builders, Christine Spang and Isaac Nassimi are joined by John Melas-Kyriazi, founder and CEO of...
The Role of Synthetic Data in AI/ML Programs in Software
Why Synthetic Data Matters for Software: Running AI in production requires a great deal of data to feed to models. Reddit is now...
Data Council 2025: The Databases Track with Sai Krishna Srirampur and Craig Kerstiens
Heavybit is thrilled to be sponsoring Data Council 2025, and we invite you to join us in Oakland from Apr 22-24 to experience 3...
Synthetic Data for AI: Purpose and Use Cases
What to Know About Synthetic Data for AI Programs: For software developers, large language models (LLMs) like ChatGPT can help...