The Data Stack Show

10 Episodes
Subscribe

By: Rudderstack

Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

The PRQL: Exploring the Evolution of AI and ML in E-commerce Search Optimization with Jesse Clark of Marqo.ai
Today at 8:30 AM

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.


197: Deep Dive: How to Build AI Features and Why it is So Dang Hard with Barry McCardle of Hex
Last Wednesday at 8:30 AM

Highlights from this week’s conversation include:

Overview of Hex and its Purpose (0:51)Discussion on AI and Data Collaboration (1:42)Product Updates in Hex (2:14)Challenges of Building AI Features (13:29)Magic Features and AI Context (15:22)Chatbots and UI (17:31)Benchmarking AI Models (19:06)AI as a Judge Pattern (23:32)Challenges in AI Development (25:31)AI in Production and Product Integration (28:43)Difficulties in AI Feature Prediction (33:38)Deterministic template selection and AI model uncertainty (36:21)Infrastructure for AI experimentation and evaluation (40:11)Consolidation and competition in the data stack industry (42:27)Data gravity, integration, and market dynamics (47:12)Enterprise adoption and the bundling and unbundling of platforms (51:03)The open source da...


The PRQL: Why is Building Great AI Features so Hard? Featuring Barry McCardel of Hex
07/08/2024

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.


196: Why Big Query Was a Big Deal, Observability AI, and How AI is Like a Guy at the Bar, Featuring David Wynn of Edge Delta
07/03/2024

Highlights from this week’s conversation include:

David’s Background and Career (0:49)Econometrics Work at UPS (3:14)Challenges with Time Series Data and Tools (7:15)Working at Google Cloud (11:28)BigQuery's Significance (13:51)Comparison of Data Warehouse Products (17:23)Learning different cloud platforms (20:17)Coherence in GCP (23:04)Observability and data analysis (32:44)Support for Iceberg format in BigQuery (36:31)AI in Observability (40:25)AI's Role in Observability (43:39)AI and Mental Models (46:04)Final thoughts and takeaways (48:32)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around buildi...


The PRQL: Google Cloud Deep Dive and Observability AI with David Wynn of Edge Delta
07/01/2024

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.


195: Supply Chain Data Stacks and Snowflake Optimization Pro Tips with Jeff Skoldberg of Green Mountain Data Solutions
06/26/2024

Highlights from this week’s conversation include:

Jeff's Background and Transition to Independent Consulting (0:03)Working at Keurig and Business Model Changes (2:16)Tech Stack Evolution and SAP HANA Implementation (7:33)Adoption of Tableau and Data Pipelines (11:21)Supply Chain Analytics and Timeless Data Modeling (15:49)Impact of Cloud Computing on Cost Optimization (18:35)Challenges of Managing Variable Costs (20:59)Democratization of Data and Cost Impact (23:52)Quality of Fivetran Connectors (27:29)Data Ingestion and Cost Awareness (29:44)Virtual Warehouse Cost Management (31:22)Auto-Scaling and Performance Optimization (33:09)Cost-Saving Frameworks for Business Problems (38:19)Dashboard Frameworks (40:53)Increasing Dashboards (43:29)Final thoughts and takeaways (46:28)

The Data Stack Show is a weekly podcast po...


The PRQL: Breaking down Keurig’s Supply Chain Data Stack with Jeff Skoldberg of Green Mountain Data Solutions
06/24/2024

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.


194: Building Retail Churn Prediction on DuckDB with Clint Dunn of Wilde
06/19/2024

Highlights from this week’s conversation include:

Clint’s Background and Journey in Data (0:51)Starting a Data Career (2:01)Transition to Startup SaaS World (4:27)Clint’s Connection to a Federal Reserve Database (5:31)Challenges in Predictive Modeling (10:27)Data Input Challenges (15:50)Marketers' Workflow and Data Integration (18:29)Soft ROI vs. Hard ROI in Data Analysis (00:21:31)Balancing Internal Marketing and Data Team's Value (22:35)Simplifying Data Inputs for Predictive Models (25:09)Data Analysis Workflow and Tech Stack (29:06)Open Data Formats and Impact on Data Platforms (34:40)The S3 and Ecosystem Model (37:08)In-browser SQL Queries with DuckDB (39:24)Data Security Concerns and Solutions (41:47)Clean Rooms and Data Sharing (43:32)Final...


The PRQL: Hard Data ROI and Productizing Retail Churn Prediction with Clint Dunn of Wilde
06/17/2024

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.


193: Introducing the Cynical Data Guy: Is Data-Driven a Myth?
06/12/2024

Highlights from this week’s conversation include:

Introducing a special edition of the show with the cynical data guy (0:19)Metadata and LLMs (2:32)Data-driven culture (8:44)No-code orchestration tools (17:09)No Code vs. Low Code (19:58)Enterprise Challenges with No Code Solutions (20:08)No Code Tools for Small Companies (21:40)Inappropriate Use of Tools (23:06)Final thoughts and takeaways (24:05)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes acro...