DataEngPodcast Profile
DataEngPodcast

@DataEngPodcast

Followers
3K
Following
198
Media
1
Statuses
453

A podcast about data engineering and modern data infrastructure. Hosted by @TobiasMacey

Joined December 2016
Don't wanna be here? Send us removal request.
@DataEngPodcast
DataEngPodcast
3 years
In this episode Frank Liu talks about how the open source Towhee library simplifies the work of building pipelines to generate vector embeddings of your data for building machine learning projects.
1
0
4
@DataEngPodcast
DataEngPodcast
3 years
In this episode Nick van Wiggeren talks about the Planetscale serverless MySQL service built on top of the open source Vitess project and the impact on developer productivity that it offers when you don't have to worry about database operations.
2
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Sabin Thomas talks about how Zing Data is lets you bring business intelligence with you when you're on the go with first-class support for mobile devices
1
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Arjun Narayan talks about how to enable organizations of all sizes to take advantage of real-time data, including the technical and organizational investments required to make it happen.
0
1
2
@DataEngPodcast
DataEngPodcast
3 years
In this episode Wes McKinney talks about his work at Voltron Data to support and grow the Arrow project and its integration with the broader data ecosystem
0
8
27
@DataEngPodcast
DataEngPodcast
3 years
In this episode Matt Jaffee talks about FeatureBase, an open source bitmap database that allows you to query and analyze massive data sets at interactive speeds and the work they have done to simplify integration with the rest of your data platform.
1
2
3
@DataEngPodcast
DataEngPodcast
3 years
In this episode Ian Schweer Talks about the data team behind the League of Legends franchise and how they manage to innovate in the face of legacy systems.
0
3
2
@DataEngPodcast
DataEngPodcast
3 years
In this episode Salma Bakouk talks about how to use data entropy as a model for identifying and resolving problems in your data platform before they occur and Sifflet's approach to full stack data observability.
0
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Shane Gibson talks about his work on the AgileData service and how it encodes agile practices into a self-serve platform which allows organizations to deliver reliable data products faster and easier.
0
0
0
@DataEngPodcast
DataEngPodcast
3 years
In this episode Vishnu Venkataraman talks about his work on the data platform at CreditKarma and how it has evolved over the years that he has been there and their journey to the cloud.
0
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Nick King talks about how being deliberate about data creation can produce better and faster results than just consuming whatever data is available
0
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Sonal Goyal talks about Zingg, the open source and customizable framework for scalable entity resolution, data mastering, and cleaning without having to start from scratch
0
0
2
@DataEngPodcast
DataEngPodcast
3 years
In this episode Amir Orad talks about how the Sisense platform power embedded analytics experiences and brings the promise of business intelligence beyond the bounds of prefabricated dashboards and reports.
0
0
0
@DataEngPodcast
DataEngPodcast
3 years
In this episode Nandam Karthik talks about his experiences at Sisense combining Optimus and dbt to deliver analytics projects without the overhead of complex pipeline development so that analysts can own the end-to-end workflow.
0
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Shane Gibson talks about how to apply agile development practices to your data projects while avoiding overwhelming technical debt
0
1
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Manjot Singh, field CTO of MariaDB, talks about their eponymous open source database and how they are continuing to evolve and innovate.
1
1
3
@DataEngPodcast
DataEngPodcast
3 years
In this episode Adrian Kosowski talks about Pathway, the database that thinks, and how it is designed to perform real time analysis on data that powers logistics and supply chains so businesses can survive in the modern economy.
0
0
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Jason Hughes talks about the Dremio product suite and their various contributions to the open data lakehouse ecosystem.
0
5
13
@DataEngPodcast
DataEngPodcast
3 years
In this episode Vusal Dadalov talks about the Iomete platform and how they are building a managed data lakehouse using open technologies and formats without the overhead of running it yourself or paying more than if you hosted it yourself.
0
2
1
@DataEngPodcast
DataEngPodcast
3 years
In this episode Purvi Shah, VP of Enterprise Big Data Platforms, talks about the Customer 360 project at American Express and their journey into the cloud for enterprise data management
0
0
3