The Data Maven

Revolutionize Your AI Development Process with Kestra: The Ultimate AI Pipeline Builder!
As an AI developer, I am always on the lookout for tools that can help me streamline my workflow and make my development process more efficient.
May 9, 2023 • Mayur Choubey

Auto Optimizing Apache Iceberg tables with Tabular: Best practices from a DBA standpoint – Part 1
Welcome to the first part of our blog series on auto-optimizing Apache Iceberg tables with Tabular from a DBA standpoint.
Apr 24, 2023 • Mayur Choubey

From Theory to Practice: Count distinct optimization in Trino / Presto
Approximate count distinct is a powerful technique used in a variety of use cases where exact count distinct is computationally expensive or not…
Apr 18, 2023 • Mayur Choubey

Most Popular
The Power of Three: Using Apache Iceberg, Databricks, and Tabular for Data Engineering
Apr 16, 2023 • Mayur Choubey

Building Serverless Data Pipelines with AWS Lambda, PyIceberg, and Tabular
Apr 14, 2023 • Mayur Choubey

How to create a unified data lake with Tabular in 5 mins
Apr 13, 2023 • Mayur Choubey

Auto Optimizing Apache Iceberg tables with Tabular: Best practices from a DBA standpoint – Part 1
Apr 24, 2023 • Mayur Choubey

From Theory to Practice: Count distinct optimization in Trino / Presto
Apr 18, 2023 • Mayur Choubey

The Power of Three: Using Apache Iceberg, Databricks, and Tabular for Data Engineering
Tabular is a centralized table storage for all of your analytical data that can be utilized anywhere, whereas Apache Iceberg is a high-performance and…
Apr 16, 2023 • Mayur Choubey

Building Serverless Data Pipelines with AWS Lambda, PyIceberg, and Tabular
In this blog, we will cover the benefits of using PyIceberg and Tabular from AWS Lambda and how easy it is to set up, integrate, and build cost…
Apr 14, 2023 • Mayur Choubey

How to create a unified data lake with Tabular in 5 mins
With {AWS EMR, Starburst Trino} as Execution Engines on S3 using Tabular
Apr 13, 2023 • Mayur Choubey

The Data Maven
Data Engineering Publication

© 2025 The Data Maven