The Data Maven
Subscribe
Sign in
Home
Archive
About
Revolutionize Your AI Development Process with Kestra: The Ultimate AI Pipeline Builder!
As an AI developer, I am always on the lookout for tools that can help me streamline my workflow and make my development process more efficient. That's…
May 9, 2023
•
Mayur Choubey
Share this post
Revolutionize Your AI Development Process with Kestra: The Ultimate AI Pipeline Builder!
thedatamaven.substack.com
Copy link
Facebook
Email
Note
Other
April 2023
Auto Optimizing Apache Iceberg tables with Tabular: Best practices from a DBA standpoint – Part 1
Welcome to the first part of our blog series on auto-optimizing Apache Iceberg tables with Tabular from a DBA standpoint. In this series, we will…
Apr 24, 2023
•
Mayur Choubey
1
Share this post
Auto Optimizing Apache Iceberg tables with Tabular: Best practices from a DBA standpoint – Part 1
thedatamaven.substack.com
Copy link
Facebook
Email
Note
Other
From Theory to Practice: Count distinct optimization in Trino / Presto
Approximate count distinct is a powerful technique used in a variety of use cases where exact count distinct is computationally expensive or not…
Apr 18, 2023
•
Mayur Choubey
Share this post
From Theory to Practice: Count distinct optimization in Trino / Presto
thedatamaven.substack.com
Copy link
Facebook
Email
Note
Other
The Power of Three: Using Apache Iceberg, Databricks, and Tabular for Data Engineering
Tabular is a centralized table storage for all of your analytical data that can be utilized anywhere, whereas Apache Iceberg is a high-performance and…
Apr 16, 2023
•
Mayur Choubey
4
Share this post
The Power of Three: Using Apache Iceberg, Databricks, and Tabular for Data Engineering
thedatamaven.substack.com
Copy link
Facebook
Email
Note
Other
Building Serverless Data Pipelines with AWS Lambda, PyIceberg, and Tabular
In this blog, we will cover the benefits of using PyIceberg and Tabular from AWS Lambda and how easy it is to set up, integrate, and build cost…
Apr 14, 2023
•
Mayur Choubey
1
Share this post
Building Serverless Data Pipelines with AWS Lambda, PyIceberg, and Tabular
thedatamaven.substack.com
Copy link
Facebook
Email
Note
Other
How to create a unified data lake with Tabular in 5 mins
With {AWS EMR, Starburst Trino} as Execution Engines on S3 using Tabular
Apr 13, 2023
•
Mayur Choubey
3
Share this post
How to create a unified data lake with Tabular in 5 mins
thedatamaven.substack.com
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts