Archive - The Data Maven

Revolutionize Your AI Development Process with Kestra: The Ultimate AI Pipeline Builder!

As an AI developer, I am always on the lookout for tools that can help me streamline my workflow and make my development process more efficient.

May 9, 2023

April 2023

Auto Optimizing Apache Iceberg tables with Tabular: Best practices from a DBA standpoint – Part 1

Welcome to the first part of our blog series on auto-optimizing Apache Iceberg tables with Tabular from a DBA standpoint.

Apr 24, 2023

From Theory to Practice: Count distinct optimization in Trino / Presto

Approximate count distinct is a powerful technique used in a variety of use cases where exact count distinct is computationally expensive or not…

Apr 18, 2023

The Power of Three: Using Apache Iceberg, Databricks, and Tabular for Data Engineering

Tabular is a centralized table store for all of your analytical data that can be used anywhere, whereas Apache Iceberg is a high-performance and…

Apr 16, 2023

Building Serverless Data Pipelines with AWS Lambda, PyIceberg, and Tabular

In this blog, we will cover the benefits of using PyIceberg and Tabular from AWS Lambda and how easy it is to set up, integrate, and build cost…

Apr 14, 2023

How to create a unified data lake with Tabular in 5 mins

With {AWS EMR, Starburst Trino} as Execution Engines on S3 using Tabular

Apr 13, 2023
