The Dremio Blog

Dremio Blog: Various Insights

Dremio Blog: Various Insights

Streaming Data into Apache Iceberg Tables Using AWS Kinesis and AWS Glue

Learn how to ingest streaming data from AWS Kinesis into Apache Iceberg Tables using AWS Glue, and then query it with Dremio.

Alex Merced
Dremio Blog: Various Insights

Ensuring High Performance at Any Scale with Apache Iceberg’s Object Store File Layout

Object Storage can have some potential bottlenecks when it comes to working with big data. Apache Iceberg’s architecture lends to overcoming these challenges for a scalable table format solution for object storage.

Alex Merced
Dremio Blog: Various Insights

Introduction to Apache Iceberg Using Spark

Learn the basics of Iceberg’s many features and utilities by trying them out in a Spark sandbox.

Alex Merced
Dremio Blog: Various Insights

How Z-Ordering in Apache Iceberg Helps Improve Performance

This tutorial introduces the Z-order clustering algorithm in Apache Iceberg and explains how it adds value to the file optimization strategy.

Dipankar Mazumdar
Dremio Blog: Various Insights

Getting Started with Apache Iceberg in Databricks

Getting started with Apache Iceberg in Databricks is straightforward. This article walks through the setup and usage step by step.

Steve Baldwin
Dremio Blog: Various Insights

The Life of a Write Query for Apache Iceberg Tables

What happens under the hood with Apache Iceberg when you run a write query.

Alex Merced
Dremio Blog: Various Insights

A Hands-On Look at the Structure of an Apache Iceberg Table

This tutorial provides a practical deep dive into the internals of Apache Iceberg using Dremio Sonar as the engine.

Dipankar Mazumdar
Dremio Blog: Various Insights

Future-Proof Partitioning and Fewer Table Rewrites with Apache Iceberg

Avoid unnecessary table rewrites with partition evolution.

Alex Merced
Dremio Blog: Various Insights

Row-Level Changes on the Lakehouse: Copy-On-Write vs. Merge-On-Read in Apache Iceberg

How copy-on-write and merge-on-read work in Apache Iceberg.

Alex Merced
Dremio Blog: Various Insights

The Origins of Apache Arrow & Its Fit in Today’s Data Landscape

This blog post features the history behind Apache Arrow and how it addresses modern challenges in today’s data landscape.

Dipankar Mazumdar
Dremio Blog: Various Insights

Table Format Partitioning Comparison: Apache Iceberg, Apache Hudi, and Delta Lake

Learn about the differences in partitioning with Apache Iceberg, Apache Hudi, and Delta Lake.

Alex Merced
Dremio Blog: Various Insights

Migrating a Hive Table to an Iceberg Table Hands-on Tutorial

Learn how to migrate your existing Hive tables into Apache Iceberg tables to take full advantage of features like Version Rollback, Partition Evolution and more.

Alex Merced
Dremio Blog: Various Insights

Table Format Governance and Community Contributions: Apache Iceberg, Apache Hudi, and Delta Lake

Learn about the differences in the governance and communities behind open source table formats like Apache Iceberg, Apache Hudi, and Delta Lake.

Alex Merced
Dremio Blog: Various Insights

Fewer Accidental Full Table Scans Brought to You by Apache Iceberg’s Hidden Partitioning

Learn about hidden partitioning and why it is such a valuable feature of Apache Iceberg tables.

Alex Merced
Dremio Blog: Various Insights

Comparison of Data Lake Table Formats (Apache Iceberg, Apache Hudi and Delta Lake)

Apache Iceberg, Apache Hudi, and Delta Lake: A Comparison of Data Lake Table Formats

Alex Merced

Streaming Data into Apache Iceberg Tables Using AWS Kinesis and AWS Glue

Ensuring High Performance at Any Scale with Apache Iceberg’s Object Store File Layout

Introduction to Apache Iceberg Using Spark

How Z-Ordering in Apache Iceberg Helps Improve Performance

Getting Started with Apache Iceberg in Databricks

The Life of a Write Query for Apache Iceberg Tables

A Hands-On Look at the Structure of an Apache Iceberg Table

Future-Proof Partitioning and Fewer Table Rewrites with Apache Iceberg

Row-Level Changes on the Lakehouse: Copy-On-Write vs. Merge-On-Read in Apache Iceberg

The Origins of Apache Arrow & Its Fit in Today’s Data Landscape

Table Format Partitioning Comparison: Apache Iceberg, Apache Hudi, and Delta Lake

Migrating a Hive Table to an Iceberg Table Hands-on Tutorial

Table Format Governance and Community Contributions: Apache Iceberg, Apache Hudi, and Delta Lake

Fewer Accidental Full Table Scans Brought to You by Apache Iceberg’s Hidden Partitioning

Comparison of Data Lake Table Formats (Apache Iceberg, Apache Hudi and Delta Lake)

Get Started with a Free Data Lakehouse Powered by Apache Iceberg