Dremio Blog: News Highlights
Why Your Data Strategy Needs Data Products: Enabling Analytics, AI, and Business Insights
Modern organizations are increasingly reliant on data to drive innovation, optimize operations, and gain a competitive edge. However, extracting meaningful insights from the ever-growing volume of data presents a significant challenge. Despite substantial investments in data infrastructure and specialized teams, many organizations struggle to make their data readily accessible and actionable for decision-making. The traditional centralized approach to data management, while offering control and standardization, often leads to bottlenecks, delays, and frustrated data consumers. This, in turn, can hinder agility, stifle innovation, and ultimately impact the bottom line.
Dremio Blog: Various Insights
Accelerating Analytical Insight – The NetApp & Dremio Hybrid Iceberg Lakehouse Reference Architecture
Organizations are constantly seeking ways to optimize data management and analytics. The Dremio and NetApp Hybrid Iceberg Lakehouse Reference Architecture brings together Dremio’s Unified Lakehouse Platform and NetApp’s advanced data storage solutions to create a high-performance, scalable, and cost-efficient data lakehouse platform. With this solution combining NetApp’s advanced storage technologies with Dremio’s high-performance lakehouse platform, […]
Dremio Blog: Various Insights
8 Tools For Ingesting Data Into Apache Iceberg
Apache Iceberg has an expansive ecosystem. This article provides an overview of eight powerful tools that can facilitate data ingestion into Apache Iceberg, along with resources to help you get started. Whether leveraging Dremio's comprehensive lakehouse platform, using open-source solutions like Apache Spark or Kafka Connect, or integrating with managed services like Upsolver and Fivetran, these tools offer the flexibility and scalability needed to build and maintain an efficient and effective data lakehouse environment.
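As a minimal sketch of one of these ingestion paths, the snippet below appends a small DataFrame to an Apache Iceberg table with Apache Spark. The catalog name (demo), warehouse path, and table name are placeholders, and the Iceberg Spark runtime is assumed to be on the classpath.

```python
from pyspark.sql import SparkSession

# Sketch only: "demo", the warehouse path, and "db.customers" are placeholders,
# and the Iceberg Spark runtime JAR is assumed to be available.
spark = (
    SparkSession.builder
    .appName("iceberg-ingest-sketch")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.demo", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.demo.type", "hadoop")
    .config("spark.sql.catalog.demo.warehouse", "/tmp/iceberg-warehouse")
    .getOrCreate()
)

# Records standing in for any upstream source (files, CDC feeds, streams).
df = spark.createDataFrame([(1, "alice"), (2, "bob")], ["id", "name"])

# Create (or replace) the Iceberg table from the DataFrame via DataFrameWriterV2.
df.writeTo("demo.db.customers").createOrReplace()
```

The same write pattern applies whichever tool produces the DataFrame; the options covered in the article differ mainly in how the data reaches Spark or the catalog.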
Dremio Blog: Various Insights
Evolving the Data Lake: From CSV/JSON to Parquet to Apache Iceberg
The evolution of data storage—from the simplicity of CSV and JSON to the efficiency of Parquet and the advanced capabilities of Apache Iceberg—reflects the growing complexity and scale of modern data needs. As organizations progress through this journey, the Dremio Lakehouse Platform emerges as a crucial ally, offering seamless query capabilities across all these formats and ensuring that your data infrastructure remains flexible, scalable, and future-proof. Whether you're just starting with small datasets or managing a vast data lakehouse, Dremio enables you to unlock the full potential of your data, empowering you to derive insights and drive innovation at every stage of your data journey.
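As a small, hedged illustration of the first step in that evolution, the PyArrow snippet below converts row-oriented CSV into columnar Parquet; the file paths are placeholders, and the final hop into an Iceberg table can follow the Spark pattern sketched earlier.

```python
import pyarrow.csv as pv
import pyarrow.parquet as pq

# Parse a CSV file into an in-memory Arrow table, then persist it as
# columnar Parquet. "events.csv" and "events.parquet" are placeholder paths.
table = pv.read_csv("events.csv")
pq.write_table(table, "events.parquet")
```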
Dremio Blog: Product Insights
Lakehouse Architecture for Unified Analytics – A Data Analyst’s Guide to Accelerated Insights
The medallion architecture is a data flow design for modern data analytics that empowers data analysts to access trusted data, collaborate with colleagues, and uncover invaluable insights quickly and efficiently. By understanding the distinct layers of the data lakehouse and the role each plays in unifying data analytics, analysts can unlock the full potential of their organization's data and drive informed decision-making.
Dremio Blog: Various Insights
3 Reasons to Create Hybrid Apache Iceberg Data Lakehouses
Platforms like Dremio facilitate this hybrid approach by connecting to various data sources and utilizing the Apache Iceberg format, ensuring that your data is always accessible and performant, regardless of where it resides. Whether you are looking to optimize costs, enhance performance, or achieve greater agility, a hybrid data lakehouse could be the perfect solution for your data needs.
Dremio Blog: Various Insights
Advancing the Capabilities of the Premier Data Lakehouse Platform for Apache Iceberg
With the latest release of Dremio, 25.0, we are helping accelerate the adoption and benefits of Apache Iceberg while bringing your users closer to the data with lakehouse flexibility, scalability, and performance at a fraction of the cost. We are excited to announce some of the new features that improve scalability, manageability, ease of use […]
Dremio Blog: Various Insights
From MongoDB to Dashboards with Dremio and Apache Iceberg
Dremio enables directly serving BI dashboards from MongoDB or leveraging Apache Iceberg tables in your data lake. This post explores how Dremio's data lakehouse platform simplifies data delivery for business intelligence by building a prototype that you can run on your laptop.
Dremio Blog: Various Insights
Announcing the First Iceberg Summit
Tabular and Dremio have received approval from the Apache Iceberg Project Management Committee to organize the inaugural Iceberg Summit, a free-to-attend virtual event to be held May 14–15, 2024. Iceberg Summit is an Apache Software Foundation (ASF)-sanctioned event. Those wishing to attend can register here. Your information will only be used for […]
Dremio Blog: Various Insights
The Who, What, and Why of Data Products
Dremio offers a robust platform for creating data products by simplifying data integration, providing a semantic layer for data curation, and enabling secure data sharing. Whether you're curating data for a single product or managing multiple data products, Dremio's features can streamline the process and enhance collaboration among data professionals, ultimately leading to the successful creation of valuable data products.
Dremio Blog: Various Insights
Compaction in Apache Iceberg: Fine-Tuning Your Iceberg Table’s Data Files
Learn how to optimize the data files in your Apache Iceberg table using compaction and its different strategies, including z-order.
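As a rough sketch of what compaction looks like in practice, the snippet below calls Iceberg's rewrite_data_files procedure from Spark SQL with a z-order sort. The catalog name (demo), table, and column names are placeholders, and the session is assumed to already have an Iceberg catalog configured, as in the earlier ingestion sketch.

```python
from pyspark.sql import SparkSession

# Assumes a session already configured with an Iceberg catalog named "demo";
# table and column names are placeholders.
spark = SparkSession.builder.getOrCreate()

# Rewrite small data files and cluster rows by z-ordering on two columns.
spark.sql("""
    CALL demo.system.rewrite_data_files(
        table => 'db.events',
        strategy => 'sort',
        sort_order => 'zorder(event_ts, device_id)'
    )
""")
```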
Dremio Blog: Various Insights
Puffins and Icebergs: Additional Stats for Apache Iceberg Tables
A short introduction to Puffin, a new file format in Apache Iceberg that stores additional table statistics.
Dremio Blog: Various Insights
The Life of a Read Query for Apache Iceberg Tables
What happens under the hood with Apache Iceberg when you run a read query.
Dremio Blog: Various Insights
Apache Iceberg and the Right to Be Forgotten
Time travel capabilities and privacy laws like GDPR and CCPA are at odds with each other. Here’s how to make sure you’re GDPR/CCPA-compliant while using time travel in Apache Iceberg.
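As a hedged sketch of the general pattern (not necessarily the exact steps the post walks through), the snippet below deletes a data subject's rows and then expires older snapshots so the deleted data can no longer be reached via time travel. The catalog (demo), table, column, and cutoff timestamp are placeholders.

```python
from pyspark.sql import SparkSession

# Assumes an Iceberg catalog named "demo" is configured on this session;
# table, column, and timestamp values are placeholders.
spark = SparkSession.builder.getOrCreate()

# 1. Remove the data subject's records from the current table state.
spark.sql("DELETE FROM demo.db.users WHERE user_id = 'subject-123'")

# 2. Expire snapshots older than the retention cutoff so the pre-delete file
#    versions are physically removed and unreachable through time travel.
spark.sql("""
    CALL demo.system.expire_snapshots(
        table => 'db.users',
        older_than => TIMESTAMP '2024-05-01 00:00:00',
        retain_last => 1
    )
""")
```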
Dremio Blog: Various Insights
Streaming Data into Apache Iceberg Tables Using AWS Kinesis and AWS Glue
Learn how to ingest streaming data from AWS Kinesis into Apache Iceberg tables using AWS Glue, and then query it with Dremio.
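The post covers the Glue- and Kinesis-specific setup; as a self-contained approximation of the write side only, the hedged sketch below runs a Spark Structured Streaming query that appends micro-batches into an Iceberg table, with Spark's built-in rate source standing in for the Kinesis stream and all names used as placeholders.

```python
from pyspark.sql import SparkSession

# Assumes an Iceberg catalog named "demo" is configured on this session.
spark = SparkSession.builder.getOrCreate()

# Target table for the stream; its schema matches the rate source below.
spark.sql("""
    CREATE TABLE IF NOT EXISTS demo.db.events (ts TIMESTAMP, value BIGINT)
    USING iceberg
""")

# Stand-in for the Kinesis stream: the built-in "rate" source emits
# (timestamp, value) rows at a fixed rate.
stream_df = (
    spark.readStream.format("rate")
    .option("rowsPerSecond", 10)
    .load()
    .withColumnRenamed("timestamp", "ts")
)

# Continuously append micro-batches into the Iceberg table.
query = (
    stream_df.writeStream
    .format("iceberg")
    .outputMode("append")
    .option("checkpointLocation", "/tmp/checkpoints/events")
    .toTable("demo.db.events")
)
query.awaitTermination()
```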