Alex Merced is a Senior Tech Evangelist for Dremio, a developer, and a seasoned instructor with a rich professional background, having worked with companies such as GenEd Systems, Crossfield Digital, CampusGuard, and General Assembly.
Alex is a co-author of the O’Reilly book “Apache Iceberg: The Definitive Guide.” With a deep understanding of the subject matter, Alex has shared his insights as a speaker at events including Data Day Texas, OSA Con, P99Conf, and Data Council.
Driven by a profound passion for technology, Alex has been instrumental in disseminating his knowledge through various platforms. His tech content can be found in blogs, videos, and his podcasts, Datanation and Web Dev 101.
Moreover, Alex Merced has made contributions to the JavaScript and Python communities by developing a range of libraries. Notable examples include SencilloDB, CoquitoJS, and dremio-simple-query, among others.
As 2024 comes to a close, it’s clear that this year has been remarkable for the data lakehouse and the growing momentum driving its adoption. In this blog, I’ll reflect on some of the most exciting developments in the data lakehouse space, focusing on the new possibilities unlocked by tools like Apache Iceberg and Dremio. […]
Welcome to the 2024 Football Playoffs Hackathon powered by Dremio. Teams from across the globe will apply their analytics prowess to predict the playoff outcomes. Each team must analyze the current stats provided to support their selections with detailed insights. Judging criteria will include the accuracy of predictions, the quality of analysis, the clarity of visual presentation, […]
Dremio is a cutting-edge Lakehouse Platform designed to make data more accessible and actionable. With Apache Iceberg tables as first-class citizens, Dremio offers a powerful combination of data virtualization and unification capabilities. This means you can seamlessly combine data from databases, data warehouses, data lakes, and lakehouses into a single, governed platform. Dremio’s built-in semantic […]
Organizations face a common challenge: ensuring consistent and reliable data insights across multiple departments, tools, and teams. As data becomes increasingly central to decision-making, the need for a unified view—one everyone in the organization can rely on—has never been more critical. This is where a universal semantic layer comes into play. By creating a standardized […]
Modern enterprises are increasingly adopting data mesh architecture to keep up with demand for accessible, consistent data. Unlike traditional, centralized data models, data mesh prioritizes a decentralized approach, allowing individual teams to own and manage their own data domains. This structure enables organizations to achieve greater agility, faster access to data, and enhanced scalability. For […]
Organizations need rapid access to insights from their data to stay competitive. However, the complexity of managing data from diverse sources often slows down this process. Traditional methods like ETL (Extract, Transform, Load) are effective but can create delays due to data replication and movement. To overcome these challenges, data virtualization tools provide a robust […]
An Iceberg Data Lakehouse—a unified system that combines the scalability of data lakes with the analytical power of data warehouses—has emerged as a powerful solution to modern data requirements for performance, accessibility, and cost. However, what makes this architecture effective is the strategic use of metadata to optimize performance, ensure data consistency, and enhance governance. […]
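To make the metadata’s role concrete, here is a minimal PyIceberg sketch of the kind of metadata an Iceberg table carries; the catalog URI and the sales.orders table below are illustrative placeholders, not details from the post:

```python
# Inspecting Apache Iceberg table metadata with PyIceberg -- a minimal sketch.
# The REST catalog endpoint and the "sales.orders" table are hypothetical;
# swap in your own catalog settings.
from pyiceberg.catalog import load_catalog

catalog = load_catalog(
    "demo",
    **{"type": "rest", "uri": "http://localhost:8181"},  # placeholder endpoint
)
table = catalog.load_table("sales.orders")

# The metadata behind lakehouse optimization lives on the table object:
print(table.schema())            # column definitions with stable field IDs
print(table.spec())              # partition spec used to prune data files
print(table.current_snapshot())  # snapshot ID, timestamp, manifest list

# Snapshots are retained until expired, enabling time travel and audits.
for snapshot in table.metadata.snapshots:
    print(snapshot.snapshot_id, snapshot.timestamp_ms)
```

Query engines lean on exactly this schema, partition, and snapshot metadata to skip irrelevant files and serve consistent reads.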
The demand for quick, actionable insights is higher than ever. Businesses are moving beyond traditional data warehouses to adopt lakehouses and other flexible data architectures that better support real-time analytics, BI, and AI applications. Dremio is at the forefront of this shift, providing a robust, high-performance hybrid lakehouse platform that enables fast, scalable analytics in […]
The rise of data lakehouses is transforming the way organizations manage, analyze, and leverage their data. Lakehouse architecture offers a flexible, scalable solution that bridges the gap between traditional data warehouses and data lakes. Apache Iceberg, an open table format designed to deliver reliable, high-performance analytics on large datasets, is at the heart of this […]
As organizations increasingly adopt hybrid data architectures, they often face challenges in accessing and analyzing data stored across cloud and on-premises environments. Databricks’ Unity Catalog offers a unified metastore that centralizes data management for cloud-based Delta Lake tables, enabling streamlined access to cloud data. At the same time, many companies retain valuable data in on-premises […]
Organizations often have a blend of cloud and on-premises data sources, creating a need for tools that can seamlessly bridge these environments. Dremio has introduced a new connector for Polaris catalogs managed by Snowflake’s “Open Catalog” service. Designed for Iceberg tables, Polaris provides an open-source catalog solution for flexible data access and interoperability across cloud […]
Dremio has just rolled out version 25.2, and it’s bringing a feature many users have been eagerly waiting for – full dark mode across the entire platform. Whether you’re working late into the night or simply prefer the aesthetics and reduced eye strain that dark mode offers, this update brings a refreshing new way to […]
Organizations often have data distributed across cloud and on-premises environments, which poses significant integration challenges. Cloud-based platforms like Snowflake offer scalable, high-performance data warehousing capabilities, while on-premises systems like HDFS and Hive often store large volumes of legacy or sensitive data. Traditionally, analyzing data from these environments together would require complex data movement and transformation […]
Organizations are striving to build architectures that manage massive volumes of data and maximize the insights drawn from it. Traditional data architectures, however, often fail to handle the scale and complexity required. The modern answer lies in the data lakehouse, a hybrid approach combining the best aspects of data lakes and data warehouses. This blog […]
Flexibility and simplicity in managing metadata catalogs and storage solutions are key to efficient data platform management. Nessie’s REST Catalog Implementation brings this flexibility by centralizing table management across multiple environments in the cloud and on-prem, while PyIceberg provides an accessible Python implementation for interacting with Iceberg tables. In this blog, we’ll walk through setting […]
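As a taste of that setup, here is a minimal sketch of pointing PyIceberg at Nessie’s Iceberg REST endpoint; the port, branch, and warehouse bucket are assumptions for a default local Nessie rather than values from the post:

```python
# Connecting PyIceberg to Nessie's Iceberg REST catalog -- a minimal sketch.
# Assumes Nessie on its default local port (19120) exposing the Iceberg REST
# API under /iceberg, with "main" as the branch; adjust for your deployment.
from pyiceberg.catalog import load_catalog

catalog = load_catalog(
    "nessie",
    **{
        "type": "rest",
        "uri": "http://localhost:19120/iceberg/main",  # branch in the path
        "warehouse": "s3://my-warehouse/",             # hypothetical bucket
    },
)

# Browse what the catalog is tracking (namespaces must already exist).
print(catalog.list_namespaces())
print(catalog.list_tables("examples"))  # "examples" is a placeholder namespace
```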
Join the Dremio/dbt community in the dbt Slack’s #db-dremio channel to meet other Dremio-dbt users and seek support. Version control is a key aspect of modern data management, ensuring the smooth and reliable evolution of both your data and the code that generates insights from it. While code versioning has […]
The Apache Polaris (incubating) lakehouse catalog is the next step in the world of open lakehouses built on top of open community-run standards. While many other lakehouse catalogs are vendor-controlled or don’t enable full read-and-write support for Iceberg lakehouses, Polaris takes it a step further by being a community-run project integrating seamlessly with Apache Iceberg […]
Join the Dremio/dbt community in the dbt Slack’s #db-dremio channel to meet other Dremio-dbt users and seek support. The ability to transform raw data into actionable insights is critical. As organizations scale, they need efficient ways to standardize, organize, and govern data transformations. This is where dbt (data build tool) […]
Operational analytic capabilities are foundational to delivering the personalized experiences that customers expect. While first- and third-party market data are often the natural starting point, organizations are increasingly discovering that second-party data—the information exchanged securely and confidentially through partnerships and other strategic vendor relationships—is the differentiator that elevates the customer experience to new […]
Join the Dremio/dbt community in the dbt Slack’s #db-dremio channel to meet other Dremio-dbt users and seek support. Maintaining a well-structured and version-controlled semantic layer is crucial for ensuring consistent and reliable data models. With Dremio’s robust semantic layer, organizations can achieve unified, self-service access to data, making analytics more […]
Efficiency and reliability are paramount when dealing with the orchestration of data pipelines. Whether you’re managing simple tasks or complex workflows across multiple systems, orchestration tools can make or break your data strategy. This blog will explore how orchestration plays a crucial role in automating and managing data processes, using tools like CRON and Apache […]
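For a flavor of what an orchestrator adds over a bare CRON entry, here is a minimal Apache Airflow DAG sketch; the pipeline name and task bodies are placeholders:

```python
# A minimal Apache Airflow DAG: one daily pipeline with two dependent tasks.
# Task bodies are placeholders -- swap in real extract/load logic.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract():
    print("pulling new records from the source system")


def load():
    print("writing curated records to the lakehouse")


with DAG(
    dag_id="daily_lakehouse_refresh",  # hypothetical pipeline name
    start_date=datetime(2024, 1, 1),
    schedule="@daily",  # the same cadence a CRON entry would express
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)

    # Unlike CRON, dependencies, retries, and backfills are first-class here.
    extract_task >> load_task
```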
Introduction to the Hybrid Data Lakehouse: Organizations are increasingly challenged to manage, store, and analyze vast amounts of data. Traditional data architectures, while effective in the past, are no longer sufficient to meet the demands of modern data workloads, which require flexibility, scalability, and performance. This is where the concept of the hybrid data lakehouse comes into […]
In this tutorial, you’ll learn how to use Dremio’s Reflections to accelerate query performance. We’ll walk through the process of setting up a Dremio environment using Docker, connecting to sample datasets, running a complex query, and then using Reflections to significantly improve the query’s performance. Step 1: Spin Up a Dremio […]
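As a preview of the querying step, here is a minimal sketch of running SQL against a local Dremio over Arrow Flight with pyarrow; the credentials and dataset path are assumptions, and the Reflections themselves are then defined on the dataset inside Dremio:

```python
# Querying a local Dremio instance over Arrow Flight -- a minimal sketch.
# Assumes Dremio's default Flight port (32010) and a user created during
# setup; the sample dataset path below is a placeholder.
from pyarrow import flight

client = flight.FlightClient("grpc+tcp://localhost:32010")

# Dremio exchanges basic credentials for a bearer token header.
token = client.authenticate_basic_token("username", "password")
options = flight.FlightCallOptions(headers=[token])

query = 'SELECT * FROM Samples."samples.dremio.com"."NYC-taxi-trips" LIMIT 10'
info = client.get_flight_info(flight.FlightDescriptor.for_command(query), options)
reader = client.do_get(info.endpoints[0].ticket, options)
print(reader.read_all().to_pandas())
```

Running the same query before and after enabling a Reflection is a simple way to see the acceleration.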
Designing an optimal partitioning strategy for your data is often one of the most challenging aspects of building a scalable data platform. In traditional systems, data engineers frequently partition data by multiple columns, such as date and another frequently queried field. However, this can result in too many small files or partitions, which ultimately leads […]
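One way Iceberg avoids the small-file trap is hidden partitioning through transforms; here is a minimal PyIceberg sketch that partitions on day(event_ts) rather than several raw columns (the catalog settings, table name, and schema are placeholders):

```python
# Creating an Iceberg table partitioned by a transform -- a minimal sketch.
# With hidden partitioning, queries filter on event_ts directly and the
# day() transform handles file pruning; no derived partition column needed.
from pyiceberg.catalog import load_catalog
from pyiceberg.partitioning import PartitionField, PartitionSpec
from pyiceberg.schema import Schema
from pyiceberg.transforms import DayTransform
from pyiceberg.types import NestedField, StringType, TimestampType

schema = Schema(
    NestedField(1, "event_ts", TimestampType(), required=True),
    NestedField(2, "user_id", StringType(), required=False),
)

# day(event_ts) is coarse enough to avoid a flood of tiny files.
spec = PartitionSpec(
    PartitionField(source_id=1, field_id=1000,
                   transform=DayTransform(), name="event_day")
)

catalog = load_catalog("demo", **{"type": "rest", "uri": "http://localhost:8181"})
catalog.create_table("events.clicks", schema=schema, partition_spec=spec)
```

Because the spec is metadata, it can also evolve later without rewriting existing data.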
Change Data Capture (CDC) is a design pattern used in databases and data processing to track and capture data changes—such as insertions, updates, and deletions—in real time. Instead of periodically extracting entire datasets, CDC focuses on capturing only the data that has changed since the last update. This approach is crucial in modern data architectures, where […]
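To illustrate the pattern at its simplest, here is a polling-based CDC sketch using a high-water mark; the database, table, and column names are placeholders, and production systems usually read the transaction log instead so deletes are captured too:

```python
# Change Data Capture via a high-water mark -- a minimal polling sketch,
# not a log-based CDC implementation. Assumes a source table with an
# updated_at column; all names here are hypothetical.
import sqlite3
from datetime import datetime, timezone

conn = sqlite3.connect("source.db")       # placeholder source database
last_sync = "2024-01-01T00:00:00+00:00"   # watermark persisted from last run

# Pull only rows changed since the last sync, not the entire table.
rows = conn.execute(
    "SELECT id, payload, updated_at FROM orders WHERE updated_at > ?",
    (last_sync,),
).fetchall()

for row in rows:
    print("changed row:", row)  # hand off to the downstream pipeline

# Advance the watermark so the next run sees only newer changes.
last_sync = datetime.now(timezone.utc).isoformat()
```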
Enable the business to create and consume data products powered by Apache Iceberg, accelerating AI and analytics initiatives and dramatically reducing costs.