Gnarly Data Waves

Episode 21

|

June 13, 2023

Data as Code with Dremio Arctic: ML Experimentation & Reproducibility on the Lakehouse

In this episode of Gnarly Data Waves, we will discuss how Dremio Arctic and data as code enable data science use cases like Machine learning experimentation and reproducibility on a consistent view of your data in a no-copy architecture.

As more data consumers require access to critical customer and operational data in the data lake, data teams need solutions that enable multiple users to leverage the same view of the data for a wide range of use cases without impacting each other. In this episode of Gnarly Data Waves, we will discuss how the data as code capabilities in Dremio Arctic enable data scientists to:

    • Create a data science branch of the production branch for experimentation without creating expensive data copies or impacting production workloads

    • Easily work and collaborate cross-functionally with other data consumers and line of business experts

    • Quickly reproduce models and results by returning to previous branch states with tags and commit history

Topics Covered

Data lakehouse

Watch or listen on your favorite platform

Register to view episode

Ready to Get Started? Here Are Some Resources to Help

Webinars

It’s Time To Consider a Hybrid Lakehouse Strategy

Discover the power of the hybrid lakehouse! Join data expert David Loshin to explore how this strategy combines the scalability of data lakes with the performance of data warehouses, enabling flexibility and future-proofing your data ecosystem.

read more

Webinars

Mastering Dremio’s Well-Architected Framework: Overview & Security: Overview and Security

Learn best practices for creating sustainable data architectures that emphasize efficiency and long-term maintainability. Discover strategies for optimizing workflows while exploring core design principles like security, cost optimization, and operational excellence. Ideal for professionals looking to enhance their data architecture skills!

read more

Webinars

AI-Ready Data with Data Products

As AI adoption rises, data quality and reliability are crucial. This presentation shows how treating data as a product—with clear ownership, quality standards, and governance—ensures AI readiness. Discover practical strategies to overcome challenges like accessibility and governance, turning data into a strategic asset for AI innovation.

read more
get started

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now
demo on demand

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo
talk expert

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us

Ready to Get Started?

Enable the business to create and consume data products powered by Apache Iceberg, accelerating AI and analytics initiatives and dramatically reducing costs.