Gnarly Data Waves
Episode 28
|
August 8, 2023
Gnarly Data Waves: Apache Iceberg Office Hours
Get all your Apache Iceberg questions answered at Apache Iceberg office hours. Questions on architecture, migration and anything else are welcomed!
Join the Dremio developer advocacy and engineering teams for an installment of Apache Iceberg Office Hours. During this time we’ll have a brief Iceberg presentation on table format interoperability, going over the table format migration options, converters, and newer interoperability solutions like onetable and uniform. We’ll go through the capabilities, limitations, and considerations and then have lots of dedicated time for Q&A on the presented topic or any other questions or guidance you’re looking for help on in learning about Apache Iceberg or architecting your data lakehouse around Apache Iceberg.
Examples of questions you can come to ask:
-
- How can I optimize my Iceberg tables for my different use cases?
-
- What tools will best handle my ETL job to write to Iceberg?
-
- How can I control access to my Iceberg tables?
-
- How can I convert data from X into an Iceberg table?
-
- How can I get started with Iceberg in Databricks?
Topics Covered
Watch or listen on your favorite platform
Register to view episode
Speakers
Jason Hughes
Jason Hughes is the Director of Technical Advocacy at Dremio. Previously at Dremio, he’s been a Product Director, Technical Director and a Senior Solutions Architect. He’s been working in technology and data for over a decade, including roles as tech lead for the field at Dremio, the pre-sales and post-sales lead for Presto and QueryGrid for the Americas at Teradata, and leading the development, deployment, and management of a custom CRM system for multiple auto dealerships. He is passionate about making customers and individuals successful and self-sufficient. When he’s not working, he’s usually taking his dog to the dog park, playing hockey, or cooking (when he feels like it). He lives in San Diego, California.
Alex Merced
Alex Merced is a Senior Tech Evangelist for Dremio, a developer, and a seasoned instructor with a rich professional background. Having worked with companies like GenEd Systems, Crossfield Digital, CampusGuard, and General Assembly.
Alex is a co-author of the O’Reilly Book “Apache Iceberg: The Definitive Guide.” With a deep understanding of the subject matter, Alex has shared his insights as a speaker at events including Data Day Texas, OSA Con, P99Conf and Data Council.
Driven by a profound passion for technology, Alex has been instrumental in disseminating his knowledge through various platforms. His tech content can be found in blogs, videos, and his podcasts, Datanation and Web Dev 101.
Moreover, Alex Merced has made contributions to the JavaScript and Python communities by developing a range of libraries. Notable examples include SencilloDB, CoquitoJS, and dremio-simple-query, among others.
Dipankar Mazumdar
Dipankar is currently a Developer Advocate at Dremio where his primary focus is advocating data practitioners such as engineers, architects & scientists on Dremio’s lakehouse platform & various open-sourced projects such as Apache Iceberg, Arrow, etc. that helps data teams apply & scale analytics. In his past roles, he worked at the intersection of Machine Learning & Data visualization. Dipankar holds a Masters in Computer Science and his research area is Explainable AI.
Ajantha Bhat
Have 5+ years of work experience around open source table formats and file formats, PMC at Apache Carbondata. Works on Project Nessie and is also an active contributor to Apache Iceberg.
Dmitri Bourlatchkov
Coming from the background of distributed trace processing. Interest switch to graph database technologies based on Apache Cassandra while working at DataStax. Contributor to Stargate*. Currently working at Dremio. Developing Project Nessie, especially Nessie’s support for Apache Iceberg tables.
Scott Cowell
No Bio Available
Ready to Get Started? Here Are Some Resources to Help
Webinars
Data Disruptors Podcast: Whoop’s Carlos Peralta on Building a Data-Driven Culture at Whoop and Moderna
read moreWebinars
Cyber Lakehouse for the AI Era, ZTA and Beyond
Many agencies today are struggling not only with managing the scale and complexity of cyber data but also with extracting actionable insights from that data. With new data retention regulations, such as M-21-31, compounding this problem further, agencies need a next-generation solution to address these challenges.
read more