Data Mastery Hub: Term Resource for Data Professionals

Whether you're a newcomer to the world of big data and data lakes or an experienced pro looking to expand your knowledge, the Dremio Wiki provides insights and guidance for all your data-related needs. Dive in and unlock the power of your data today!

Apache

Apache ServiceMix

Apache ServiceMix is an open-source integration container that provides a lightweight and flexible integration framework. It is built on top of Apache Karaf and Apache Camel

Apache

Apache Slider

Apache Slider is a Hadoop YARN application to deploy existing distributed and containerized applications seamlessly to YARN without any modifications

Apache

Apache Solr

Apache Solr is a fast and reliable search engine platform that offers a wide range of features like faceted search, hit highlighting, and more.

Apache

Apache Spark

Apache Spark is an open-source distributed computing system that can handle large amounts of data processing tasks. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Apache

Apache Sqoop

Apache Sqoop is a tool used in big data processing and analytics. It helps transfer bulk data between data storages and Hadoop ecosystems.

Apache

Apache Storm

Apache Storm is a distributed real-time big data processing system that allows for rapid data processing and analysis.

Apache

Apache Submarine

Apache Submarine is a distributed data processing and analytics platform that allows businesses to unify streaming and batch data processing.

Apache

Apache Tez

Apache Tez is a data processing framework that enables efficient execution of complex DAG tasks

Apache

Apache Thrift

Apache Thrift is a scalable cross-language framework for building efficient and reliable communication protocols between different systems.

Apache

Apache Tika

Apache Tika is a content detection and analysis framework designed to help businesses process and analyze large amounts of data.

Apache

Apache Tomcat

Apache Tomcat is an open-source web server and servlet container that provides a Java-based environment for running web applications.

Apache

Apache Unomi

Apache Unomi is a customer data platform designed to help businesses gather, process, and analyze customer data to improve business performance and customer engagement.

Apache

Apache Whirr

Apache Whirr is an open-source tool that simplifies the deployment of distributed applications to cloud environments without vendor lock-in.

Apache

Apache XTable

A comprehensive guide on Apache XTable: its history, functionality, benefits, use-cases, integration with a data lakehouse, and comparison with Dremio's technologies.

Apache

Apache YARN

Apache YARN is a resource manager that enables distributed data processing and supports Hadoop's processing framework

1 2 3 4 5 6 7 8 60 61 62 63
No Wikis Found
Topics
get started

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now
demo on demand

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo
talk expert

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us

Ready to Get Started?

Enable the business to create and consume data products powered by Apache Iceberg, accelerating AI and analytics initiatives and dramatically reducing costs.