Data Mastery Hub: Term Resource for Data Professionals

Whether you're a newcomer to the world of big data and data lakes or an experienced pro looking to expand your knowledge, the Dremio Wiki provides insights and guidance for all your data-related needs. Dive in and unlock the power of your data today!

Data Storage

Analytical Data Store

Analytical Data Store is a data storage architecture that enables optimized data processing and analytics.

Data Management

Analytics Engine

Analytics Engine is a powerful tool that enables businesses to process and analyze large volumes of data efficiently and effectively.

Data Analysis

Anomaly Detection

Anomaly Detection is the process of identifying patterns or data points that deviate significantly from the norm, indicating unusual behavior or events.

Data Security

Anonymization

Anonymization is the process of removing or altering identifying information from data to protect privacy and ensure compliance.

Data Management

Anti-Corruption Layer

Explore the concept of Anti-Corruption Layer, its advantages in the data processing landscape, and its role in data lakehouse environments.

Apache

Apache Accumulo

Apache Accumulo is a distributed database written in Java that stores structured and unstructured data and provides fine-grained access control.

Apache

Apache ActiveMQ

Apache ActiveMQ is an open-source message broker that facilitates the exchange of data between different applications, systems, and services.

Apache

Apache Airflow

Apache Airflow is an open-source platform for creating, scheduling, and monitoring data pipelines. It provides scalable and reliable data processing and analytics automation.

Apache

Apache Ambari

Apache Ambari is an administration tool that helps manage, monitor, and deploy Apache Hadoop clusters.

Data Storage

Apache Arrow

Apache Arrow is an in-memory data format that enables efficient and high-performance data processing and analytics.

Apache

Apache Atlas

Apache Atlas is a data governance and metadata framework that simplifies the process of data discovery, classification, and analysis.

Apache

Apache Avro

Apache Avro is a data serialization system designed to help with data processing and analytics by defining data structures and allowing data to be passed between programming languages.

Apache

Apache Beam

Apache Beam is an open-source platform for processing big data that provides a unified programming model and can run on different execution engines.

Apache

Apache Bigtop

Apache Bigtop is an open-source toolset for building and managing big data platforms. It simplifies data processing and analytics workloads and improves productivity.

Apache

Apache BookKeeper

Apache BookKeeper is a reliable data storage system that is optimized for streaming and big data processing and enables distributed coordination and efficient data processing.

1 2 3 4 60 61 62 63
No Wikis Found
Topics
get started

Get Started Free

No time limit - totally free - just the way you like it.

Sign Up Now
demo on demand

See Dremio in Action

Not ready to get started today? See the platform in action.

Watch Demo
talk expert

Talk to an Expert

Not sure where to start? Get your questions answered fast.

Contact Us

Ready to Get Started?

Enable the business to create and consume data products powered by Apache Iceberg, accelerating AI and analytics initiatives and dramatically reducing costs.