
Data Processing

This category covers frameworks for batch processing, streaming, scheduling, integration, and compute, used to build stable data pipelines.

Collected: 21

Related tools


Airbyte

Open-source ELT data integration and connector platform


Apache Airflow

Workflow orchestration system


Azkaban

Distributed workflow manager


Apache Beam

Unified batch and stream processing model


Dagster

Software-defined orchestration platform centered on data assets


dbt Core

SQL transformation and modeling framework for analytics engineering


Debezium

Open-source platform for database change data capture


Apache DolphinScheduler

Distributed visual workflow scheduler for big data platforms


Dremio

SQL query and semantic layer platform for Apache Iceberg lakehouses


Flink CDC

Apache Flink-based framework for real-time CDC data integration


Apache Flink

Unified stream and batch processing engine


AWS Glue

Serverless ETL service


Apache Kafka

Distributed streaming platform


Kestra

Event-driven open-source orchestration and automation platform


Apache NiFi

Visual dataflow automation and integration platform


Apache Oozie

Hadoop workflow scheduler


Prefect

Orchestration platform for Python dataflows and automation tasks


Presto

Distributed SQL query engine


Apache SeaTunnel

Open-source platform for large-scale data synchronization and integration


Apache Spark

Unified data processing engine


Trino

Distributed SQL query engine for lakehouse and multi-source analytics
