Alation
AI-powered data catalog
Built for data leaders, technical teams, and people evaluating enterprise AI adoption. It curates tools and writing across AI engineering, big data, data governance, enterprise agents, data platforms, and knowledge graphs. This is not a generic tool directory. It is the public knowledge entry point for BigDataFlowing.
107 AI and data tools curated across AI engineering, governance, analytics, processing, and storage.
The main site builds public trust through a connected chain of profile, tool map, selected writing, case context, and lab work.
Understand Dugufeng’s career transition, graduate exam journey, data governance practice, and AI engineering direction.
Browse AI and data tools by AI engineering, governance, analytics, processing, and storage.
Read structured views on data governance, AI for Data, Data for AI, and enterprise AI agent engineering.
Place data middle platforms, metadata, lineage, quality, knowledge bases, and agents back into enterprise scenarios.
Track explorations around CourseMotion AI, governance agents, enterprise semantic layers, and AI engineering evaluation.
A curated map connecting open-source projects, commercial platforms, and core enterprise data capabilities through AI engineering and data governance.
LLM apps, agents, RAG, local models, MCP, evaluation, and enterprise AI orchestration.
Metadata, data catalogs, lineage, quality, standards, and data asset management.
BI, data exploration, reporting, visualization, and analytics platforms.
Batch, streaming, scheduling, integration, and compute frameworks.
Data lakes, object storage, distributed storage, and lakehouse foundations.
The original data navigation foundation remains visible, with DataHub, OpenMetadata, Atlas, Great Expectations, and other metadata, lineage, quality, and data asset tools.
AI-powered data catalog
Open-source data catalog for discovery and metadata search
Data catalog and governance collaboration platform for modern data teams
Metadata management and governance framework
Enterprise data quality monitoring and observability platform
Enterprise data governance platform
Hadoop, Spark, Flink, Kafka, Iceberg, MinIO, and related projects remain part of the BigDataFlowing core. AI engineering extends from these data foundations.
Open-source ELT data integration and connector platform
Workflow orchestration system
Distributed workflow manager
Unified batch and stream processing model
Software-defined orchestration platform centered on data assets
SQL transformation and modeling framework for analytics engineering
Dify, Ollama, OpenClaw, Hermes, and related tools are framed through enterprise AI applications, agents, RAG, and data governance context.
Desktop and server tool for team knowledge bases, RAG, and AI agents
Open-source observability platform for LLM, RAG, and ML systems
Microsoft open-source framework for building collaborative multi-agent systems
Platform for AI model serving and inference deployment
Embedding database for AI-native applications
Multi-agent orchestration framework for role-based task collaboration