Top Java Data Platforms 2025

GitHub Libraries Java Data Platforms

apache/spark 40K +87

Added by sizovs added 2 weeks ago

Apache Spark - A unified analytics engine for large-scale data processing.

apache/kafka 29K +81

Added by sizovs added 1 month ago

Distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.

apache/flink 24K +48

Added by sizovs added 1 month ago

Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities.

hazelcast/hazelcast 6K +11

Added by sizovs added 1 month ago

A unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

apache/nifi 5K +30

Added by sizovs added 1 month ago

Apache NiFi is an easy to use, powerful, and reliable system to process and distribute data.

apache/ignite 4K +8

Added by sizovs added 1 month ago

Apache Ignite is a distributed database for high-performance computing with in-memory speed.

infinispan/infinispan 1K +7

Added by sizovs added 1 month ago

An open source data grid platform and highly scalable NoSQL cloud data store.

apache/systemds 1K +1

Added by sizovs added 2 weeks ago

An open source ML system for the end-to-end data science lifecycle

apache/streampipes 629

Added by sizovs added 2 weeks ago

A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.