Clean Architecture Masterclass

Clean Architecture MasterclassMay 28-29

Join

Top Java Data Platforms 2025

GitHub Libraries Java Data Platforms

apache/spark 40K +67

added 2 months ago

Apache Spark - A unified analytics engine for large-scale data processing.

apache/kafka 29K +101

added 2 months ago

Distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.

apache/flink 24K +62

added 2 months ago

Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities.

apache/rocketmq 21K +36

added 1 month ago

Apache RocketMQ is a cloud native messaging and streaming platform, making it simple to build event-driven applications.

alibaba/datax 16K +38

added 2 weeks ago

DataX is the open source data integration framework maintained by Alibaba. As a data synchronization framework, DataX abstracts the synchronization of different data sources.

apache/pulsar 14K +31

added 1 month ago

Pulsar is a distributed pub-sub messaging platform with a very flexible messaging model and an intuitive client API.

elastic/logstash 14K +23

added 1 month ago

Logstash is a server-side data processing pipeline that ingests data from a multitude of sources simultaneously, transforms it, and then sends it to your favorite "stash."

hazelcast/hazelcast 6K +5

added 2 months ago

A unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

apache/nifi 5K +29

added 2 months ago

Apache NiFi is an easy to use, powerful, and reliable system to process and distribute data.

apache/ignite 4K +1

added 2 months ago

Apache Ignite is a distributed database for high-performance computing with in-memory speed.

infinispan/infinispan 1K +4

added 2 months ago

An open source data grid platform and highly scalable NoSQL cloud data store.

apache/systemds 1K +3

added 2 months ago

An open source ML system for the end-to-end data science lifecycle

apache/streampipes 638 -1

added 2 months ago

A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.