Java Clean Architecture Masterclass

Java Clean Architecture MasterclassNov 20-21

Join

Top Java Data Platforms 2025

GitHub Libraries Java Data Platforms

apache/spark 41K +84

added 3 months ago

Apache Spark - A unified analytics engine for large-scale data processing.

apache/kafka 30K +68

added 4 months ago

Distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.

apache/flink 24K +33

added 4 months ago

A stream processing framework with powerful stream- and batch-processing capabilities.

apache/rocketmq 21K +25

added 3 months ago

Apache RocketMQ is a cloud native messaging and streaming platform, making it simple to build event-driven applications.

alibaba/datax 16K +26

added 2 months ago

DataX is the open source data integration framework maintained by Alibaba. As a data synchronization framework, DataX abstracts the synchronization of different data sources.

apache/pulsar 14K +28

added 2 months ago

Pulsar is a distributed pub-sub messaging platform with a very flexible messaging model and an intuitive client API.

elastic/logstash 14K +17

added 2 months ago

Logstash is a server-side data processing pipeline that ingests data from a multitude of sources simultaneously, transforms it, and then sends it to your favorite "stash."

apache/seatunnel 8K +32

added 1 week ago

A high-performance, distributed data integration tool, capable of synchronizing vast amounts of data daily.

hazelcast/hazelcast 6K +9

added 4 months ago

A unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

apache/nifi 5K +32

added 4 months ago

Apache NiFi is an easy to use, powerful, and reliable system to process and distribute data.

apache/ignite 4K +4

added 4 months ago

Apache Ignite is a distributed database for high-performance computing with in-memory speed.

infinispan/infinispan 1K -1

added 4 months ago

An open source data grid platform and highly scalable NoSQL cloud data store.

apache/systemds 1K

added 3 months ago

An open source ML system for the end-to-end data science lifecycle

apache/streampipes 653 +2

added 3 months ago

A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.