Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

MOA - Massive Online Analysis

Compare

  Analyzed 1 day ago

A framework for learning from a continuous supply of examples, a data stream. Includes classification and clustering methods. Related to the WEKA project, also written in Java, while scaling to more demanding problems.

131K lines of code

13 current contributors

about 2 months since last commit

9 users on Open Hub

Very Low Activity
0.0
 
I Use This

Apache Storm

Compare

Claimed by Apache Software Foundation Analyzed 1 day ago

Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language. Storm is fast: a benchmark ... [More] clocked it at over a million tuples processed per second per node. It is scalable, fault-tolerant, guarantees your data will be processed, and is easy to set up and operate. Storm integrates with the queueing and database technologies you already use. A Storm topology consumes streams of data and processes those streams in arbitrarily complex ways, repartitioning the streams between each stage of the computation however needed. [Less]

353K lines of code

45 current contributors

6 days since last commit

6 users on Open Hub

Moderate Activity
5.0
 
I Use This

SeerDataCruncher

Compare

  Analyzed about 17 hours ago

SeerDataCruncher is a Data Quality Firewall, Data Quality Monitor and ETL middleware to manage data streams on the fly.

84.4K lines of code

3 current contributors

about 1 month since last commit

1 users on Open Hub

Very Low Activity
5.0
 
I Use This