Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Apache Spark

Claimed by Apache Software Foundation Analyzed 1 day ago

Apache Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write. To run programs faster, Spark provides primitives for in-memory cluster computing: your job can load data into memory and query it repeatedly more rapidly than with ... [More]

1.58M lines of code

374 current contributors

2 days since last commit

56 users on Open Hub

Very High Activity

0 Reviews

I Use This

Mostly written in Scala

Licenses: apache_2

Apache Fluo

Analyzed about 6 hours ago

Apache Fluo (incubating) is an open source implementation of Percolator (which populates Google's search index) for Apache Accumulo. Fluo makes it possible to update the results of a large-scale computation, index, or analytic as new data is discovered.

32K lines of code

5 current contributors

about 2 months since last commit

0 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags accumulo analytics apache-software-foundation bigdata bigtable cluster clustercomputing fault_tolerant graph_computing hadoop highthroughput incremental 6 more...