Managed Projects

Apache Spark

Claimed by Apache Software Foundation Analyzed 2 days ago

Apache Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write. To run programs faster, Spark provides primitives for in-memory cluster computing: your job can load data into memory and query it repeatedly more rapidly than with ... [More] disk-based systems like Hadoop. To make programming faster, Spark offers high-level APIs in Scala, Java and Python, letting you manipulate distributed datasets like local collections. You can also use Spark interactively to query big data from the Scala or Python shells. Spark integrates closely with Hadoop to run inside Hadoop clusters and can access any existing Hadoop data source. [Less]

1.51M lines of code

374 current contributors

3 days since last commit

56 users on Open Hub

Very High Activity
5.0
 
I Use This

Apache Mesos

Claimed by Apache Software Foundation Analyzed 2 days ago

Apache Mesos abstracts CPU, memory, storage, and other compute resources away from machines (physical or virtual), enabling fault-tolerant and elastic distributed systems to easily be built and run effectively.

881K lines of code

41 current contributors

12 months since last commit

29 users on Open Hub

Very Low Activity
4.0
   
I Use This