Projects tagged ‘clustercomputing’

Apache Spark

Claimed by Apache Software Foundation Analyzed 1 day ago

Apache Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write. To run programs faster, Spark provides primitives for in-memory cluster computing: your job can load data into memory and query it repeatedly more rapidly than with ... [More]

1.59M lines of code

374 current contributors

1 day since last commit

55 users on Open Hub

Very High Activity

0 Reviews

I Use This

Mostly written in Scala

Licenses: apache_2

Apache Hive

Claimed by Apache Software Foundation No analysis available

Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc querying and analysis of large datasets data stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called ... [More]

0 lines of code

0 current contributors

0 since last commit

22 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: apache_2

Tags apache bigdata cluster clustercomputing distributed_computing hadoop hdfs java mapreduce orc spark sql 4 more...

Apache Storm

A

Claimed by Apache Software Foundation Analyzed about 10 hours ago

Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language. Storm is fast: a benchmark ... [More]

346K lines of code

45 current contributors

1 day since last commit

6 users on Open Hub

Moderate Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags bigdata cloud cluster clustercomputing datastreams distributed distributed_computing distributedsystem ec2 fault_tolerant java json 8 more...

Apache Airavata

Claimed by Apache Software Foundation Analyzed about 19 hours ago

Apache Airavata is a software toolkit currently used to build science gateways but that has a much wider potential use. It provides features to compose, manage, execute, and monitor small to large scale applications and workflows on computational resources ranging from local clusters to national ... [More]

1.74M lines of code

15 current contributors

18 days since last commit

4 users on Open Hub

Moderate Activity

0 Reviews

I Use This

Mostly written in JavaScript

Licenses: apache_2

Tags apache bigdata clustercomputing distributed_computing highperformancecomputing workflow

TORQUE Resource Manager

Analyzed about 4 hours ago

TORQUE is an open source resource manager providing control over batch jobs and distributed compute nodes. It is a community effort based on the original *PBS project and, with more than 1,200 patches, has incorporated significant advances in the areas of scalability, fault tolerance, and feature ... [More]

334K lines of code

1 current contributors

over 5 years since last commit

3 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in C

Licenses: openpbs

Tags c cluster clustercomputing distributed_computing grid hpc jobscheduling linux mpi resourcemanagement unix

GridGain

G

Analyzed 1 day ago

GridGain is an open-source Java-based grid computing platform that is changing the world of grid computing in the same way as JBoss and Spring Framework reshaped J2EE market.

1.81M lines of code

155 current contributors

10 days since last commit

3 users on Open Hub

Moderate Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2, lgpl

Tags clustercomputing distributed_computing distributedcomputing

ROOT-Sim

Analyzed 1 day ago

The ROme OpTimistic Simulator: Multithreaded Parallel Discrete Event Simulator

5.56K lines of code

2 current contributors

7 months since last commit

2 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in C

Licenses: gpl3

Tags asynchronous cloudcomputing clustercomputing event-driven high_performance_computing hpc Middleware mpi parallelcomputing pdes platform simulation 2 more...

GC3Pie

G

Analyzed 1 day ago

gc3pie is a suite of Python classes (and command-line tools built upon them) to aid in submitting and controlling batch jobs to clusters and grid resources seamlessly. gc3pie aims at providing the building blocks by which Python scripts that combine several applications in a dynamic workflow can be ... [More]

83.6K lines of code

5 current contributors

over 4 years since last commit

2 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Python

Licenses: lgpl3

Tags clustercomputing development framework gamess grid gridcomputing python rosetta

Crossdata

C

Analyzed 1 day ago

Easy access to big things. Library for Apache Spark extending and improving its capabilities

30.7K lines of code

2 current contributors

about 6 years since last commit

2 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Scala

Licenses: apache_2

Tags akka bigdata cluster clustercomputing distributed_computing scala spark sql streaming streamingdata

gridscale

G

Analyzed about 10 hours ago

GridScale allows to access remote job and storage services and to manage files / jobs life cycle. It supporst EGI Grid, PBS / SGE clusters, SSH, HTTP, local filesystem...

3.17K lines of code

2 current contributors

about 1 month since last commit

2 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in Scala

Licenses: No declared licenses

Tags batch clustercomputing distributed gridcomputing

Tags : Browse Projects