Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Apache Samza

Compare

Claimed by Apache Software Foundation Analyzed about 20 hours ago

Apache Samza is a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.

171K lines of code

0 current contributors

5 months since last commit

1 users on Open Hub

Very Low Activity
5.0
 
I Use This

cm_api

Compare

  Analyzed 1 day ago

Cloudera Manager API Client

25.5K lines of code

0 current contributors

over 8 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This

WebMapReduce

Compare

  Analyzed 1 day ago

WebMapReduce is a simple web-based user interface for creating and submitting Hadoop Map-Reduce jobs in practically any language. It is ideally suited for use in the introductory computer science classroom, requiring very little programming experience to write massively parallel programs.

76.1K lines of code

0 current contributors

about 13 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This

Apache PredictionIO

Compare

  No analysis available

Apache PredictionIO is an open source machine learning server. It enables developers and data engineers to build smarter web and mobile applications through a simple set of APIs. Admin UI is provided for developers to select and tune algorithms. Some benefits of using Apache PredictionIO: - ... [More] create predictive features quickly with built-in algorithms. - build your own ML algorithms on top of a state-of-the-art infrastructure. - find the best algorithm for your application. - handle big data well - PredictionIO is very scalable. - serve real-time prediction queries through robust APIs and SDKs. [Less]

0 lines of code

11 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available
5.0
 
I Use This
Mostly written in language not available
Licenses: apache_2

hadoop-lzo

Compare

  Analyzed about 19 hours ago

Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20

35.1K lines of code

0 current contributors

over 13 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Tags hadoop

ansible-hadoop-cdh3

Compare

  Analyzed 1 day ago

Deploying Hadoop CDH3 Clusters using Ansible

204 lines of code

0 current contributors

about 12 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This

Oozie

Compare

  Analyzed about 12 hours ago

Oozie is a workflow scheduler system to manage Apache Hadoop jobs. Oozie Workflow jobs are Directed Acyclical Graphs (DAGs) of actions. Oozie Coordinator jobs are recurrent Oozie Workflow jobs triggered by time (frequency) and data availabilty. Oozie is integrated with the rest of the ... [More] Hadoop stack supporting several types of Hadoop jobs out of the box (such as Java map-reduce, Streaming map-reduce, Pig, Hive, Sqoop and Distcp) as well as system specific jobs (such as Java programs and shell scripts). Oozie is a scalable, reliable and extensible system. [Less]

221K lines of code

0 current contributors

8 months since last commit

1 users on Open Hub

Very Low Activity
3.0
   
I Use This

serialization

Compare

  Analyzed 1 day ago

Serialization library for the Dwarf framework

9.89K lines of code

0 current contributors

almost 13 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This

action-access

Compare

  Analyzed about 14 hours ago

Access library for the action-core

962 lines of code

0 current contributors

almost 14 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This

meteo

Compare

  Analyzed 1 day ago

Realtime Analytics

8.03K lines of code

0 current contributors

over 12 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This