Tags : Browse Projects

elasticsearch-hadoop

  Analyzed about 11 hours ago

Elasticsearch real-time search and analytics natively integrated with Hadoop

54.5K lines of code

18 current contributors

14 days since last commit

0 users on Open Hub

Low Activity
0.0
 
haduzilla

  Analyzed about 11 hours ago

Automated Installation CD for Hadoop Cluster

446 lines of code

0 current contributors

over 11 years since last commit

0 users on Open Hub

Inactive
0.0
 
hadoop-monitor

  Analyzed about 2 hours ago

Monitoring tool for Hadoop

73 lines of code

0 current contributors

over 15 years since last commit

0 users on Open Hub

Inactive
0.0
 
vagrant-hadoop-examples

  Analyzed about 12 hours ago

Some Vagrant examples for different Hadoop cluster deployment models

519 lines of code

0 current contributors

about 9 years since last commit

0 users on Open Hub

Inactive
0.0
 
oryxproject

  Analyzed 1 day ago

Oryx 2 (incubating): Lambda architecture on Spark for real-time large scale machine learning

131K lines of code

1 current contributor

about 4 years since last commit

0 users on Open Hub

Inactive
5.0
 
Avenir

  Analyzed about 21 hours ago

Set of machine learning tools based on Hadoop and Storm

40.8K lines of code

1 current contributor

about 3 years since last commit

0 users on Open Hub

Inactive
0.0
 
Licenses: No declared licenses

Apache Ranger

  Analyzed about 15 hours ago

Ranger is a framework to enable, monitor and manage comprehensive data security across the Hadoop platform. The vision with Ranger is to provide comprehensive security across the Apache Hadoop ecosystem. With the advent of Apache YARN, the Hadoop platform can now support a true data lake architecture. Enterprises can potentially run multiple workloads in a multi-tenant environment. Data security within Hadoop needs to evolve to support multiple use cases for data access, while also providing a framework for central administration of security policies and monitoring of user access.

422K lines of code

26 current contributors

about 16 hours since last commit

0 users on Open Hub

Moderate Activity
0.0
 
Apache Atlas

  Analyzed 1 day ago

Atlas is a scalable and extensible set of core foundational governance services that enable enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allow integration with the whole enterprise data ecosystem.

137K lines of code

0 current contributors

about 8 years since last commit

0 users on Open Hub

Inactive
0.0
 
LF AI & Data Foundation

Claimed by The Linux Foundation

  Analyzed 1 day ago

The LF AI & Data Foundation supports open source projects within artificial intelligence, machine learning, deep learning and the data space. You can think of us as a greenhouse growing and sustaining open source AI, ML, DL and Data projects from seed to fruition. The LF AI & Data Foundation provides the support to projects for open development to occur among a diverse and thriving community, in addition to a number of enabling services that include membership and funding management, ecosystem development, legal support, PR/marketing/communication, events support, and compliance scans.

575K lines of code

1 current contributor

9 days since last commit

0 users on Open Hub

Moderate Activity
0.0
 
Licenses: No declared licenses

xgboost

  Analyzed about 2 hours ago

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

164K lines of code

78 current contributors

2 days since last commit

0 users on Open Hub

High Activity
0.0
 