Tags : Browse Projects

veles

  Analyzed about 3 hours ago

Distributed machine learning platform for rapid deep learning application development. Consists of: Platform (https://github.com/Samsung/veles); Znicz Plugin (neural network engine); Mastodon (Veles Java bridge for Hadoop etc.); SoundFeatureExtraction (audio feature extraction library). Written in Python, uses OpenCL or CUDA, employs Flow-Based Programming, released under Apache 2.0. 1. Deploy VELES on Notebook or Cluster with a single command. 2. Create the model from >250 optimized units. 3. Analyze and serve the dataset on the go using Loaders. 4. Train it on PC or High Performance Cluster. Interactively monitor the training process. 5. Publish the results. 6. Automatically extract the trained model as an application. 7. Run it in the cloud.

68.8K lines of code

0 current contributors

about 2 years since last commit

0 users on Open Hub

Inactive
0.0
 

Apache SystemML

Claimed by Apache Software Foundation. Analyzed about 16 hours ago

Declarative large-scale machine learning (ML) that aims at flexible specification of ML algorithms and automatic generation of hybrid runtime plans ranging from single-node, in-memory computations to distributed computations on Apache Hadoop and Apache Spark. ML algorithms are expressed in an R-like or Python-like syntax that includes linear algebra primitives, statistical functions, and ML-specific constructs. This high-level language significantly increases the productivity of data scientists as it provides (1) full flexibility in expressing custom analytics, and (2) data independence from the underlying input formats and physical data representations. Automatic optimization according to data and cluster characteristics ensures both efficiency and scalability.

2.05M lines of code

7 current contributors

5 days since last commit

0 users on Open Hub

Moderate Activity
0.0
 

rdc.etl

  Analyzed about 9 hours ago

Extract, Transform, Load (ETL) toolkit (Python).

4.68K lines of code

0 current contributors

almost 12 years since last commit

0 users on Open Hub

Inactive
5.0
 
Licenses: No declared licenses

BigInsights-on-Apache-Hadoop

  Analyzed 1 day ago

Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix.

7.92K lines of code

0 current contributors

about 8 years since last commit

0 users on Open Hub

Inactive
0.0
 

movie-recommender-demo

  Analyzed about 13 hours ago

This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of Jupyter notebooks that you can run on IBM Data Science Experience, and there is a live demo of a movie recommendation web application you can interact with. The demo also uses IBM Message Hub (Kafka) to push application events to a topic, where they are consumed by a Spark Streaming job running on IBM BigInsights (Hadoop).

2.51K lines of code

0 current contributors

over 3 years since last commit

0 users on Open Hub

Inactive
0.0
 

Apache Fluo

  Analyzed about 19 hours ago

Apache Fluo (incubating) is an open source implementation of Percolator (which populates Google's search index) for Apache Accumulo. Fluo makes it possible to update the results of a large-scale computation, index, or analytic as new data is discovered.

32K lines of code

5 current contributors

2 months since last commit

0 users on Open Hub

Very Low Activity
0.0
 

kylo

  Analyzed about 20 hours ago

Kylo is a data lake management software platform and framework for enabling scalable, enterprise-class data lakes on Apache Hadoop and Spark. Kylo is licensed under Apache 2.0 and contributed by Think Big, a Teradata company.

788K lines of code

12 current contributors

almost 7 years since last commit

0 users on Open Hub

Inactive
0.0
 