Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

WEKA

  Analyzed 2 days ago

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.

780K lines of code

3 current contributors

over 2 years since last commit

38 users on Open Hub

Inactive
Rating: 3.93333
Licenses: No declared licenses

Apache Mahout

Claimed by Apache Software Foundation

Analyzed 1 day ago

Apache Mahout's goal is to build scalable machine learning libraries. By scalable we mean: scalable to reasonably large data sets. Our core algorithms for clustering, classification and batch-based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm. However, we do not restrict contributions to Hadoop-based implementations: contributions that run on a single node or on a non-Hadoop cluster are welcome as well. The core libraries are highly optimized so that non-distributed algorithms also perform well.

146K lines of code

0 current contributors

9 months since last commit

25 users on Open Hub

Very Low Activity
Rating: 3.6

YALE Open-Source Java Data Mining

  Analyzed 1 day ago

YALE (Yet Another Learning Environment) is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). YALE provides more than 400 data mining operators, a graphical user interface (GUI), an online tutorial with hands-on data mining applications, a comprehensive PDF tutorial, many visualization schemes for data sets and data mining results, and many different learning and meta-learning schemes ranging from decision tree and rule learners to neural networks, SVMs, ensemble methods, etc. YALE is implemented in Java and is available under the GPL (GNU General Public License) as well as under a developer license (OEM license) for closed-source developers.

751K lines of code

0 current contributors

about 10 years since last commit

17 users on Open Hub

Inactive
Rating: 4.25
Licenses: No declared licenses

gCube

  Analyzed about 9 hours ago

gCube is a software system specifically designed and developed to enact the building and operation of *large scale infrastructures* providing their users with a rich array of services suitable for supporting the co-creation of *Virtual Research Environments* and promoting the implementation of *open science* workflows and practices. It is at the heart of the D4Science.org infrastructure (www.d4science.org).

1.49M lines of code

15 current contributors

2 days since last commit

14 users on Open Hub

Very High Activity
Rating: 4.66667

PyMVPA

  Analyzed about 6 hours ago

Python module to ease pattern classification analyses of large datasets. It provides high-level abstraction of typical processing steps (e.g. data preparation, classification, feature selection, generalization testing), a number of implementations of some popular algorithms (e.g. kNN, Ridge Regressions, Sparse Multinomial Logistic Regression, GPR, RFE, I-RELIEF), and bindings to external ML libraries (libsvm, shogun, R). While it is not limited to neuroimaging data (e.g. fMRI), it is eminently suited for such datasets.

113K lines of code

0 current contributors

over 8 years since last commit

8 users on Open Hub

Inactive
Rating: 5.0

MonetDB

  No analysis available

MonetDB is an open-source columnar database system for high-performance applications. It comes with a feature-rich SQL interface, ready to perform analytical queries on large datasets at unusual speed.

0 lines of code

10 current contributors

0 since last commit

7 users on Open Hub

Activity Not Available
Rating: 5.0
Mostly written in: language not available
Licenses: mozilla_p...

KNIME

  Analyzed 1 day ago

KNIME [naim] is a user-friendly graphical workbench for the entire analysis process: data access, data transformation, initial investigation, powerful predictive analytics, visualisation and reporting. The open integration platform provides over 1000 modules (nodes), including those of the KNIME community and its extensive partner network.

943K lines of code

31 current contributors

7 days since last commit

7 users on Open Hub

High Activity
Rating: 5.0

Orange

  Analyzed 1 day ago

Orange is component-based data mining software. It includes a range of data visualization, exploration, preprocessing and modelling techniques. It can be used through an intuitive user interface or, for more advanced users, as a module for the Python programming language.

380K lines of code

32 current contributors

6 days since last commit

6 users on Open Hub

Moderate Activity
Rating: 4.5

Apache Taverna

Claimed by Apache Software Foundation

No analysis available

The Taverna project aims to provide a language and software tools to facilitate easy use of workflow and distributed compute technology within the eScience community.

0 lines of code

0 current contributors

0 since last commit

4 users on Open Hub

Activity Not Available
Rating: 4.6
Mostly written in: language not available
Licenses: apache_2

Apache Zeppelin

Claimed by Apache Software Foundation

Analyzed 1 day ago

A web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more.

456K lines of code

57 current contributors

6 days since last commit

4 users on Open Hub

High Activity
Rating: 0.0