Projects tagged ‘data_mining’

WEKA

Analyzed about 8 hours ago

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is ... [More]

780K lines of code

3 current contributors

almost 4 years since last commit

38 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: No declared licenses

Apache Mahout

Claimed by Apache Software Foundation Analyzed about 9 hours ago

Apache Mahout's goal is to build scalable machine learning libraries. With scalable we mean: Scalable to reasonably large data sets. Our core algorithms for clustering, classfication and batch based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm. ... [More]

146K lines of code

0 current contributors

about 12 years since last commit

25 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags algorithms classifiers clustering collaborative_filtering data_mining datamining dimension_reduction distributed distributed_computing hadoop java library 5 more...

YALE Open-Source Java Data Mining

Y

Analyzed 2 days ago

YALE (Yet Another Learning Environment) is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). YALE provides more than 400 data mining operators ... [More]

751K lines of code

0 current contributors

over 11 years since last commit

17 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: No declared licenses

Tags analysis data data_analysis data_mining development education framework intelligent_data_analysis java java_data_mining kdd knowledge_discovery 5 more...

gCube

Analyzed 6 months ago

gCube is a software system specifically designed and developed to enact the building and operation of *large scale infrastructures* providing their users with a rich array of services suitable for supporting the co-creation of *Virtual Research Environments* and promoting the implementation of *open ... [More]

1.53M lines of code

15 current contributors

6 months since last commit

14 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Java

Licenses: EUPL

Tags algorithms analysis batch_processing biodiversity_informatics data_access data_analysis data_cleansing data_infrastructure data_mining data_processing distributed_computing distributed_storage 8 more...

PyMVPA

Analyzed about 24 hours ago

Python module to ease pattern classification analyses of large datasets. It provides high-level abstraction of typical processing steps (e.g. data preparation, classification, feature selection, generalization testing), a number of implementations of some popular algorithms (e.g. kNN, Ridge ... [More]

113K lines of code

0 current contributors

over 10 years since last commit

8 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Python

Licenses: mit

Tags analysis classifiers data_mining development education framework library machine_learning neuroscience pattern_recognition python research 1 more...

MonetDB

No analysis available

MonetDB is a open-source columnar database system for high-performance applications. It comes with a feature rich SQL interface, ready to perform analytical queries on large datasets with an unusual speed.

0 lines of code

10 current contributors

0 since last commit

7 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: mozilla_p...

Tags database data_mining gis jaql json odbc olap rdbms sql

KNIME

Analyzed about 1 year ago

KNIME [naim] is a user-friendly graphical workbench for the entire analysis process: data access, data transformation, initial investigation, powerful predictive analytics, visualisation and reporting. The open integration platform provides over 1000 modules (nodes), including those of the KNIME ... [More]

958K lines of code

31 current contributors

about 1 year since last commit

7 users on Open Hub

Activity Not Available

2 Reviews

I Use This

Mostly written in Java

Licenses: gpl3

Tags business_intelligence data_mining data_science data_visualization interactive java machine_learning predictive_analytics workbench

Orange

Analyzed about 6 hours ago

Orange is a component-based data mining software. It includes a range of data visualization, exploration, preprocessing and modelling techniques. It can be used through a nice and intuitive user interface or, for more advanced users, as a module for Python programming language.

384K lines of code

32 current contributors

24 days since last commit

6 users on Open Hub

Moderate Activity

1 Review

I Use This

Mostly written in Python

Licenses: BSD-2-Clause

Tags analytics business_intelligence data_mining data_science data_visualization extensible interactive linux machine_learning predictive_analytics python statistics 3 more...

Apache Taverna

Claimed by Apache Software Foundation No analysis available

The Taverna project aims to provide a language and software tools to facilitate easy use of workflow and distributed compute technology within the eScience community.

0 lines of code

0 current contributors

0 since last commit

4 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: apache_2

Tags bioinformatics data_mining escience numerical_simulations provenance reproducibility research rest scientific_computing web_interface webservices workbench 3 more...

Apache Zeppelin

Claimed by Apache Software Foundation Analyzed 1 day ago

A web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more.

482K lines of code

57 current contributors

2 days since last commit

4 users on Open Hub

Moderate Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags analytics big_data data_analytics data_mining data_visualization interactive notebook scala visualization

Tags : Browse Projects