Projects tagged ‘data_analysis’

WEKA

Analyzed about 2 hours ago

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is ... [More]

780K lines of code

3 current contributors

almost 4 years since last commit

38 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: No declared licenses

YALE Open-Source Java Data Mining

Y

Analyzed about 14 hours ago

YALE (Yet Another Learning Environment) is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). YALE provides more than 400 data mining operators ... [More]

751K lines of code

0 current contributors

over 11 years since last commit

17 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: No declared licenses

Tags analysis data data_analysis data_mining development education framework intelligent_data_analysis java java_data_mining kdd knowledge_discovery 5 more...

gCube

Analyzed 6 months ago

gCube is a software system specifically designed and developed to enact the building and operation of *large scale infrastructures* providing their users with a rich array of services suitable for supporting the co-creation of *Virtual Research Environments* and promoting the implementation of *open ... [More]

1.53M lines of code

15 current contributors

6 months since last commit

14 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Java

Licenses: EUPL

Tags algorithms analysis batch_processing biodiversity_informatics data_access data_analysis data_cleansing data_infrastructure data_mining data_processing distributed_computing distributed_storage 8 more...

OpenRefine

Analyzed about 23 hours ago

OpenRefine is a free, open source power tool for working with messy data and improving it

173K lines of code

45 current contributors

14 days since last commit

4 users on Open Hub

Moderate Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: googleBSD

Tags business_intelligence cleaner cleaning data data_analysis data_cleansing data_formats data_integration deduplication freebase java javascript 2 more...

SimMetrics

S

Analyzed about 6 hours ago

SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/01.

5.76K lines of code

0 current contributors

over 19 years since last commit

2 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: No declared licenses

Tags algorithms analysis artificial_intelligence computational_linguistics data data_analysis disimilarity fuzzy intelligent_data_analysis java natural_language natural_language_processing 9 more...

Java Data Mining Package (JDMP)

Analyzed about 12 hours ago

The Java Data Mining Package (JDMP) is an open source Java library for data analysis and machine learning. It facilitates the access to data sources and machine learning algorithms (e.g. clustering, regression, classification, graphical models, optimization) and provides visualization modules. It ... [More]

40.7K lines of code

0 current contributors

almost 11 years since last commit

2 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: lgpl3

Tags algorithms analysis artificial_intelligence classification classifiers clustering data data_analysis data_mining distributed intelligent_data_analysis java 11 more...

MyMediaLite

Analyzed about 10 hours ago

MyMediaLite is a recommender system algorithm library. It provides methods for two common tasks in recommender systems/collaborative filtering: rating prediction and item prediction from implicit feedback. MyMediaLite also contains command-line programs that let you use much of the library's functionality without having to program.

183K lines of code

2 current contributors

about 6 years since last commit

1 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in C#

Licenses: gpl3_or_l...

Tags artificial_intelligence cf collaborative_filtering command_line csharp data_analysis data_mining dotnet evaluation ironpython iron_ruby library 12 more...

dishevelled

No analysis available

dishevelled.org hosts Free and Open Source libraries for various user interface components and supporting code, with emphasis on views and editors for complex data structures, like collections, sets, lists, maps, graphs, and matrices.

0 lines of code

1 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: lgpl3

Tags analysis bioinformatics biojava biological_networks biology cytoscape data_analysis data_structures data_visualization genomics graphs visualization

SOOT

S

Analyzed 2 days ago

The Perl wrapper for CERN's ROOT library, a comprehensive data analysis framework. SOOT is very similar to the Ruby-ROOT or PyROOT extensions for their respective languages. Specifically, the first revision of SOOT was implemented after the model of Ruby-ROOT. SOOT uses a very dynamic approach ... [More]

115K lines of code

0 current contributors

almost 10 years since last commit

1 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Perl

Licenses: gpl

Tags c++ cern data_analysis graphing gui interface introspection perl perl5 perl-module perlxs root

ELKI

Analyzed about 20 hours ago

ELKI: "Environment for Developing KDD-Applications Supported by Index-Structures" is a development framework for data mining algorithms written in Java. It includes a large variety of popular data mining algorithms, distance functions and index structures. Its focus is particularly on clustering ... [More]

1.29K lines of code

2 current contributors

2 months since last commit

1 users on Open Hub

Low Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: AGPL3_or_...

Tags algorithms analysis api clustering data data_analysis data_mining datamining dataminingframework java kdd knowledge_discovery 8 more...

Tags : Browse Projects