Tags : Browse Projects

Treex - NLP Framework

  Analyzed about 3 hours ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in the Perl programming language under Linux. It is primarily aimed at machine translation, making use of the ideas and technology created during the Prague Dependency Treebank project. It is also intended to significantly facilitate and accelerate the development of software solutions for many other NLP tasks, especially through the reusability of its numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces.

242K lines of code

4 current contributors

about 1 month since last commit

4 users on Open Hub

Moderate Activity
5.0
 

Ruby WordNet

  Analyzed about 6 hours ago

Ruby-WordNet is a Ruby interface to the WordNet® Lexical Database. WordNet® is an online lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.

1.42K lines of code

1 current contributor

11 months since last commit

3 users on Open Hub

Very Low Activity
0.0
 

airhead-research

  Analyzed about 20 hours ago

The S-Space Package is a collection of algorithms for building Semantic Spaces. These algorithms process text corpora and map semantic representations for words onto high-dimensional vectors. These approaches are known by many names, such as word spaces, semantic spaces, or distributed semantics. The research and development is being done by the Natural Language Processing group at UCLA led by David Jurgens and Keith Stevens, under the advisement of Dr. Michael Dyer. Our initial goal is to provide a uniform implementation for many common semantic space algorithms in order to facilitate research...

99.4K lines of code

0 current contributors

about 7 years since last commit

3 users on Open Hub

Inactive
5.0
 

Stanford CoreNLP

  Analyzed about 17 hours ago

Stanford CoreNLP provides a set of natural language analysis tools in Java. It can take raw human language text as input and give the base forms of words, their parts of speech, and whether they are names of companies, people, etc.; normalize dates, times, and numeric quantities; mark up the structure of sentences in terms of phrases and word dependencies; and indicate which noun phrases refer to the same entities. It was originally developed for English, but now also provides varying levels of support for Arabic, (mainland) Chinese, French, and German.

635K lines of code

17 current contributors

3 months since last commit

3 users on Open Hub

Moderate Activity
5.0
 
Tags nlp

gensim

  No analysis available

Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.

0 lines of code

39 current contributors

0 since last commit

3 users on Open Hub

Activity Not Available
5.0
 
Mostly written in language not available
Licenses: lgpl21_or...

MeCab

  Analyzed 4 days ago

MeCab is a fast and customizable Japanese morphological analyzer. MeCab is designed for general-purpose use and is applied to a variety of NLP tasks, such as Kana-Kanji conversion. MeCab provides parameter estimation functionality based on CRFs and HMMs.

291K lines of code

0 current contributors

11 months since last commit

3 users on Open Hub

Very Low Activity
0.0
 
Licenses: No declared licenses

matxin

  Analyzed 1 day ago

Machine translation engine based on a dependency grammar and an XML interchange format. The Spanish-Basque (es-eu) translation direction is currently supported.

3.41M lines of code

0 current contributors

almost 7 years since last commit

3 users on Open Hub

Inactive
5.0
 
Licenses: No declared licenses

ClearTK

  Analyzed about 1 hour ago

ClearTK is a toolkit developed at the Center for Computational Language and Education Research (CLEAR) at the University of Colorado at Boulder. ClearTK provides a framework for developing statistical natural language processing components in Java. It is based on the Apache UIMA framework for text analysis, and provides: a rich feature extraction library; a common interface and wrappers for popular machine learning libraries based on models such as maximum entropy, support vector machines and conditional random fields; infrastructure for creating NLP components such as sequential taggers, chunkers, syntactic parsers, semantic role labeling, temporal resolution, etc.; collection readers for commonly used corpora; wrappers for common NLP components such as the Snowball stemmer and OpenNLP sy...

770K lines of code

0 current contributors

about 1 year since last commit

3 users on Open Hub

Very Low Activity
0.0
 

Cascading

  No analysis available

Cascading is a feature-rich API for defining and executing complex, fault-tolerant data processing workflows on a Hadoop cluster.

0 lines of code

0 current contributors

0 since last commit

2 users on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: apache_2

MARF: Modular Audio Recognition Framework

  Analyzed 3 months ago

MARF is an open-source research platform and a collection of voice/sound/speech/text and natural language processing (NLP) algorithms written in Java and arranged into a modular and extensible framework that facilitates the addition of new algorithms. MARF can run in a distributed fashion over the network and may act as a library in applications or be used as a source for learning and extension.

12.5K lines of code

0 current contributors

over 8 years since last commit

2 users on Open Hub

Activity Not Available
5.0
 