Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Natural Language Toolkit (NLTK)

Compare

  Analyzed 3 days ago

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.

235K lines of code

42 current contributors

15 days since last commit

45 users on Open Hub

Moderate Activity
5.0
 
I Use This

Apertium

Compare

  Analyzed about 10 hours ago

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More] translation system for a given language pair and 3. linguistic data for a growing number of language pairs. Apertium uses a shallow-transfer machine translation engine which processes the input text in stages, as in an assembly line: de-formatting, morphological analysis, part-of-speech disambiguation, shallow structural transfer, lexical transfer, morphological generation, and re-formatting. [Less]

94.7K lines of code

0 current contributors

20 days since last commit

13 users on Open Hub

Moderate Activity
4.9
   
I Use This
Licenses: GNU_Free_..., gpl, gpl3_or_l...

Apache OpenNLP

Compare

Claimed by Apache Software Foundation Analyzed 1 day ago

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

160K lines of code

8 current contributors

8 days since last commit

12 users on Open Hub

Moderate Activity
5.0
 
I Use This

Treex - NLP Framework

Compare

  Analyzed 3 days ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to ... [More] significantly facilitate and accelerate development of software solutions of many other NLP tasks, especially due to re-usability of the numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces. [Less]

242K lines of code

4 current contributors

4 days since last commit

4 users on Open Hub

Low Activity
5.0
 
I Use This

matxin

Compare

  Analyzed 1 day ago

Machine translation engine based on a dependency grammar and XML interchange format. The Spanish-Basque (es-eu) translation direction is currently supported.

3.41M lines of code

0 current contributors

over 7 years since last commit

3 users on Open Hub

Inactive
5.0
 
I Use This
Licenses: No declared licenses

OpenThesaurus

Compare

  Analyzed about 21 hours ago

A web-based application to build thesauri. Can export its data to text and OpenOffice/LibreOffice format.

21.2K lines of code

2 current contributors

3 months since last commit

2 users on Open Hub

Very Low Activity
5.0
 
I Use This

RelEx Semantic Relationship Extractor

Compare

  Analyzed 3 days ago

RelEx is an English-language semantic relationship extractor, built on the Carnegie-Mellon Link Grammar parser. It can identify dependency-grammar dependencies, such as subject, object, indirect object and many other relationships between words in a sentence. It can also provide part-of-speech ... [More] tagging, noun-number tagging, verb tense tagging, gender tagging, and so on. Relex includes a basic implementation of the Hobbs anaphora (pronoun) resolution algorithm. RelEx also provides semantic relationship framing, similar to that of FrameNet. [Less]

11.8K lines of code

4 current contributors

12 months since last commit

2 users on Open Hub

Very Low Activity
0.0
 
I Use This

SimMetrics

Compare

  Analyzed about 15 hours ago

SimMetrics is a Similarity Metric Library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro etc) to other metrics, (e.g Soundex, Chapman). Work provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/01.

5.76K lines of code

0 current contributors

almost 18 years since last commit

2 users on Open Hub

Inactive
5.0
 
I Use This
Licenses: No declared licenses

Affisix

Compare

  No analysis available

Affisix is a program for automatic recognition of affixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.

0 lines of code

0 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available
4.0
   
I Use This
Mostly written in language not available
Licenses: gpl3

Ruby LinkParser

Compare

  Analyzed about 9 hours ago

A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.

2.41K lines of code

1 current contributors

almost 2 years since last commit

1 users on Open Hub

Very Low Activity
0.0
 
I Use This