Tags : Browse Projects

jWordSplitter

  Analyzed about 5 hours ago

jWordSplitter is a small Java library that splits compound words into their parts. This is especially useful for languages like German, where an unlimited number of new words can be formed just by appending nouns ("Donaudampfschifffahrtskapitän").

1.48K lines of code

1 current contributor

7 months since last commit

1 user on Open Hub

Very Low Activity
0.0
 

Ruby LinkParser

  Analyzed about 11 hours ago

A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.

2.41K lines of code

1 current contributor

over 1 year since last commit

1 user on Open Hub

Very Low Activity
0.0
 

IMS Open Corpus Workbench

  Analyzed 4 months ago

The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP.

281K lines of code

2 current contributors

5 months since last commit

1 user on Open Hub

Activity Not Available
0.0
 
Licenses: No declared licenses

LexAt Lexical/Corpus Statistics

  No analysis available

LexAt ("lexical attraction"), also known as the RelEx Statistical Linguistics package, adds statistical algorithms to RelEx. Corpus statistics, including mutual information, are maintained in an SQL database and drawn on to enhance various RelEx functions, such as parse ranking, chunk ranking, and word-sense disambiguation (Mihalcea algorithm).

0 lines of code

0 current contributors

0 since last commit

1 user on Open Hub

Activity Not Available
0.0
 
Mostly written in language not available
Licenses: apache_2

opencorpora

  Analyzed 1 day ago

An engine for creating and annotating textual corpora

38.6K lines of code

3 current contributors

8 months since last commit

1 user on Open Hub

Very Low Activity
0.0
 

pymorphy2

  Analyzed about 8 hours ago

Morphological analyzer / POS tagger / inflection engine for the Russian language.

4.5K lines of code

0 current contributors

over 3 years since last commit

1 user on Open Hub

Inactive
5.0
 
Licenses: No declared licenses

Atomic (multi-level annotation)

  Analyzed about 1 hour ago

Software for multi-level annotation of linguistic corpora

17K lines of code

0 current contributors

over 7 years since last commit

1 user on Open Hub

Inactive
0.0
 

Abydos NLP/IR

  Analyzed about 2 hours ago

Abydos NLP/IR library for Python

74.7K lines of code

1 current contributor

over 3 years since last commit

1 user on Open Hub

Inactive
0.0
 
Licenses: gpl3, gpl3_or_l...

FieldWorks

  Analyzed about 11 hours ago

FieldWorks consists of software tools that help you manage linguistic and cultural data. FieldWorks supports tasks ranging from the initial entry of collected data through to the preparation of data for publication:

- dictionary development
- interlinearization of texts
- cultural records, which can be categorized using the Outline of Cultural Materials
- bulk editing of many fields
- morphological analysis
- complex non-Roman scripts using Unicode and SIL-developed Graphite
- multi-user editing capability over a local area network

8.59M lines of code

6 current contributors

3 months since last commit

1 user on Open Hub

Low Activity
0.0
 

Giellatekno

  Analyzed about 2 hours ago

Giellatekno, the Centre for Saami language technology at the University of Tromsø, started as a project for Saami grammatical analysis and was later extended into syntax, proofing tools, interactive pedagogical programs, electronic dictionaries, and text-to-speech. The linguistic philosophy is that programs for linguistic analysis should be founded on deep linguistic knowledge, where the analysis takes word forms as a starting point and builds the syntactic analysis bottom-up, rather than vice versa. In this way we are able to build analysers that are robust and at the same time give deep rather than shallow linguistic analyses. These analysers form the basis both for practical programs for end users and for advanced linguistic research.

79.5M lines of code

52 current contributors

about 8 hours since last commit

1 user on Open Hub

High Activity
5.0
 