Projects tagged ‘linguistics’

jWordSplitter

J

Analyzed about 5 hours ago

jWordSplitter a a small Java library that splits compound words into their parts. This is especially useful for languages like German where an infinite number of new words can be formed by just appending nouns ("Donaudampfschifffahrtskapitän").

1.48K lines of code

1 current contributors

7 months since last commit

1 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Ruby LinkParser

Analyzed about 11 hours ago

A high-level interface to the CMU Link Grammar. This binding wraps the link-grammar shared library provided by the AbiWord project for their grammar-checker.

2.41K lines of code

1 current contributors

over 1 year since last commit

1 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in C

Licenses: bsd

Tags classifier cmu computational_linguistics corpora grammar information_retrieval language linguistics machine_learning natural_language natural_language_processing nlp 5 more...

IMS Open Corpus Workbench

I

Analyzed 4 months ago

The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP.

281K lines of code

2 current contributors

5 months since last commit

1 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in PHP

Licenses: No declared licenses

Tags corpus index language linguistics query search xml

LexAt Lexical/Corpus Statistics

L

No analysis available

The LexAt "lexical attraction" aka the RelEx Statistical Linguistics package adds statistical algorithms to the RelEx. Corpus statistics, including mutual information, are maintained in an SQL database, and drawn on to enhance various RelEx functions, such as parse ranking and chunk ranking, and word-sense disambiguation (Mihalcea algo).

0 lines of code

0 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in language not available

Licenses: apache_2

Tags computational_linguistics corpora corpus corpus_linguistics database java linguistics natural_language natural_language_processing nlp opencog perl 1 more...

opencorpora

O

Analyzed 1 day ago

An engine for creating and annotating textual corpora

38.6K lines of code

3 current contributors

8 months since last commit

1 users on Open Hub

Very Low Activity

0 Reviews

I Use This

Mostly written in PHP

Licenses: gpl

Tags computational_linguistics corpora corpus corpus_linguistics crowdsourcing disambiguation linguistics natural-language-processing natural_language_processing nlp part_of_speech russian 1 more...

pymorphy2

P

Analyzed about 8 hours ago

Morphological analyzer / POS tagger / inflection engine for Russian language.

4.5K lines of code

0 current contributors

over 3 years since last commit

1 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Python

Licenses: No declared licenses

Tags inflection linguistics morphological_analysis morphologicalanalysis python

Atomic (multi-level annotation)

A

Analyzed about 1 hour ago

Software for multi-level annotation of linguistic corpora

17K lines of code

0 current contributors

over 7 years since last commit

1 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags annotation corpus eclipsercp linguistics multi-layer science

Abydos NLP/IR

Analyzed about 2 hours ago

Abydos NLP/IR library for Python

74.7K lines of code

1 current contributors

over 3 years since last commit

1 users on Open Hub

Inactive

0 Reviews

I Use This

Mostly written in Python

Licenses: gpl3, gpl3_or_l...

Tags distance fuzzy levenshtein linguistics machine_learning measures metaphone metrics natural_language_processing nlp phonetic python 4 more...

FieldWorks

Analyzed about 11 hours ago

FieldWorks consists of software tools that help you manage linguistic and cultural data. FieldWorks supports tasks ranging from the initial entry of collected data through to the preparation of data for publication: -dictionary development -interlinearization of texts -cultural ... [More]

8.59M lines of code

6 current contributors

3 months since last commit

1 users on Open Hub

Low Activity

0 Reviews

I Use This

Mostly written in C#

Licenses: lgpl21_or...

Tags anthropology c# c++ dictionary grammar interlinearization linguistics linux morphological_analysis morphology natural_language non_roman_scripts 2 more...

Giellatekno

Analyzed about 2 hours ago

Giellatekno, Centre for Saami language technology at the University of Tromsø, started as a project for Saami grammatical analysis, later extended into syntax, proofing tools, interactive pedagogical programs, electronic dictionaries, and text-to-speech. The linguistic philosophy is that programs ... [More]

79.5M lines of code

52 current contributors

about 8 hours since last commit

1 users on Open Hub

High Activity

0 Reviews

I Use This

Mostly written in JavaScript

Licenses: ccbysa3-0, gpl3

Tags computational_linguistics grammar linguistics morphology natural_language natural_language_processing nlp saami syntax

Tags : Browse Projects