NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.
Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize (cluster) search results into thematic categories.
Carrot2 provides an architecture for acquiring search results from various sources (YahooAPI, GoogleAPI, MSN Search API, OpenSearch, Lucene index)
... [More], clustering the results and visualising the clusters. Currently, 5 clustering algorithms are available that are suitable for different kinds of document clustering tasks. [Less]
Task of the project is a semantic annotation of texts using NLP tools.
Czsem Mining Suite is mainly a GATE plugin that allows to use Treex and TectoMT tools inside GATE. Bsides that is also a Information Extraction tool based on dependency liguistics. It si capable to learn tree queries
... [More] (dependecy based extraction rules) using Inducive Logic Programming. [Less]
This site uses cookies to give you the best possible experience.
By using the site, you consent to our use of cookies.
For more information, please see our
Privacy Policy