Tags : Browse Projects

Apache Tika

Claimed by Apache Software Foundation. Analyzed about 1 hour ago.

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

391K lines of code

19 current contributors

about 19 hours since last commit

23 users on Open Hub

High Activity
5.0
 
Apache Solr for TYPO3

  Analyzed about 6 hours ago

Open Source Enterprise Search meets Open Source Enterprise Content Management System. A TYPO3 extension that integrates the Apache Solr enterprise search server with TYPO3. Features include:

* User Access Groups Support
* Multi Language Handling
* File Indexing
* Faceting & Filters
* Sorting
* Field Boosting
* Spellchecking
* Search Word Highlighting
* Auto Suggest
* Multisite Support
* Advanced Templating Engine
* Index Reports

100K lines of code

22 current contributors

12 days since last commit

3 users on Open Hub

Moderate Activity
5.0
 
Apache Tika for TYPO3

  Analyzed about 11 hours ago

Apache Tika for TYPO3 offers several services to extract metadata and content from files. The extension also comes with a service to detect the language of a text (requires Tika 0.8+). EXT:tika can use either a locally available Tika CLI app or a remote Apache Solr server. The provided services can then be used by other extensions such as EXT:dam or EXT:solr.

4.93K lines of code

2 current contributors

11 months since last commit

1 user on Open Hub

Very Low Activity
5.0
 