Tags : Browse Projects

Apache Tika

Claimed by Apache Software Foundation

Analyzed about 3 hours ago

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

392K lines of code

19 current contributors

about 21 hours since last commit

23 users on Open Hub

Very High Activity
5.0

Apache Solr for TYPO3

Analyzed about 2 hours ago

Open Source Enterprise Search meets Open Source Enterprise Content Management System. A TYPO3 extension that integrates the Apache Solr enterprise search server with TYPO3. Features include user access group support, multi-language handling, file indexing, faceting and filters, sorting, field boosting, spellchecking, search word highlighting, auto-suggest, multisite support, an advanced templating engine, and index reports.

98.9K lines of code

22 current contributors

18 days since last commit

3 users on Open Hub

Moderate Activity
5.0

Apache Tika for TYPO3

Analyzed about 13 hours ago

Apache Tika for TYPO3 offers several services to extract metadata and content from files. The extension also comes with a service to detect the language of a text (requires Tika 0.8+). EXT:tika can use either a locally available Tika CLI app or a remote Apache Solr server. The provided services can then be used by other extensions such as EXT:dam or EXT:solr.

4.94K lines of code

2 current contributors

6 months since last commit

1 user on Open Hub

Very Low Activity
5.0