Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Smart and Simple Web Crawler

  No analysis available

A simple framework for implementing crawling technology in your own programs and libraries.

0 lines of code

0 current contributors

0 since last commit

2 users on Open Hub

Activity Not Available
3.0
   
Mostly written in language not available
Licenses: No declared licenses

Noodle NG

  Analyzed about 15 hours ago

Noodle is a web search engine for local SMB/CIFS network shares (Windows/Samba). It consists of a crawler, which scans a given range of IP addresses for shares, collects data about shares, folders and files, and stores it in a database, and a front-end for making search queries against the database. It is written in the scripting language Python and uses TurboGears 2.0 as a framework for rapid web development. We also use WebOb and pysmbc.

18.3K lines of code

0 current contributors

over 12 years since last commit

2 users on Open Hub

Inactive
5.0
 
Asqatasun

  Analyzed 2 minutes ago

Open-source website analyser, used for web accessibility (a11y) and search engine optimisation (SEO). http://asqatasun.org

428K lines of code

4 current contributors

almost 3 years since last commit

2 users on Open Hub

Inactive
5.0
 
Anemone

  Analyzed 1 day ago

Anemone is a Ruby library that makes it quick and painless to write programs that spider a website. It provides a simple DSL for performing actions on every page of a site, skipping certain URLs, and calculating the shortest path to a given page on a site. The multi-threaded design makes Anemone fast. The API makes it simple. And the expressiveness of Ruby makes it powerful.

2.15K lines of code

0 current contributors

over 13 years since last commit

1 user on Open Hub

Inactive
5.0
 
constellio

  Analyzed about 16 hours ago

Constellio: the open-source solution for enterprise search. Based on Apache Solr and Google Enterprise Connector Manager, Constellio makes all your relevant corporate information (Web, Email, ECM, CRM, etc.) available with a single click. Constellio is developed by Doculibre. http://www.constellio.com

1.15M lines of code

19 current contributors

over 4 years since last commit

1 user on Open Hub

Inactive
5.0
 
PUMz

  Analyzed about 2 hours ago

PUMz is a web clipping service project and an upgrade of pumware (pumware.sf.net).

2.95K lines of code

0 current contributors

about 22 years since last commit

1 user on Open Hub

Inactive
0.0
 
Crawley

  Analyzed about 23 hours ago

Pythonic crawling / scraping framework built on Eventlet.

Features:
* High-speed web crawler built on Eventlet.
* Supports database engines such as PostgreSQL, MySQL, Oracle and SQLite.
* Command-line tools.
* Extract data using your favourite tool: XPath or PyQuery (a jQuery-like library for Python).
* Cookie handlers.
* Very easy to use (see the example).

Documentation: http://packages.python.org/crawley/

3.69K lines of code

0 current contributors

over 10 years since last commit

1 user on Open Hub

Inactive
0.0
 
Spidr

  Analyzed 1 day ago

Spidr is a versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

4.39K lines of code

1 current contributor

3 months since last commit

1 user on Open Hub

Very Low Activity
0.0
 
libglyr

  Analyzed 1 day ago

Glyr is a search engine for music-related metadata. It comes both as a command-line tool and as a C library, both with an easy-to-use interface. The sort of metadata glyr searches for (and downloads) is usually the data you see in your music player. Indeed, it was originally written to serve as an internal library for a music player, but it has been extended to work as a standalone program that can download everything a modern music player needs to make its user happy.

Features:
- Always having more than fallback, and a hit rate of approx. 97% for covers
- Portable: Windows and Linux
- Fuzzy matching: searches providers using the Levenshtein algorithm to eliminate typos and enhance search results
- Fast download: libcurl is used internally, and sources are searched in parallel

14.5K lines of code

0 current contributors

almost 6 years since last commit

1 user on Open Hub

Inactive
5.0
 
FilmeUtils

  No analysis available

FilmeUtils downloads subtitles from legendas.tv and torrents from The Pirate Bay with one click.

0 lines of code

2 current contributors

0 since last commit

1 user on Open Hub

Activity Not Available
5.0
 
Mostly written in language not available
Licenses: Public_do...