Inactive
Analyzed about 23 hours ago, based on code collected 1 day ago.

Project Summary

Extracts and cleans text from a Wikipedia database dump and stores the output in a number of files of similar size in a given directory. This is a mirror of the script by Giuseppe Attardi.
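
The "files of similar size" behavior described above can be illustrated with a minimal sketch. This is not the project's actual code: the function name `write_in_chunks`, the `chunk_NNNN.txt` file naming, and the `max_bytes` parameter are all hypothetical, and the sketch assumes the cleaned documents are already available as plain strings.

```python
import os


def write_in_chunks(documents, out_dir, max_bytes):
    """Write documents into numbered files of roughly max_bytes each.

    Hypothetical helper for illustration only; not the project's code.
    Returns the list of file paths written.
    """
    os.makedirs(out_dir, exist_ok=True)
    paths = []
    current = None   # open file handle for the current chunk
    written = 0      # bytes written to the current chunk so far
    for doc in documents:
        data = (doc.rstrip("\n") + "\n").encode("utf-8")
        # Start a new file when none is open yet, or when adding this
        # document would push a non-empty chunk past the size limit.
        if current is None or (written > 0 and written + len(data) > max_bytes):
            if current is not None:
                current.close()
            path = os.path.join(out_dir, "chunk_%04d.txt" % len(paths))
            current = open(path, "wb")
            paths.append(path)
            written = 0
        current.write(data)
        written += len(data)
    if current is not None:
        current.close()
    return paths
```

Documents are never split across files in this sketch, so a single document larger than `max_bytes` produces one oversized file; whether the real script behaves the same way is not stated here.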

Tags

mediawiki parser python tanl

In a Nutshell, wikipedia-extractor...

GNU General Public License v3.0 only

Permitted: Commercial Use, Modify, Distribute, Place Warranty, Use Patent Claims

Forbidden: Sub-License, Hold Liable

Required: Distribute Original, Disclose Source, Include Copyright, State Changes, Include License, Include Install Instructions

These details are provided for information only. Nothing here is legal advice, and it should not be used as such.

This project has no vulnerabilities reported against it.

30 Day Summary

Dec 26 2022 — Jan 25 2023

12 Month Summary

Jan 25 2022 — Jan 25 2023
