1
I Use This!
Inactive

Commits : Listings

Analyzed about 9 hours ago. based on code collected 1 day ago.
Dec 02, 2024 — Dec 02, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
generalized _has_vowel for use in Porter2/Snowball (added _sb to name) condensed _snowball to _sb in function names More... over 11 years ago
converted Porter stemmer to operate on & return lowercase (to match Porter2/Snowball behavior) fixed an incorrect set initialization in Porter2 More... over 11 years ago
completed Porter2 through Step 1a More... over 11 years ago
Porter2 through Step 0 More... over 11 years ago
fixed missed switch from upper to lower More... over 11 years ago
added helper function to test for short words More... over 11 years ago
added helper function to test for short syllable endings More... over 11 years ago
converted Porter2 tests/functions to operate on lowercase, in line with the algorithm description added short syllable helper function More... over 11 years ago
added _snowball_r1/_snowball_r2 R1/R2 calculator helper functions added definitions for vowels, doubles, & li-endings for Porter2 stemmer More... over 11 years ago
renamed R1/R2 helper function calls More... over 11 years ago
added tests for R1 & R2 calculation for Porter2 More... over 11 years ago
added test set and function skeleton for Porter2 More... over 11 years ago
renamed porter stemmer Snowball test set More... over 11 years ago
updated Porter algorithm & Snowball test set to match the "Points of difference from the published algorithm" More... over 11 years ago
fixed Porter stemmer so that it matches longest suffix and then quits (i.e. the stem of 'argument' is 'argument') This now passes the whole Snowball test set More... over 11 years ago
corrected fundamental aspects of Porter algorithm, which can now return something for every input in the Snowball test set (but not always the correct thing) More... over 11 years ago
added simple Porter test cases and test routine for the major set More... over 11 years ago
completed tests for Porter helpers More... over 11 years ago
added Porter stemmer test cases (from http://snowball.tartarus.org/algorithms/porter/diffs.txt) More... over 11 years ago
first attempt at completing implementation of Porter stemmer helper functions More... over 11 years ago
added Porter stemmer helper functions More... over 11 years ago
some progress on Porter stemmer More... over 11 years ago
starting Porter stemmer implementation (m-degree calculation completed) More... over 11 years ago
spacing, legibility More... over 11 years ago
needs refinement, but fixed bit-counting More... over 11 years ago
added some basic tests for cmp_features added -1 return for -1 in input of cmp_features More... over 11 years ago
spacing More... over 11 years ago
added simply comparison function for feature bundles (not intended to be a final implementation) More... over 11 years ago
switched phonetic symbol matching to use NFD--we're already matching maximum-length strings, and this allows catching a few base characters w/o their diacritics More... over 11 years ago
a little progress on tf-idf similarity (little more than a stub at this point) More... over 11 years ago