0
I Use This!
Inactive

Commits : Listings

Analyzed about 5 hours ago. based on code collected about 5 hours ago.
Jan 13, 2025 — Jan 13, 2026
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Merge branch 'graph' of [email protected]:jeremybarnes/github_contest More... over 16 years ago
getting towards basic indexing More... over 16 years ago
reorganized some of the interfaces and improved the query code; still not being used More... over 16 years ago
moving towards more advanced functionality More... over 16 years ago
improved syntax for accessing nodes with a primary key More... over 16 years ago
modified example to load the rest of the data More... over 16 years ago
github import test now working properly (if slowly) More... over 16 years ago
fixed reference counting problem due to default assign on AttributeRef not being implemented; further improved test cases More... over 16 years ago
all attribute tests are now passing More... over 16 years ago
second test (reference counted attributes) is passing basic functionality More... over 16 years ago
gradual progress on attributes; first test is running More... over 16 years ago
gradually fleshing out graph data structures More... over 16 years ago
use random forests (from the latest jml update) More... over 16 years ago
merged in changes from other machine More... over 16 years ago
started work on a generic graph data structure to allow more efficient mining More... over 16 years ago
added missing data file More... over 16 years ago
added follower graph to the information; minor improvements More... over 16 years ago
added collaborator information; feature seems informative but no major effect on the results More... over 16 years ago
no real changes More... over 16 years ago
don't keep copying prob-results.txt More... over 16 years ago
added number of missing watchers and forks feature based upon data from hintly's submission More... over 16 years ago
removed user and author matches from coocs and made them only suggest repos that are watched at least twice. Not much difference; probably slightly worse More... over 16 years ago
don't allow same author or same name results in in_cluster to improve diversity More... over 16 years ago
use both keywords and watcher SVD for repo clustering More... over 16 years ago
using keyword vector for clustering repos; looks worse (we lose a lot of repos suggested by repo clustering) More... over 16 years ago
use binary features for keyword svd; big improvement More... over 16 years ago
used tf-idf not tf for svd; works less well More... over 16 years ago
more keyword features; still not making much of a difference More... over 16 years ago
finished SVD for keywords and various improvements to the preprocessing; added very basic features; still not much good More... over 16 years ago
added a probability propagation candidate generator (similar to the cooc and cooc2 generators). Slow and not much improvement. Thanks to Mark Lodato for the idea. More... over 16 years ago