This Php5 script scanns XHTML files exported from org. Pulls published contents into a DB and list snippets, images, duplicates...It's purposes are to validate the XHTML output of Org-mode and create a fulltext search for all pages in a directory tree.