
Commits : Listings

Analyzed about 20 hours ago, based on code collected about 20 hours ago.
Aug 09, 2024 — Aug 09, 2025
Commit Message | Date
Update tests for spec rev 2990 that discards partial tag tokens on EOF | about 16 years ago
Make non-ascii characters in meta elements work and make compatMode attribute set on the right object | over 16 years ago
Start to add parse errors to PHP implementation. This introduces a 14% performance regression, and causes 23 tests to fail (all due to parse error issues). | over 16 years ago
Misc. micro-optimizations in the PHP impl (this has around a 1% difference). | over 16 years ago
This isn't actually quicker (~12% speed decrease, somehow), and makes the code less clear. | over 16 years ago
Remove unused method HTML5_Tokenizer::seek from PHP impl | over 16 years ago
Update line-col-tracking documentation. | over 16 years ago
Update the tree building tests with frameset-ok, new AAA and (not yet in spec) WebKit-style foster-parenting | over 16 years ago
PHP: grab multiple characters at once when getting attribute name. | over 16 years ago
Change EOL style to native in PHP port to match everything else. | over 16 years ago
PHP: grab multiple characters at once more often. This gives a fair performance boost on the spec (~12s), primarily by cutting out repeated method calls of each of the states. | over 16 years ago
Get rid of more preg_match used for whitespace matching. | over 16 years ago
Clean up the move to strspn by using constants for the groups of bytes. | over 16 years ago
Instead of using regex to check whether a character is a whitespace character, just use === and ||. Although messy, this cuts out a function call, which makes a fair difference when tokenizing the spec. | over 16 years ago
PHP: Really naïvely change Tokenizer::characters() to use strspn. This really needs to be tidied up, though. | over 16 years ago
PHP: use strcspn in bogusCommentState. | over 16 years ago
PHP: Calculate UTF-8 string length using count_chars, array_sum, and array_slice instead of looping over string in interpreted code. This is quicker in general for strings longer than four characters in my testing. Fuzzing strings a random length between 0 and 80 characters long shows a 10-times speed increase. | over 16 years ago
Add note of possible optimization. | over 16 years ago
I probably better add myself to the copyright notice now that there's actual code written by me. | over 16 years ago
(PHP port) Get rid of UTF8COL: this halves the time taken to tokenize the HTML 5 spec. Replacements for HTML5_Tokenizer::$line and HTML5_Tokenizer::$col can be found at HTML5_Tokenizer::getCurrentLine() and HTML5_Tokenizer::getColumnOffset() respectively. These compute what they return upon being called. | over 16 years ago
Start to remove PCRE and strtolower from PHP tokenizer. | over 16 years ago
Add note of version required of SimpleTest to run the tests in the PHP port. | over 16 years ago
Incorporate Geoffrey Sneddon's code review. | over 16 years ago
Fixed issue 97 | over 16 years ago
Add html5lib.serialize.serialize and html5lib.serialize | over 16 years ago
Fixed a sanitiser regression from r1248 | over 16 years ago
Fixes issue 91 by omitting the head element's start tag when the head element is empty. | over 16 years ago
Fix attr-value-quoting test with line tab (U+000B) | over 16 years ago
Quick hack to fix parsing from stdin | over 16 years ago
Implement r2708, adding missing invalid codepoints. | over 16 years ago
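
One commit in this listing describes calculating UTF-8 string length with count_chars, array_sum, and array_slice rather than looping over the string. The commit's PHP code is not shown on this page, so the following is only a rough Python analogue of that idea, assuming the approach is to build a histogram of byte values and subtract the continuation bytes (0x80 to 0xBF) from the total byte count. The function name utf8_length is hypothetical, and the sketch assumes the input is already valid UTF-8.

```python
def utf8_length(data: bytes) -> int:
    """Count code points in valid UTF-8 bytes without decoding them.

    Hypothetical illustration; not the code from the commit itself.
    """
    # Histogram of all 256 byte values (the role count_chars plays in PHP).
    histogram = [0] * 256
    for byte in data:
        histogram[byte] += 1

    # Bytes 0x80..0xBF are continuation bytes; every other byte starts a
    # code point. Slicing and summing mirrors array_slice plus array_sum.
    continuation = sum(histogram[0x80:0xC0])
    return len(data) - continuation
```

For example, utf8_length("naïve".encode("utf-8")) counts five code points even though the encoded string occupies six bytes. Whether a histogram beats a plain loop depends on the runtime; the speed figures quoted in the commit message apply to the PHP implementation only.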