69 Commits (03533f5e4f48254255ad08d9abc115d728221328)

Author SHA1 Message Date
Jeremy Benoist f9e73fb49a Fix bad encoding for `tidy_repair_string` 4 days ago
Jan Tojnar 8b89d70b1a Fix character decoding regression when `title` precedes `meta[charset]` 10 months ago
Jan Tojnar efbbc86df9 Fix discarding `html[lang]` 1 year ago
Jan Tojnar c7208f6ad2 Do not set domainRegExp for local files 1 year ago
Jan Tojnar 9a9373de4b Iterate node lists with `foreach` 1 year ago
Jan Tojnar d454c3a462 Remove dead iteration code 1 year ago
Jan Tojnar 8b1ef07401 Extract `for`-iterated items into variables 1 year ago
Jan Tojnar 5885dbbe78 Remove pointless `stdClass` 1 year ago
Jan Tojnar 80adfe870b Fix coding style 1 year ago
Jan Tojnar 677f3f096e Fix hasSingleTagInsideElement method 2 years ago
Jan Tojnar 89d3b74259 Rectorize to PHP 7.4 2 years ago
Jan Tojnar 648d8c605b Update coding style for upcoming PHP-CS-Fixer changes 2 years ago
Jan Tojnar f14428e4c0 Do not use `mb_convert_encoding` with `HTML-ENTITIES` as target encoding 3 years ago
Jan Tojnar 9ed89bde92 Fix PHP-Cs-Fixer changes 3 years ago
Kevin Decherf 6689f19956 Strip script and style tags through ::clean() method instead of preg_replace 4 years ago
Kevin Decherf 2ab87d7445 Fix isPhrasingContent conditions, text node replacement 4 years ago
Jeremy Benoist c2a1639b34 Add Rector 4 years ago
Kevin Decherf a44c4e5482 Add routine to remove invisible nodes 4 years ago
Kevin Decherf b580cf216d Backport some logics from mozilla/readability 4 years ago
Jeremy Benoist c4bba53dbe Remove Scrutinizer 4 years ago
Jeremy Benoist 66215a6c80 Require PHP >= 7.2 4 years ago
Jérémy Benoist fabf096ce6 Fix deprecated message 4 years ago
Kevin Decherf eb72a315c4 Clean empty figure tags without ending 4 years ago
Jeremy Benoist ea1368fac0 Body can be wiped without tidy 5 years ago
Jan Tojnar 7cea79c23a readability: stop tidy from wrapping noscript text 5 years ago
Jeremy Benoist 6a8ecf232f Use a new deps for HTML5 parser 6 years ago
Jeremy Benoist b1acc9ed73 Fix PHPStan (again) 6 years ago
Jeremy Benoist 11d2946904 Add openload.co to media detection 7 years ago
nicofrand ff78c63e6d Skip empty (empty innerHTML) nodes when grabbing article 7 years ago
Jeremy Benoist bb65caf864 Fix “A non well formed numeric value encountered” 7 years ago
Simounet 2e20f76195 \bout removed from negative content 7 years ago
Jeremy Benoist 74d9cc605a Enable PHPStan 7 years ago
Jeremy Benoist 2dce2879bf Update fixer rules 7 years ago
Kevin Decherf 26c881d864 tidy: use tidy_repair_string instead of tidy_parse_string+tidy_clean_repair 7 years ago
Simounet 422c74f29c Giphy added to allowed medias 7 years ago
Simounet 63cd304dba Media class added to positive candidates 8 years ago
Kevin Decherf 4c68cc9f09 Keep elements with 'footnote' as possible candidates 9 years ago
Jeremy Benoist 613a63c062 CS 9 years ago
Jeremy Benoist 05089bbd03 Add missing HTML5 class 9 years ago
Jeremy Benoist f2a43b476c Avoid PHP Warning 9 years ago
Jeremy Benoist 8b1c3f147d Don't be to hard on 'links' attribute 9 years ago
Jeremy Benoist ff754b80bd Avoid childnode becoming null to generate a warning 9 years ago
Jeremy Benoist d97bece7c5 Don’t be too agressive 10 years ago
Jeremy Benoist 3de4e918b4 Convert header & section to p 10 years ago
Jeremy Benoist 5182d6cb11 “info” is too agressive in unlikelyCandidates 10 years ago
Jeremy Benoist 2ef400bf73 Enable php-cs-fixer 10 years ago
Jeremy Benoist 00f622e9b7 Revert BC changes 10 years ago
Jeremy Benoist c756ec067e Fix tests 10 years ago
Jeremy Benoist 8ab7d76cd5 Use Monolog instead of custom solution 10 years ago
Jeremy Benoist 149a333b40 Remove addPreFilter 10 years ago