Jeremy Benoist
f2a43b476c
Avoid PHP Warning
...
This isn't the best solution but the previous one using `@` wasn't really better.
Appending a string into a fragment might generate some warning if the string contains bad entity.
For example `+`.
9 years ago
Jeremy Benoist
8b1c3f147d
Don't be to hard on 'links' attribute
9 years ago
Jeremy Benoist
ff754b80bd
Avoid childnode becoming null to generate a warning
9 years ago
Jeremy Benoist
d97bece7c5
Don’t be too agressive
...
Some links got a “tooltip-link” and shouldn’t be removed by php-readability because they are usefull to the content
10 years ago
Jeremy Benoist
3de4e918b4
Convert header & section to p
...
And took `pre` element in score
10 years ago
Jeremy Benoist
5182d6cb11
“info” is too agressive in unlikelyCandidates
...
Some contents have a `infocontent` node (ot sth different) and they are real content.
Using only `info` as regex is too agressive and remove legitimate content.
Matching the whole word `info` (or `infos`) should be a better choice
10 years ago
Jeremy Benoist
2ef400bf73
Enable php-cs-fixer
10 years ago
Jeremy Benoist
00f622e9b7
Revert BC changes
...
- avoid method signature update
- revert moving logic out of the constructor
10 years ago
Jeremy Benoist
c756ec067e
Fix tests
...
`getInnerText` might receive a null DOMElement if the xpath or query return no element.
10 years ago
Jeremy Benoist
8ab7d76cd5
Use Monolog instead of custom solution
...
Remove that ugly `openlog` & `syslog`
10 years ago
Jeremy Benoist
149a333b40
Remove addPreFilter
...
Pre filters are used in the __construct so adding more pre filters once the object is instantiated is useless.
10 years ago
Jeremy Benoist
209c404d7b
Fix instanceof DOMElement
...
We previously checked `instanceof DOMElement` which was wrong since we
are in the namespace class, the class `Readability\DOMElement` does not
exists.
10 years ago
Jeremy Benoist
2951936e00
CS & PHPDoc
10 years ago
Jeremy Benoist
850ade16b6
Cleanup
10 years ago
Jeremy Benoist
dc590542f0
Avoid adding id that might already exists
...
We append a new node when it isn't a `div` or `p` (like when it's an `article`) with the same id which generate a DOM error "DOMElement::setAttribute(): ID blabla already defined".
10 years ago
Jeremy Benoist
111cb08034
Improve negative element
...
- add unlikelyCandidates: head
- add negative: recommend
11 years ago
Jeremy Benoist
f71c3a4196
Do not remove html tag attributes
...
They might contains useful information (at least language)
11 years ago
Jeremy Benoist
b77876b30a
Do not remove nofollow links
...
Most the time, they can be usefull.
At least, it'll be a link to something unrelated. But we won't lose a link inside the content.
Also, adding some extra space.
11 years ago
Jeremy Benoist
175196d6c2
Avoid error with
...
Fix #5
11 years ago
Jeremy Benoist
2b5af601d5
Do not format output to avoid breaking apps
...
It'll require to jump to 2.0.0 and I think it's to soon
11 years ago
Jeremy Benoist
d01eb2ac1e
Use class instead of id to avoid error
...
It generates error like `ID XXX already defined`
11 years ago
Jeremy Benoist
c5a4a490e1
CS
11 years ago
Jeremy Benoist
c67189248e
Backport changes from wallabag
...
e9e4ff87f8
11 years ago
Jeremy Benoist
91b80b70e2
Update HTML5 tags
...
From https://github.com/htacg/tidy-html5/blob/master/src/tags.c#L296
11 years ago
Jeremy Benoist
814c6e4730
Restore compatibility with PHP 5.3
11 years ago
Jeremy Benoist
1963319a55
Improve Travis & add Scrutinizer
...
+ CS
+ Update README
11 years ago
Jeremy Benoist
f5d473780d
Fix javascript typo
...
And add coverage
11 years ago
Jeremy
881e441bdf
Initial commit
11 years ago