-
Notifications
You must be signed in to change notification settings - Fork 172
Open
Description
Hello,
I tried to apply readability on a specific layout of The Guardian, which heavily relies on JavaScript but still has most of the text available in the HTML source code:
Readability returned this chunk of HTML:
<div><div> comments <p>Sign in or create your Guardian account to join the discussion. </p> <p>This discussion is closed for comments.</p> <p> We’re doing some maintenance right now. You can still read comments, but please come back later to add your own. </p> <p> Commenting has been disabled for this account (why?) </p> </div></div>Do you know guys why the main content is not properly extracted, and if it fixable?
Metadata
Metadata
Assignees
Labels
No labels