readability v0.4.0 API Reference
Modules
Readability library for extracting & curating articles
Build article for readability
AuthorFinder extracts authors
Candidate can be article
Clean html tree for prepare candidates. It transforms misused tags and removes unlikely candidates
Score html tree
The builing and finding candidates engine It traverses the HTML tree searching, removing, socring nodes
Helpers for parsing, updating, removing html tree
Clean an element of all tags of type “tag” if they look fishy. “Fishy” is an algorithm based on content length, classnames, link density, number of images & embeds, etc
The TitleFinder engine traverses HTML tree searching for finding title