readability v0.6.0 API Reference

Modules

Readability library for extracting & curating articles

Builds the article for readability

AuthorFinder extracts authors

A Candidate is an element that could be the article

Cleans the HTML tree to prepare candidates. It transforms misused tags and removes unlikely candidates
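
The cleaning step described above could look roughly like the following Python sketch. It is not the library's implementation: the `Node` type, the `UNLIKELY` pattern, the `BLOCK_TAGS` set, and the rule that a `div` without block-level children becomes a `p` are all assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass, field

# Hypothetical pattern for class names that rarely hold article content.
UNLIKELY = re.compile(r"banner|comment|footer|sidebar|sponsor", re.I)
# Hypothetical set of block-level tags; a <div> holding none of these
# is treated as a misused paragraph.
BLOCK_TAGS = {"div", "p", "ul", "ol", "table", "blockquote", "pre", "img"}


@dataclass
class Node:
    """Minimal stand-in for an HTML element (assumed, not the library's type)."""
    tag: str
    class_name: str = ""
    children: list = field(default_factory=list)


def clean(node):
    """Return a cleaned copy of the tree rooted at `node`."""
    # Drop children whose class name matches the unlikely pattern,
    # and clean the remaining children recursively.
    kept = [clean(c) for c in node.children if not UNLIKELY.search(c.class_name)]
    tag = node.tag
    # Turn a <div> that contains no block-level children into a <p>.
    if tag == "div" and not any(c.tag in BLOCK_TAGS for c in kept):
        tag = "p"
    return Node(tag, node.class_name, kept)
```

In this sketch, a `body` holding a `div.content` (with only inline children) and a `div.sidebar` would come back with a single `p.content` child.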

The engine for building and finding candidates. It traverses the HTML tree, searching, removing, and scoring nodes
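
As a rough illustration of traversing a tree and scoring nodes, here is a minimal Python sketch. The `Node` type and the scoring rule (points per paragraph, per comma, and per 100 characters of text) are assumptions made for this example and may differ from what the library actually does.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Minimal stand-in for an HTML element (assumed, not the library's type)."""
    tag: str
    text: str = ""
    children: list = field(default_factory=list)


def score(node):
    """Score a node by its direct <p> children (hypothetical rule)."""
    total = 0
    for child in node.children:
        if child.tag == "p":
            # One point per paragraph, one per comma,
            # and up to three points for each 100 characters of text.
            total += 1 + child.text.count(",") + min(len(child.text) // 100, 3)
    return total


def walk(node):
    """Yield the node and all of its descendants, depth first."""
    yield node
    for child in node.children:
        yield from walk(child)


def best_candidate(root):
    """Return the highest-scoring node in the tree."""
    return max(walk(root), key=score)
```

A container with several text-heavy paragraphs outscores a navigation block that holds only links, so it is returned as the best candidate.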

Helpers for parsing, updating, and removing nodes in the HTML tree

Cleans an element of all tags of type “tag” if they look fishy. “Fishy” is decided by a heuristic based on content length, class names, link density, number of images and embeds, etc.
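
A heuristic of this kind might be sketched as follows in Python. This is only an illustration under stated assumptions: the `ElementStats` fields, the `NEGATIVE_CLASS` pattern, and every threshold are hypothetical and are not taken from the library.

```python
import re
from dataclasses import dataclass

# Hypothetical pattern for class names that suggest non-article content.
NEGATIVE_CLASS = re.compile(r"comment|footer|sidebar|sponsor|widget", re.I)


@dataclass
class ElementStats:
    """Pre-computed facts about one element (assumed shape)."""
    text_length: int        # characters of visible text in the element
    link_text_length: int   # characters of that text inside <a> tags
    class_name: str = ""
    images: int = 0
    embeds: int = 0
    paragraphs: int = 0


def link_density(stats):
    """Fraction of the element's text that is link text."""
    if stats.text_length == 0:
        return 0.0
    return stats.link_text_length / stats.text_length


def looks_fishy(stats):
    """Return True if the element should be removed (illustrative thresholds)."""
    if NEGATIVE_CLASS.search(stats.class_name):
        return True
    if stats.images > stats.paragraphs:
        return True
    if stats.embeds > 0 and stats.text_length < 75:
        return True
    return link_density(stats) > 0.5
```

With these assumed thresholds, an element with 400 characters of text, 20 of them inside links, one image, and four paragraphs is kept, while an element whose text is 80% link text is flagged.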

The TitleFinder engine traverses the HTML tree to find the title