readability-extract

The goal of this library is to provide a function that will extract article text from a webpage. It is based on the Readability bookmarklet and the Kerrick/readability-js repository.

Overview of changes

The Readability bookmarklet extracts article text from the current webpage, but it also modifies the current page to display article in an easy to read fashion. This fork's contribution is to pull that article extracting functionality out into a function that leaves the page unchanged.

Usage

readability.extract(function(article){
  console.log(article.title);   // logs the title of the article
  console.log(article.content); // logs the body of the article
});

extract is an asynchronous function. If the article is paginated, extract will retreive the subsequent pages and concatenate them. extract's only argument is a callback which will be executed when the entire article has been retrieved and processed.

Name		Name	Last commit message	Last commit date
Latest commit History 136 Commits
.gitignore		.gitignore
README.md		README.md
bower.json		bower.json
readability.js		readability.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

readability-extract

Overview of changes

Usage

About

Releases

Packages

Languages

NoahCarnahan/readability-extract

Folders and files

Latest commit

History

Repository files navigation

readability-extract

Overview of changes

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages