n8n-nodes-html-readability

This is an n8n community node. It lets you use Mozilla's Readability in your n8n workflows.

Mozilla's Readability is a standalone version of the algorithm used by Firefox Reader View to extract the main content from web pages, removing clutter and providing clean, readable text.

n8n is a fair-code licensed workflow automation platform.

Installation
Operations
Compatibility
Usage
Resources

Installation

Follow the installation guide in the n8n community nodes documentation.

Operations

HTML

Extract Content

Extracts the main content from HTML, removing navigation, ads, and other distracting elements.

Options:

Parameter	Type	Description
JSON Property	String	The property containing the HTML content to parse. Supports both dot notation (e.g., 'solution.response') and expressions
Continue on Error	Boolean	Whether to continue execution when the node encounters an error
Return Full Response	Boolean	Whether to return the full Readability response including metadata

Output:

Default output includes:

{
  "content": "<div>...extracted HTML content...</div>",
  "title": "Article Title",
  "excerpt": "Brief excerpt of the content"
}

With "Return Full Response" enabled, additional fields are included:

{
  "content": "<div>...extracted HTML content...</div>",
  "title": "Article Title",
  "excerpt": "Brief excerpt of the content",
  "length": 12345,
  "byline": "Author Name",
  "dir": "ltr",
  "siteName": "Website Name",
  "textContent": "Plain text version of the content"
}

Compatibility

Requires n8n version 1.0.0 or later
Uses Mozilla's Readability v0.6.0
Node.js v18.10 or later

Usage

Add the Readability node to your workflow
Connect it to a node that provides HTML content (e.g., HTTP Request)
Specify the JSON property containing the HTML (e.g., 'data' or 'response.body')
Optionally enable "Return Full Response" for additional metadata
Run the workflow to extract clean, readable content

Example Usage

This example shows how to extract readable content from a webpage:

HTTP Request node: Fetch a webpage
Readability node:
- Set "JSON Property" to "data"
- Enable "Return Full Response" if you need metadata
The node will output clean HTML content and metadata

Resources

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.cursor/rules		.cursor/rules
.vscode		.vscode
nodes/Readability		nodes/Readability
.cursorignore		.cursorignore
.cursorindexingignore		.cursorindexingignore
.editorconfig		.editorconfig
.eslintrc.js		.eslintrc.js
.eslintrc.prepublish.js		.eslintrc.prepublish.js
.gitignore		.gitignore
.npmignore		.npmignore
.prettierrc.js		.prettierrc.js
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE.md		LICENSE.md
README.md		README.md
gulpfile.js		gulpfile.js
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

n8n-nodes-html-readability

Installation

Operations

HTML

Extract Content

Compatibility

Usage

Example Usage

Resources

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

TechupBusiness/n8n-nodes-html-readability

Folders and files

Latest commit

History

Repository files navigation

n8n-nodes-html-readability

Installation

Operations

HTML

Extract Content

Compatibility

Usage

Example Usage

Resources

License

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages