This is an n8n community node. It lets you use Mozilla's Readability in your n8n workflows.
Mozilla's Readability is a standalone version of the algorithm used by Firefox Reader View to extract the main content from web pages, removing clutter and providing clean, readable text.
n8n is a fair-code licensed workflow automation platform.
Installation
Operations
Compatibility
Usage
Resources
Follow the installation guide in the n8n community nodes documentation.
Extracts the main content from HTML, removing navigation, ads, and other distracting elements.
Options:
| Parameter | Type | Description |
|---|---|---|
| JSON Property | String | The property containing the HTML content to parse. Supports both dot notation (e.g., 'solution.response') and expressions |
| Continue on Error | Boolean | Whether to continue execution when the node encounters an error |
| Return Full Response | Boolean | Whether to return the full Readability response including metadata |
Output:
Default output includes:
{
"content": "<div>...extracted HTML content...</div>",
"title": "Article Title",
"excerpt": "Brief excerpt of the content"
}With "Return Full Response" enabled, additional fields are included:
{
"content": "<div>...extracted HTML content...</div>",
"title": "Article Title",
"excerpt": "Brief excerpt of the content",
"length": 12345,
"byline": "Author Name",
"dir": "ltr",
"siteName": "Website Name",
"textContent": "Plain text version of the content"
}- Requires n8n version 1.0.0 or later
- Uses Mozilla's Readability v0.6.0
- Node.js v18.10 or later
- Add the Readability node to your workflow
- Connect it to a node that provides HTML content (e.g., HTTP Request)
- Specify the JSON property containing the HTML (e.g., 'data' or 'response.body')
- Optionally enable "Return Full Response" for additional metadata
- Run the workflow to extract clean, readable content
This example shows how to extract readable content from a webpage:
- HTTP Request node: Fetch a webpage
- Readability node:
- Set "JSON Property" to "data"
- Enable "Return Full Response" if you need metadata
- The node will output clean HTML content and metadata
