HTML Content Extractor - Burp Suite Extension

This is a Burp Suite Extension that applies CSS selectors to extract and analyze specific parts of HTML content directly from the HTTP message viewer. With the power of CSS selectors, users can target elements, attributes, and nested structures in the HTML document, enabling precise and efficient content analysis during security assessments.

For a quick demonstration of how this extension works, check out the video below:

Features

Real-time HTML content extraction using CSS selectors
Support for three extraction modes:
- Complete elements (outer HTML)
- Inner content only (inner HTML)
- Specific attribute values
Efficient analysis of potential security issues like XSS vectors and hidden fields
Integration with Burp's HTTP message viewer for seamless workflow
Element count and status feedback
Powered by jsoup for reliable HTML parsing

Usage

Load the extension in Burp Suite
Intercept HTTP traffic or browse through your proxy history
Select a response containing HTML content
Switch to the "HTML Content Extractor" tab
Enter your CSS selector with an optional prefix to control the output type:

Selector Prefixes

@outer: - Get complete elements with their tags (default if no prefix)

@outer:input[type=hidden]  -> Shows complete hidden input tags
@outer:form               -> Shows complete form elements

@inner: - Get only the contents inside elements

@inner:div.content       -> Shows only the content inside div
@inner:form             -> Shows only the form contents

@attr:name: - Get specific attribute values

@attr:href:a           -> Gets all link URLs
@attr:value:input      -> Gets all input values
@attr:class:div        -> Gets all div class names
@attr:src:img          -> Gets all image sources

Example Selectors

Find hidden inputs: @outer:input[type=hidden]
Extract form contents: @inner:form
Get all link URLs: @attr:href:a
Find input values: @attr:value:input[type=text]
Get class names: @attr:class:div.content

Build

$ gradle fatJar

The extension will be built as a JAR file in build/libs/html-content-extractor-all.jar

Requirements

Burp Suite Professional or Community Edition (2023.1 or later)
Java 8 or later

Credits

HTML Content Extractor relies on the following libraries:

jsoup - For HTML parsing and CSS selector support
Burp Extender API - For Burp Suite integration

License

This project is licensed under the MIT License - see the LICENSE file for details.

Inspired by

Burp-JQ

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
burp		burp
img		img
.gitignore		.gitignore
BappDescription.html		BappDescription.html
BappManifest.bmf		BappManifest.bmf
README.md		README.md
build.gradle		build.gradle
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HTML Content Extractor - Burp Suite Extension

Features

Usage

Selector Prefixes

Example Selectors

Build

Requirements

Credits

License

Inspired by

About

Uh oh!

Releases

Packages

Languages

ceylanb/html-content-extractor

Folders and files

Latest commit

History

Repository files navigation

HTML Content Extractor - Burp Suite Extension

Features

Usage

Selector Prefixes

Example Selectors

Build

Requirements

Credits

License

Inspired by

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages