HTML Extract

processing.html-extract Processing v0.1.0

Extracts values from an HTML string with CSS selectors — text, inner HTML, or an attribute per rule, first match or all matches. Pair with HTTP Request to scrape a page (this node does not fetch).

Ports

DirectionPortLabel
InputinputHTML
OutputoutputExtracted

Properties

PropertyTypeDefaultDescription
Source field
sourceField
string "" Dot-path to the HTML string on the input. Leave blank when the payload itself is the HTML.
Extractions
extractions
rows [{"field":"","selector":"","attr":"","multiple":false}] One output key per row: a CSS selector (tag, *, #id, .class, [attr], [attr=v], ^= $= *=, compounds, descendant, >, comma groups, :first-of-type, :nth-of-type(n)), what to take (text, html, or an attribute name), and first match or all matches.
Trim values
trim
boolean true Trim whitespace around extracted values.

This page is generated from the node registry by gen-node-docs.mjs on every site build — ports, properties, defaults and visibility rules cannot drift from the code.