Scrape URL logo

Catalog action Integrations

Scrape a webpage and convert it to clean, LLM-ready markdown or other formats. Handles JavaScript rendering, extracts main content, and returns structured data with metadata. Ideal for: content extraction, RAG data collection, web research, article summarization, data aggregation.

At a Glance

FieldValue
Action IDfirecrawl-scrape-url
CategoryIntegrations
ConnectorNot required
Requires gasNo
Funds movementNone declared
Tagsfirecrawl, scrape, web, content, markdown, extraction, read

Payload Schema

FieldTypeRequiredDescription
urlstringYesThe URL to scrape. Must be a valid HTTP/HTTPS URL.
formatsarrayNoOutput formats to return. Options: 'markdown' (clean text), 'html' (processed HTML), 'rawHtml' (original HTML), 'links' (extracted URLs), 'screenshot' (page image), 'json' (structured data), 'summary' (AI summary), 'images' (image URLs). Default: ['markdown']
onlyMainContentbooleanNoExtract only the main content, excluding headers, navigation, and footers. Default: true
waitFornumberNoTime in milliseconds to wait for JavaScript to render before scraping. Useful for dynamic sites. Range: 0-30000. Deprecated: use actions instead.
actionsarrayNoPage actions to perform before scraping. Use for JavaScript-heavy sites that need interaction before content is visible.
timeoutnumberNoRequest timeout in milliseconds. Default: 30000 (30 seconds). Max: 300000 (5 minutes).

Result Schema

FieldTypeRequiredDescription
successbooleanYesWhether the scrape request was successful
dataobjectYes-

Examples

json
{ "type": "firecrawl-scrape-url", "payload": { "url": "https://example.com/webhook" }, "children": []}

Payload fields can use workflow expressions such as {{$trigger.body.amount}}, {{$nodes.fetch.result.price}}, and {{$props.asset}} when the value should come from a trigger, prior node, or reusable workflow prop.

Ask a question... ⌘I