Firecrawl
Firecrawl is a web scraping and content extraction API. Its Agent Forge integration lets developers extract clean, structured content from websites and convert web pages into usable formats such as Markdown and HTML while preserving the main content.
With Firecrawl in Agent Forge, you can:
Extract clean content: Remove ads, navigation elements, and other distractions to get just the main content
Convert to structured formats: Transform web pages into Markdown, HTML, or JSON
Capture metadata: Extract SEO metadata, Open Graph tags, and other page information
Handle JavaScript-heavy sites: Process content from modern web applications that rely on JavaScript
Filter content: Focus on specific parts of a page using CSS selectors
Process at scale: Handle high-volume scraping needs with a reliable API
Search the web: Perform intelligent web searches and retrieve structured results
Crawl entire sites: Crawl multiple pages from a website and aggregate their content
In Agent Forge, the Firecrawl integration enables your agents to access and process web content programmatically as part of their workflows. Supported operations include:
Scrape: Extract structured content (Markdown, HTML, metadata) from a single web page.
Search: Search the web for information using Firecrawl's intelligent search capabilities.
Crawl: Crawl multiple pages from a website, returning structured content and metadata for each page.
This lets your agents gather information from websites, extract structured data, and use it to make decisions or generate insights, without parsing raw HTML or automating a browser. To get started, configure the Firecrawl block with your API key, select the operation (Scrape, Search, or Crawl), and provide the relevant parameters. The block returns web content in a clean, structured format.
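As a rough illustration of the configuration step, the sketch below checks that the required inputs for the selected operation are present before an agent runs the block. The `REQUIRED_INPUTS` mapping mirrors the inputs marked as required on this page; the `missing_inputs` helper is hypothetical and is not part of Agent Forge or Firecrawl.

```python
from typing import Any

# Required inputs per operation, as listed in the Input sections of this page.
REQUIRED_INPUTS: dict[str, tuple[str, ...]] = {
    "scrape": ("url", "apiKey"),
    "search": ("query", "apiKey"),
    "crawl": ("url", "apiKey"),
}


def missing_inputs(operation: str, params: dict[str, Any]) -> list[str]:
    """Return the required input names that are absent or empty (hypothetical helper)."""
    if operation not in REQUIRED_INPUTS:
        raise ValueError(f"unknown operation: {operation!r}")
    return [name for name in REQUIRED_INPUTS[operation] if not params.get(name)]
```

Running a check like this before the block executes surfaces a missing API key or URL early, rather than as a failed request.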
Usage Instructions
Scrape content from a web page, crawl a website, or search the web for information. Each operation returns clean, structured data, with options to focus on a page's main content.
Tools
Scrape
Extract structured content from web pages with comprehensive metadata support. This tool converts content to Markdown or HTML while capturing SEO metadata, Open Graph tags, and page information.
Input
url (string, required): The URL to scrape content from
scrapeOptions (json, optional): Options for content scraping
apiKey (string, required): Firecrawl API key
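A minimal sketch of assembling the Scrape inputs follows. The top-level names (`url`, `scrapeOptions`, `apiKey`) come from the inputs listed above. The keys inside `scrapeOptions` (`formats`, `onlyMainContent`) are assumptions based on Firecrawl's public scrape API; this page does not list which options the block accepts. The `build_scrape_inputs` helper is hypothetical.

```python
from typing import Any
from urllib.parse import urlparse


def build_scrape_inputs(
    url: str,
    api_key: str,
    only_main_content: bool = True,
    formats: tuple[str, ...] = ("markdown",),
) -> dict[str, Any]:
    """Assemble inputs for the Scrape operation (hypothetical helper)."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("url must be an absolute http(s) URL")
    return {
        "url": url,
        "apiKey": api_key,
        # Assumed option names; check them against the Firecrawl API reference.
        "scrapeOptions": {
            "formats": list(formats),
            "onlyMainContent": only_main_content,
        },
    }
```

Validating the URL up front is a design choice made for this sketch; the block itself may apply its own validation.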
Output
markdown (string): Page content markdown
html (any): Raw HTML content
metadata (json): Page metadata
data (json): Search results data
warning (any): Warning messages
pages (json): Crawled pages data
total (number): Total pages found
creditsUsed (number): Credits consumed
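To show how an agent step might consume a Scrape result, here is a small sketch that reads the `markdown`, `metadata`, and `warning` outputs listed above. It assumes the output arrives as a dictionary and that `metadata` may carry a `title` key; neither detail is stated on this page, and `summarize_scrape_output` is a hypothetical helper.

```python
from typing import Any


def summarize_scrape_output(output: dict[str, Any]) -> dict[str, Any]:
    """Reduce a Scrape output to a few fields useful for downstream steps."""
    metadata = output.get("metadata") or {}
    markdown = output.get("markdown") or ""
    return {
        # "title" is an assumed metadata key; it is None when absent.
        "title": metadata.get("title"),
        "word_count": len(markdown.split()),
        "has_warning": bool(output.get("warning")),
    }
```

Treating every field as optional keeps the step from failing when an output is empty or missing.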
Search
Search for information on the web using Firecrawl.
Input
query (string, required): The search query to use
apiKey (string, required): Firecrawl API key
Output
markdown (string): Page content markdown
html (any): Raw HTML content
metadata (json): Page metadata
data (json): Search results data
warning (any): Warning messages
pages (json): Crawled pages data
total (number): Total pages found
creditsUsed (number): Credits consumed
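The sketch below shows one way to pull unique URLs out of a Search result. It assumes `data` is a list of objects that each carry a `url` field; this page only describes `data` as "Search results data", so treat that shape as an assumption. `result_urls` is a hypothetical helper.

```python
from typing import Any


def result_urls(output: dict[str, Any]) -> list[str]:
    """Collect unique result URLs from a Search output, preserving order."""
    seen: set[str] = set()
    urls: list[str] = []
    for item in output.get("data") or []:
        # Skip entries that do not match the assumed {"url": ...} shape.
        url = item.get("url") if isinstance(item, dict) else None
        if url and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

A list like this could then feed the Scrape operation, one URL at a time.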
Crawl
Crawl entire websites and extract structured content from all accessible pages.
Input
url (string, required): The website URL to crawl
limit (number, optional): Maximum number of pages to crawl (default: 100)
onlyMainContent (boolean, optional): Extract only main content from pages
apiKey (string, required): Firecrawl API key
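As an illustration of the Crawl inputs, the following sketch assembles them with the documented default `limit` of 100. The input names come from the list above; the `build_crawl_inputs` helper and its validation rule are hypothetical.

```python
from typing import Any


def build_crawl_inputs(
    url: str,
    api_key: str,
    limit: int = 100,
    only_main_content: bool = True,
) -> dict[str, Any]:
    """Assemble inputs for the Crawl operation (hypothetical helper)."""
    # Reject booleans explicitly, since bool is a subclass of int in Python.
    if isinstance(limit, bool) or not isinstance(limit, int) or limit < 1:
        raise ValueError("limit must be a positive integer")
    return {
        "url": url,
        "apiKey": api_key,
        "limit": limit,
        "onlyMainContent": only_main_content,
    }
```

Setting a small `limit` while testing a workflow is a simple way to keep credit usage down.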
Output
markdown (string): Page content markdown
html (any): Raw HTML content
metadata (json): Page metadata
data (json): Search results data
warning (any): Warning messages
pages (json): Crawled pages data
total (number): Total pages found
creditsUsed (number): Credits consumed
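Aggregating crawled content could look roughly like the sketch below. It assumes `pages` is a list of objects that each have a `markdown` field, which this page does not spell out, and it compares `total` with the number of returned pages to estimate how many were left out. Both helpers are hypothetical.

```python
from typing import Any


def combine_pages(output: dict[str, Any], separator: str = "\n\n---\n\n") -> str:
    """Join the Markdown of every crawled page into one document."""
    parts: list[str] = []
    for page in output.get("pages") or []:
        # Assumed page shape: {"markdown": "...", ...}
        text = (page.get("markdown") or "").strip() if isinstance(page, dict) else ""
        if text:
            parts.append(text)
    return separator.join(parts)


def pages_not_returned(output: dict[str, Any]) -> int:
    """Estimate how many found pages are absent from `pages`."""
    returned = len(output.get("pages") or [])
    return max(int(output.get("total") or 0) - returned, 0)
```

If `pages_not_returned` is above zero, raising the `limit` input is one thing to try.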
Notes
Category: tools
Type: firecrawl
