
Firecrawl Connector

The Firecrawl connector integrates Agentcy with the Firecrawl web scraping and extraction API. It provides two live tools: one scrapes a web page and returns clean markdown, and the other extracts structured data from a page using LLM-powered extraction.

Authentication

The connector authenticates using a Firecrawl API key from firecrawl.dev.

json
{
  "name": "Firecrawl",
  "source_type": "firecrawl",
  "config": {
    "api_key": "fc-..."
  }
}

| Variable | Required | Description |
| --- | --- | --- |
| api_key | Yes | Firecrawl API key |
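
To keep the key out of version control, the configuration shown above can be generated from an environment variable. The following is a minimal sketch, not part of the connector: the helper `build_firecrawl_config` and the variable name `FIRECRAWL_API_KEY` are assumptions chosen for illustration.

```python
import json
import os


def build_firecrawl_config(api_key: str) -> dict:
    """Return a connector config dict in the shape documented above."""
    if not api_key:
        raise ValueError("api_key is required")
    return {
        "name": "Firecrawl",
        "source_type": "firecrawl",
        "config": {"api_key": api_key},
    }


if __name__ == "__main__":
    # FIRECRAWL_API_KEY is an assumed variable name, not one the connector defines.
    key = os.environ.get("FIRECRAWL_API_KEY", "")
    if key:
        print(json.dumps(build_firecrawl_config(key), indent=2))
    else:
        print("Set FIRECRAWL_API_KEY to generate the config.")
```

The printed JSON can then be pasted wherever the connector configuration is entered.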

Live Tools (2)

| Tool | Description | Arguments |
| --- | --- | --- |
| firecrawl_scrape_url | Scrape a web page and return clean markdown content | url, formats?, wait_for?, timeout? |
| firecrawl_extract | Extract structured data from a URL using LLM-powered extraction | url, schema, prompt? |
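
The table lists only argument names, with a trailing `?` that appears to mark optional arguments. As a rough sketch of how a caller might assemble the argument payloads, the helpers below drop optional values that were not supplied. The helper names and the argument types (a list of strings for `formats`, integers for `wait_for` and `timeout`) are assumptions; the connector may expect different types or units.

```python
from typing import Optional


def scrape_args(
    url: str,
    formats: Optional[list] = None,
    wait_for: Optional[int] = None,
    timeout: Optional[int] = None,
) -> dict:
    """Build arguments for firecrawl_scrape_url, omitting unset optionals."""
    args = {"url": url, "formats": formats, "wait_for": wait_for, "timeout": timeout}
    return {key: value for key, value in args.items() if value is not None}


def extract_args(url: str, schema: dict, prompt: Optional[str] = None) -> dict:
    """Build arguments for firecrawl_extract; url and schema are required."""
    if not schema:
        raise ValueError("schema is required")
    args = {"url": url, "schema": schema}
    if prompt is not None:
        args["prompt"] = prompt
    return args
```

For example, `scrape_args("https://competitor.com/pricing")` yields a payload containing only `url`, which matches the first call in the example that follows.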

Example: Agent Using Firecrawl Tools

User: "Scrape the pricing page of competitor.com"

Agent: I'll scrape the page content.
→ Calls firecrawl_scrape_url(url: "https://competitor.com/pricing")
→ Returns clean markdown content of the pricing page
→ Summarizes: "Competitor offers 3 tiers:
  - Starter: $29/mo (5 users, 10GB)
  - Pro: $99/mo (25 users, 100GB)
  - Enterprise: Custom pricing"

User: "Extract all product names and prices from this catalog page"

Agent: I'll extract structured data.
→ Calls firecrawl_extract(
    url: "https://store.example.com/catalog",
    schema: {"products": [{"name": "string", "price": "number", "in_stock": "boolean"}]},
    prompt: "Extract all product listings with name, price, and availability"
  )
→ Returns structured JSON with extracted products
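
Because extraction is LLM-powered, it can be worth checking the returned JSON against the schema before using it. The sketch below assumes the simplified schema notation from the example above (type names `string`, `number`, `boolean`, with a one-element list meaning "list of"); the connector's actual schema format and response shape are not specified here, and the sample values in the usage note are made up for illustration.

```python
TYPE_CHECKS = {
    "string": lambda v: isinstance(v, str),
    # bool is a subclass of int in Python, so exclude it explicitly.
    "number": lambda v: isinstance(v, (int, float)) and not isinstance(v, bool),
    "boolean": lambda v: isinstance(v, bool),
}


def matches(schema, value) -> bool:
    """Return True if `value` has the shape described by `schema`."""
    if isinstance(schema, dict):
        # Every key in the schema must be present and match.
        return isinstance(value, dict) and all(
            key in value and matches(sub, value[key]) for key, sub in schema.items()
        )
    if isinstance(schema, list):
        # A one-element list means every item must match that element.
        return isinstance(value, list) and all(matches(schema[0], v) for v in value)
    check = TYPE_CHECKS.get(schema)
    if check is None:
        raise ValueError(f"unknown type name: {schema!r}")
    return check(value)
```

With the catalog schema from the example, a result such as `{"products": [{"name": "Widget", "price": 9.5, "in_stock": True}]}` passes, while the same result with `"price": "9.5"` does not.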

ETL Ingestion

This connector does not support ETL ingestion. All tools operate as live API calls to the Firecrawl service.

Configuration Reference

json
{
  "name": "Firecrawl",
  "source_type": "firecrawl",
  "config": {
    "api_key": "fc-..."
  }
}

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| api_key | string | required | Firecrawl API key |

Troubleshooting

| Error | Cause | Fix |
| --- | --- | --- |
| 401 Unauthorized | Invalid API key | Verify your Firecrawl API key |
| 402 Payment Required | Credits exhausted | Top up credits on firecrawl.dev |
| Scrape failed | Target page blocks scraping or uses heavy JavaScript | Add the wait_for parameter to wait for dynamic content |
| Timeout | Page took too long to load | Increase the timeout parameter |
| Extraction failed | LLM could not match the schema to page content | Simplify the schema or add a more specific prompt |
| URL unreachable | Target URL is invalid or returns an error | Verify the URL is accessible |
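
The "increase the timeout" fix can be automated with a small retry loop. This is a sketch under several assumptions: `call_tool` is a hypothetical callable standing in for however your agent invokes a tool, a timeout is assumed to surface as Python's built-in `TimeoutError`, and the starting value and units of `timeout` are illustrative only.

```python
from typing import Any, Callable


def scrape_with_growing_timeout(
    call_tool: Callable[[str, dict], Any],
    url: str,
    timeout: int = 30000,
    max_attempts: int = 3,
) -> Any:
    """Call firecrawl_scrape_url, doubling `timeout` after each timeout."""
    last_error = None
    for _ in range(max_attempts):
        try:
            return call_tool("firecrawl_scrape_url", {"url": url, "timeout": timeout})
        except TimeoutError as exc:  # assumed error type, see lead-in
            last_error = exc
            timeout *= 2
    raise TimeoutError(f"gave up after {max_attempts} attempts") from last_error
```

Doubling keeps the number of retries small; capping `max_attempts` avoids spending credits on a page that never finishes loading.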

Built by AgentcyLabs. For in-house deployment or Agentcy Cloud (PaaS) access, visit agentcylabs.com.