Self-Hosting Firecrawl with LLM-Powered Web Scraping

Firecrawl May 12, 2026

Last month I replaced my SearXNG instance with Firecrawl — a self-hosted web scraping platform that can do far more than just search. Here's how I set it up with OpenRouter as the LLM backend, what works, what doesn't, and why I made the switch.

Why Firecrawl?

I was running SearXNG at 192.168.100.32:8181 for web search. It worked, but it was just search — no content extraction, no structured data, no JavaScript rendering. Firecrawl adds:

JS rendering — crawls SPAs and dynamic sites
LLM-powered extraction — structured JSON from any page
Crawl, map, and batch operations — beyond single-page scraping
Same self-hosted model — no API costs for scraping itself

Architecture

Hermes Agent → Firecrawl API (192.168.100.32:33002) → OpenRouter (LLM backend)

Firecrawl handles the scraping. When I need AI extraction (summaries, structured data), it calls OpenRouter. The scraping is local and free; only the LLM calls cost money.

Setting Up the LLM Backend

Firecrawl v2.9.0 uses Vercel's AI SDK (@ai-sdk/openai), which hardcodes the Responses API (/responses). This is critical — providers that only support /chat/completions will fail.

Docker Compose Configuration

services:
  api:
    environment:
      - OPENAI_BASE_URL=https://openrouter.ai/api/v1
      - OPENAI_API_KEY=sk-or-v1-...

What Works and What Doesn't

OpenRouter — ✅ Working (supports /responses)
OpenAI direct — ✅ Working (native support)
GLM/Z.ai — ❌ Broken (only /chat/completions)

The GLM failure was frustrating — I tried both api.z.ai and open.bigmodel.cn endpoints, but Firecrawl constructs /responses URLs that these providers don't recognize.

Verified Capabilities

Core Scraping (No LLM Required)

POST /v1/scrape — Single page to markdown
POST /v1/crawl — Recursive site crawling
POST /v1/map — URL discovery
POST /v1/search — Web search
POST /v1/batch/scrape — Async multi-URL

LLM-Powered Extraction

Firecrawl can extract structured data using a JSON schema. I tested this on several sites and got back structured product data with TAM estimates — all without writing a custom parser.

What Doesn't Work

question format — "Query generation failed after all models"
highlights format — Same error
/v1/extract (batch) — Deprecated, never completes
/v1/agent — 500 error

I stick to json format with explicit schemas.

SPA Handling

One pleasant surprise: Firecrawl handles JavaScript-rendered sites well. For a Next.js site I tested, the map returned empty because there's no <a href> in the initial HTML. But crawl executed JavaScript and found routes like /signup, /login, /forgot-password.

Troubleshooting

"Failed to parse URL from /responses" — Cause: OPENAI_BASE_URL is missing. Fix: Set valid base URL. Even empty value causes Firecrawl to call https:///responses.

"token expired or incorrect" (401) — Cause: API key rejected. Fix: Verify OPENAI_API_KEY is set in Docker Compose environment section, not just .env file.

Key Takeaways

OpenRouter is the practical choice for Firecrawl's LLM backend — it supports the Responses API.
Use json format with schemas for structured extraction.
Crawl beats map for SPAs — JS execution finds routes that static analysis misses.
Self-hosted means no scraping costs — you only pay for LLM extraction when you use it.

Firecrawl isn't perfect — some endpoints are broken, the AI SDK dependency is restrictive — but for self-hosted web scraping with optional AI extraction, it's the best tool I've found.

Recommended for you

AI Agents

Why Loading All 97 Skills Into Every Prompt Doesn't Scale

2 months ago • 5 min read

AI Agents

모든 프롬프트에 97개 스킬을 전부 로드하는 건 토큰 낭비다

2 months ago • 10 min read

Arctic

북극 항로에 디지털 인프라를 구축한다 — Arctic Ventures

2 months ago • 10 min read

PSR Ice Mining Economics: Analyzing the Lunar South Pole's Hidden Asset

달 남극 영구그림자구역(PSR)의 얼음 채굴 경제성 분석 (ko)

Moon Mining Commercialization Roadmap 2026-2035: Who, When, and How?

2026-2035 달 채굴 상용화 로드맵: 누가, 언제, 어떻게? (ko)

Self-Hosting Firecrawl with LLM-Powered Web Scraping

Why Firecrawl?

Architecture