Our review
Firecrawl Property Extraction extracts structured real estate data from listing URLs (Zillow, Redfin, etc.) using Firecrawl AI.
Strengths
- Handles JavaScript rendering and anti-bot measures
- Zod schema for data validation
- Simple API integration with Next.js
- Built-in competitor analysis
Limitations
- Rate limited to 10 requests per minute on standard plan
- Depends on source site data quality
- Not all broker sites are supported
When you need to extract property data from multiple listings for video generation or market analysis.
For simple scraping tasks without validation or transformation, or when an official API is available.
Security analysis
SafeThe skill describes using a third-party API (Firecrawl) for legitimate data extraction, with no destructive, exfiltrating, or obfuscated actions. Code snippets correctly use environment variables for API keys, and there is no instruction to execute arbitrary bash commands or bypass safety measures.
No concerns found
Examples
Extract property details from this Zillow listing: https://www.zillow.com/homedetails/123-Main-St-New-York-NY-10001/123456789_zpid/Scrape competitor listings from Redfin in San Francisco to compare prices, days on market, and price per sqft.Extract data from these three Realtor.com listings and output as JSON: [list URLs]name: firecrawl-scraper description: Extract property data from real estate listing URLs using Firecrawl AI. Use when scraping Zillow, Redfin, Realtor.com, or any property listing site. Returns structured data ready for video generation. allowed-tools: Read, Grep, Glob, Bash
Firecrawl Property Extraction
Overview
Firecrawl transforms any real estate listing URL into structured JSON data for video generation. It handles JavaScript rendering, anti-bot measures, and image extraction automatically.
Quick Start
import Firecrawl from '@mendable/firecrawl-js';
import { z } from 'zod';
const firecrawl = new Firecrawl({
apiKey: process.env.FIRECRAWL_API_KEY
});
const result = await firecrawl.extract({
urls: [listingUrl],
prompt: 'Extract property details for video generation',
schema: PropertySchema
});
Supported Sites
| Site | URL Pattern | Data Quality | |------|-------------|--------------| | Zillow | zillow.com/homedetails/* | Excellent | | Redfin | redfin.com//home/ | Excellent | | Realtor.com | realtor.com/realestateandhomes-detail/* | Excellent | | Trulia | trulia.com/home/* | Good | | Homes.com | homes.com/property/* | Good | | MLS Sites | Varies by region | Good | | Broker Sites | Any | Variable |
Property Schema
See rules/property-extraction.md for complete schema.
const PropertySchema = z.object({
address: z.string(),
city: z.string(),
state: z.string(),
zipCode: z.string(),
price: z.number(),
bedrooms: z.number(),
bathrooms: z.number(),
sqft: z.number(),
lotSize: z.string().optional(),
yearBuilt: z.number().optional(),
propertyType: z.string(),
description: z.string(),
features: z.array(z.string()),
images: z.array(z.string()),
agent: z.object({
name: z.string(),
phone: z.string().optional(),
brokerage: z.string().optional(),
}).optional(),
});
Advanced Extraction
Competitor Analysis
const CompetitorSchema = z.object({
listings: z.array(z.object({
address: z.string(),
price: z.number(),
daysOnMarket: z.number(),
pricePerSqft: z.number(),
})),
marketTrends: z.object({
medianPrice: z.number(),
averageDaysOnMarket: z.number(),
inventoryCount: z.number(),
}),
});
Market Data
Best Practices
- Rate Limiting: Max 10 requests/minute on standard plan
- Error Handling: Always wrap in try/catch
- Image Quality: Request high-res images when available
- Caching: Cache results for 24 hours to save credits
- Validation: Always validate extracted data with Zod
API Integration
Next.js Route Handler
// /app/api/scrape/route.ts
export async function POST(request: Request) {
const { url } = await request.json();
const firecrawl = new Firecrawl({
apiKey: process.env.FIRECRAWL_API_KEY!
});
const result = await firecrawl.extract({
urls: [url],
prompt: 'Extract property listing data',
schema: PropertySchema,
});
return Response.json({
success: true,
data: result.data
});
}
Credit Usage
| Operation | Credits | |-----------|---------| | /scrape (single page) | 1 | | /crawl (per page) | 1 | | /extract (AI) | Tokens-based | | /map (URL discovery) | 1 per 100 URLs |
Error Handling
try {
const result = await firecrawl.extract({ ... });
} catch (error) {
if (error.statusCode === 429) {
// Rate limited - wait and retry
} else if (error.statusCode === 403) {
// Site blocked - try alternative approach
} else {
// Log and return fallback
}
}
Prompt Engineering
Data & AI
Prompt engineering best practices and templates to maximize AI outputs.
Data Visualization
Data & AI
Generates data visualizations and charts tailored to your data.
RAG Architecture Setup
Data & AI
Setup guide for RAG (Retrieval-Augmented Generation) architectures.