Notre avis
Firecrawl Property Extraction extrait des données structurées de propriétés à partir d'URLs de sites immobiliers (Zillow, Redfin, etc.) en utilisant Firecrawl AI.
Points forts
- Rendu JavaScript et contournement des anti-bots
- Schéma Zod pour validation des données
- Intégration API simple avec Next.js
- Analyse concurrentielle intégrée
Limites
- Limité à 10 requêtes/minute sur le plan standard
- Dépend de la qualité des données des sites sources
- Ne fonctionne pas avec tous les sites de courtiers
Lorsque vous devez extraire des données immobilières de plusieurs listes pour générer des vidéos ou faire des analyses de marché.
Pour des besoins de scraping simples sans validation ou transformation, ou si vous utilisez une source avec une API officielle.
Analyse de sécurité
SûrThe skill describes using a third-party API (Firecrawl) for legitimate data extraction, with no destructive, exfiltrating, or obfuscated actions. Code snippets correctly use environment variables for API keys, and there is no instruction to execute arbitrary bash commands or bypass safety measures.
Aucun point d'attention détecté
Exemples
Extract property details from this Zillow listing: https://www.zillow.com/homedetails/123-Main-St-New-York-NY-10001/123456789_zpid/Scrape competitor listings from Redfin in San Francisco to compare prices, days on market, and price per sqft.Extract data from these three Realtor.com listings and output as JSON: [list URLs]name: firecrawl-scraper description: Extract property data from real estate listing URLs using Firecrawl AI. Use when scraping Zillow, Redfin, Realtor.com, or any property listing site. Returns structured data ready for video generation. allowed-tools: Read, Grep, Glob, Bash
Firecrawl Property Extraction
Overview
Firecrawl transforms any real estate listing URL into structured JSON data for video generation. It handles JavaScript rendering, anti-bot measures, and image extraction automatically.
Quick Start
import Firecrawl from '@mendable/firecrawl-js';
import { z } from 'zod';
const firecrawl = new Firecrawl({
apiKey: process.env.FIRECRAWL_API_KEY
});
const result = await firecrawl.extract({
urls: [listingUrl],
prompt: 'Extract property details for video generation',
schema: PropertySchema
});
Supported Sites
| Site | URL Pattern | Data Quality | |------|-------------|--------------| | Zillow | zillow.com/homedetails/* | Excellent | | Redfin | redfin.com//home/ | Excellent | | Realtor.com | realtor.com/realestateandhomes-detail/* | Excellent | | Trulia | trulia.com/home/* | Good | | Homes.com | homes.com/property/* | Good | | MLS Sites | Varies by region | Good | | Broker Sites | Any | Variable |
Property Schema
See rules/property-extraction.md for complete schema.
const PropertySchema = z.object({
address: z.string(),
city: z.string(),
state: z.string(),
zipCode: z.string(),
price: z.number(),
bedrooms: z.number(),
bathrooms: z.number(),
sqft: z.number(),
lotSize: z.string().optional(),
yearBuilt: z.number().optional(),
propertyType: z.string(),
description: z.string(),
features: z.array(z.string()),
images: z.array(z.string()),
agent: z.object({
name: z.string(),
phone: z.string().optional(),
brokerage: z.string().optional(),
}).optional(),
});
Advanced Extraction
Competitor Analysis
const CompetitorSchema = z.object({
listings: z.array(z.object({
address: z.string(),
price: z.number(),
daysOnMarket: z.number(),
pricePerSqft: z.number(),
})),
marketTrends: z.object({
medianPrice: z.number(),
averageDaysOnMarket: z.number(),
inventoryCount: z.number(),
}),
});
Market Data
Best Practices
- Rate Limiting: Max 10 requests/minute on standard plan
- Error Handling: Always wrap in try/catch
- Image Quality: Request high-res images when available
- Caching: Cache results for 24 hours to save credits
- Validation: Always validate extracted data with Zod
API Integration
Next.js Route Handler
// /app/api/scrape/route.ts
export async function POST(request: Request) {
const { url } = await request.json();
const firecrawl = new Firecrawl({
apiKey: process.env.FIRECRAWL_API_KEY!
});
const result = await firecrawl.extract({
urls: [url],
prompt: 'Extract property listing data',
schema: PropertySchema,
});
return Response.json({
success: true,
data: result.data
});
}
Credit Usage
| Operation | Credits | |-----------|---------| | /scrape (single page) | 1 | | /crawl (per page) | 1 | | /extract (AI) | Tokens-based | | /map (URL discovery) | 1 per 100 URLs |
Error Handling
try {
const result = await firecrawl.extract({ ... });
} catch (error) {
if (error.statusCode === 429) {
// Rate limited - wait and retry
} else if (error.statusCode === 403) {
// Site blocked - try alternative approach
} else {
// Log and return fallback
}
}
Ingénierie de Prompts
Data & IA
Bonnes pratiques et templates de prompt engineering pour maximiser les résultats IA.
Visualisation de Données
Data & IA
Génère des visualisations de données et graphiques adaptés à vos données.
Architecture RAG
Data & IA
Guide de configuration d'architectures RAG (Retrieval-Augmented Generation).