Ferret — Parsers Overview
313 parsers, build OK, 30+ categories, 10GB RAM libre Updated: June 5, 2026
Summary
| Metric | Value |
|---|---|
| Total parsers | 313 |
| 🇫🇷 France | 44 (data.gouv.fr, national APIs) |
| 🇬🇧 UK / 🇯🇵 Japan / 🇨🇦 Canada | 3 CKAN portals |
| 🌍 International & APIs | ~266 |
| ✅ Benchmark OK | 204/248 (82%) |
| Build | ✅ 0 errors |
| Free RAM | 10 GB |
| Obscura headless | 46MB (4 instances) |
| 597 User-Agents | ✅ rotation |
| 1446 CMS signatures | ✅ imported |
| A-Parser backup | ✅ v1.2.3263 |
Parser Categories
| Category | Count | Examples |
|---|---|---|
| 🌐 Web Search | ~20 | Bing, DDG, Brave, Google, Yahoo, Yandex, Baidu, AOL… |
| 🛒 E-commerce | ~12 | Amazon, Steam, Booking, eBay, Aliexpress, Wildberries… |
| 💰 Finance & Crypto | ~15 | Yahoo Finance, Binance, Kraken, CoinPaprika, Fear&Greed… |
| 🇫🇷 France | ~12 | data.gouv.fr, Communes, Adresse, Carburants, Députés… |
| 📱 Social | ~15 | Reddit, Instagram, TikTok, Pinterest, Lemmy… |
| 📊 SEO | ~15 | Whois, DNS, CMS (1446 sig.), Cache, Position, Audit… |
| 💻 Dev | ~12 | GitHub, PyPI, Crates.io, OpenRouter, MDN, OpenAPI… |
| 🎵 Music & Culture | ~8 | Deezer, MusicBrainz, TVMaze, Anime… |
| 🌤️ Weather | ~5 | Open-Meteo, NOAA, Air Quality, wttr.in… |
| 🚀 DevOps | ~5 | Docker Hub, NPM, GitLab, GitHub Status… |
| 🎮 Games | ~5 | Steam, BoardGameGeek, Pokémon… |
| 🎨 Art | ~6 | Met Museum, Art Institute, Cleveland… |
| 🔬 Science | ~8 | PubChem, USGS, ISS, SpaceX, CVE… |
| 🌍 Travel | ~4 | OpenTripMap, SBB, CityBikes… |
| 🚗 Vehicles | ~3 | NHTSA Recalls, OpenSky… |
| 📰 News | ~6 | Hacker News, Spaceflight, Google News, Bing News… |
| 🎯 Utilities | ~15 | GeoIP, Currency, QR Code, Whois, DNS… |
France — 44 parsers
| Category | Parsers |
|---|---|
| Government | 6 (datasets, orgs, stats, topics, search, reuses) |
| Geo | 6 (communes, départements, régions, adresse, reverse, CP) |
| Economy | 4 (DVF 50 items, immobilier, loyers, SIRENE) |
| Business | 2 (API Entreprises, RNA associations) |
| Health | 3 (médicaments ANSM, médecins RPPS, eau qualité) |
| Transport | 3 (bornes électriques, covoiturage, gares SNCF) |
| Education | 2 (établissements, calendrier scolaire) |
| Culture | 2 (musées, monuments) |
| Services | 4 (services publics, bibliothèques, toilettes, parkings) |
| Environment | 4 (air, risques, arbres, eau potable) |
| Misc | 8 (carburants, décès, députés, élections, sport, marchés, ZRR, search) |
International — New parsers
| Type | Endpoint | Status |
|---|---|---|
| UK data.gov.uk | /data/uk?q=population | ✅ 10 datasets |
| Japan data.go.jp | /data/japan?q=population | ✅ 10 datasets |
| Canada open.canada.ca | /data/canada?q=economy | ✅ 10 datasets |
| CKAN generic | /data/ckan?host=...&q=... | ✅ Any CKAN portal |
| Spain datos.gob.es | /data/spain?q=economia | ⚠️ Non-standard API |
| Ireland data.gov.ie | /data/ireland?q=population | ⚠️ Different path |
| Eurostat GDP | /economy/gdp/eurostat?q=FR | ✅ 12 items |
| Eurostat Population | /economy/population/eurostat?q=FR | ✅ 12 items |
| BBC News | /news/bbc?q=technology | ✅ 38 articles |
| PeerTube | /videos/peertube?q=rust | ✅ 20 videos |
| Internet Archive | /search/archive-org?q=rust | ✅ 10 results |
| OpenDataSoft | /data/opendatasoft?q=population | ✅ 11 datasets |
| WHO Health | /health/who | ✅ 30 indicators |
| SWAPI (Star Wars) | /starwars?q=luke | ✅ 1 character |
| Open Trivia | /quiz?q=general | ✅ 10 questions |
| AudioDB | /music/artist/audiodb?q=beatles | ✅ 1 artist |
| IPify | /ipify | ✅ Public IP |
| IP-API.com | /geo/ip-api?q=8.8.8.8 | ✅ Geolocation |
| Zippopotamus | /geo/zip/us?q=10001 | ✅ US ZIP code |
| Nationalize | /names/nationalize?q=john | ✅ 6 countries |
| Genderize | /names/genderize?q=john | ✅ Prediction |
| Meow Facts | /cats/facts/meow | ✅ Cat facts |
Infrastructure
- Export CSV (
?format=csv) - Batch mode (
POST /batch) - Standard pagination (
pagecount) - LRU cache + optional Redis
- Auto-tuning timeout (EMA)
- Obscura headless pool (46MB)
- Gateway MCP API key
- Auto-detected benchmark (bench.py)