Firecrawl v1.13.0 Changelog
- Added AU, FR, DE to Stealth Mode
- Crawl subdomains with allowSubdomains
- Google Slides scraping
- Generate a PDF of the current page. See docs
- Higher res screenshots with quality param
- Weekly view for usage on the dashboard
- Fireplexity Example (Open Source Perplexity). See repo
- And more!
Features
- Support cookies in scrapeURL via safeFetch
- Add Google Slides scraping
- Add PDF actions to API, Python & JS SDKs
- Screenshot quality config
- parsePDF support in Python & JS SDKs
- Allow international URLs
- Enable subdomain crawling in SDKs
- Worker liveness check before job queueing
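As a rough illustration of how the options in this list might appear in request bodies, here is a minimal sketch. Only the names allowSubdomains, parsePDF, and the screenshot quality parameter come from this changelog. The helper functions, the "actions" list, and the "type" keys are assumptions made for illustration, so check the API docs for the exact request shape.

```python
from typing import Any, Dict, List, Optional


def build_crawl_payload(url: str, allow_subdomains: bool = False) -> Dict[str, Any]:
    """Hypothetical helper: body for a crawl request.

    allowSubdomains is the option named in the changelog.
    """
    payload: Dict[str, Any] = {"url": url}
    if allow_subdomains:
        payload["allowSubdomains"] = True
    return payload


def build_scrape_payload(
    url: str,
    parse_pdf: Optional[bool] = None,
    screenshot_quality: Optional[int] = None,
    want_pdf: bool = False,
) -> Dict[str, Any]:
    """Hypothetical helper: body for a scrape request.

    parsePDF and quality are named in the changelog. Placing quality on a
    screenshot action and requesting a PDF via {"type": "pdf"} are assumptions.
    """
    payload: Dict[str, Any] = {"url": url}
    if parse_pdf is not None:
        payload["parsePDF"] = parse_pdf

    actions: List[Dict[str, Any]] = []
    if screenshot_quality is not None:
        actions.append({"type": "screenshot", "quality": screenshot_quality})
    if want_pdf:
        actions.append({"type": "pdf"})
    if actions:
        payload["actions"] = actions
    return payload
```

Calling build_crawl_payload("https://example.com", allow_subdomains=True) yields a body with allowSubdomains set to true, which could then be serialized with json.dumps and sent with any HTTP client.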
Improvements & Fixes
- Removed old cache systems (Redis, PDF cache, etc.)
- Enhanced error messages (TLS, DNS)
- Improved job count query & error handling
- Fixed pdf, credits scope, ignoreInvalidURLs bugs
- Reduced over-logging in concurrency limits
- Added special job expiration bypass for Faire team
- Unified fallback logic in JS SDK
What's Changed
- Remove old cache mechanisms (redis cache, PDF cache, crawl maps, etc.) (FIR-2266) by @mogery in #1667
- feat(scrapeURL): support cookies in safeFetch by @mogery in #1688
- fix(search): respect parsePDF in pricing by @mogery in #1690
- feat(scrape): support Google Slides by @meetsoni1214 in #1693
- fix(api): CI by @mogery in #1692
- fix(api): improve error logging with structured error object by @mogery in #1697
- fix(api): enhance error handler with optional ACUC data by @mogery in #1698
- fix(api): handle errors better in redis-less crawl status by @mogery in #1699
- feat(api): optimize job count query and improve error handling by @mogery in #1700
- fix(api/v1/crawl/ongoing): only crawls, no batch scrape by @mogery in #1701
- feat(api): pdf action + housekeeping by @mogery in #1702
- fix(v1): check credits variable scope collision by @mogery in #1703
- fix(api): pdf bug + testing bugs by @mogery in #1704
- fix(api/batch/scrape): maxConcurrency field support when using ignoreInvalidURLs by @mogery in #1705
- fix(api): instantiate Storage only once by @mogery in #1706
- feat(api/ci): idmux by @mogery in #1707
- fix(concurrency-limit): scan instead of taking jobs by @mogery in #1708
- fix(concurrency-limit): overlogging by @mogery in #1709
- fix(api): cached acuc didn't have the is_extract flag set (ENG-2468) by @mogery in #1712
- fix(js-sdk/extract): use same zod fallback logic by @mogery in #1711
- Add temporary exception for Faire team ID to bypass job expiration by @devin-ai-integration in #1716
- feat(python-sdk): add parsePDF parameter support by @devin-ai-integration in #1713
- Allow international URLs to pass validation by @micahstairs in #1717
- feat: Screenshot quality by @nickscamara in #1721
- feat(sdk/python): add pdf action (ENG-2515) by @mogery in #1722
- feat(scrapeURL/skipTlsVerification): improve error message by @mogery in #1723
- feat: improve DNS resolution error message by @mogery in #1724
- Fix unreachable allowSubdomains code in crawler filterURL method by @devin-ai-integration in #1719
- Add parsePDF parameter to JS SDK by @devin-ai-integration in #1720
- feat(api/worker): liveness check in queueing -- don't take jobs when the worker is dying (ENG-2519) by @mogery in #1725
- (feat/sdk) Allow Subdomains in the sdks by @nickscamara in #1726
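For context on the international URL entry above, the sketch below shows what such a URL looks like and how its hostname maps to an ASCII form. It uses only the Python standard library and is not Firecrawl's validation logic, which this changelog does not describe. The function name to_ascii_url is hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def to_ascii_url(url: str) -> str:
    """Convert the hostname of a URL to its ASCII (Punycode) form.

    Illustration only: userinfo (user:password@) is dropped, and the
    path, query, and fragment are passed through unchanged.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    # Python's built-in "idna" codec encodes each non-ASCII label as xn--...
    ascii_host = host.encode("idna").decode("ascii")
    netloc = ascii_host
    if parts.port is not None:
        netloc = f"{ascii_host}:{parts.port}"
    return urlunsplit((parts.scheme, netloc, parts.path, parts.query, parts.fragment))


# An internationalized hostname and its ASCII equivalent
print(to_ascii_url("https://bücher.example/katalog"))
# → https://xn--bcher-kva.example/katalog
```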
New Contributors
- @meetsoni1214 made their first contribution in #1693
- @micahstairs made their first contribution in #1717
Full Changelog: v1.12.0...v1.13.0