FireCrawl

AI/FireCrawl

Fork 0

RSS Feed

v1.14.0 61059d689d

Compare
1.14.0 Stable

lrss3 released this 2025-07-04 21:36:57 +08:00 | 11 commits to main since this release
v1.14.0 Release

We're excited to announce the release of Firecrawl v1.14.0, packed with cool updates.
- Authenticated scraping (Join the waitlist here)
- Zero data retention for enterprise (Email us at help@firecrawl.com to enable it)
- Improved p75 speeds
- New MCP version w/ maxAge + better tool calling
- Open Researcher Example (Open Source Researcher). See repo
- And so much more 🔥
What's Changed
- fix json format on search (ENG-2529) by @mogery in #1729
- proxy used improvement by @mogery in #1727
- add pdf prefetch log for debugging (ENG-2542) by @mogery in #1734
- (feat/ledger) Ledger events by @nickscamara in #1728
- fix(auto_charge): bad hourly counter logic by @mogery in #1736
- feat: implement IDN support with Punycode encoding by @devin-ai-integration in #1735
- feat(scrapeURL): ask user to increase timeout if there's a DOM.getDocument or queryAXTree error by @mogery in #1739
- feat(queue-worker): decrease job lock duration to pick up jobs on dead workers faster by @mogery in #1737
- feat(crawl-status): better creditsUsed field by @mogery in #1738
- (fix/sdk) Fixes Extract with show_sources=True causes JSON parsing error by @nickscamara in #1740
- fix(map): pass timeout to sitemap fetch by @mogery in #1741
- feat(api): zero data retention (ENG-2376) by @mogery in #1687
- Fix protocol change in firecrawl services by @nickscamara in #1742
- Fix queue worker liveness endpoint for self-hosted environments (ENG-2587) by @devin-ai-integration in #1747
- feat(firecrawl): add integration parameter support and enhance kwargs handling by @ftonato in #1715
- bugfix zero_data_retention param and certifi dependency by @rafaelsideguide in #1749
- Add health check endpoint for Playwright service (ENG-2585) by @devin-ai-integration in #1752
- Add URL depth validation to crawl requests (ENG-2617) by @devin-ai-integration in #1753
- fix(api/deep-research): bad default values leading to bug on self-host by @mogery in #1754
- Make error message for backwards crawling more clear by @micahstairs in #1758
- Implement JavaScript-only base href handling (ENG-2302) by @devin-ai-integration in #1756
- fix(html-transformer): check base tag when resolving relative URLs by @mogery in #1761
- html-transformer improvements by @mogery in #1762
- Implement fallback mechanism for onlyMainContent scraping (ENG-2499) by @devin-ai-integration in #1759
- Add end-to-end test for deep research functionality (ENG-2627) by @devin-ai-integration in #1760
- feat: add waitFor validation rule to enforce waitFor <= timeout/2 by @devin-ai-integration in #1751
- fix: replace hardcoded search billing with calculateCreditsToBeBilled by @devin-ai-integration in #1714
- feat(scrapeURL/fire-engine): start AB testing by @mogery in #1763
Full Changelog: v1.13.0...v1.14.0
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
v1.13.0 17ff8be67b

Compare
1.13.0 Stable

lrss3 released this 2025-06-27 23:02:15 +08:00 | 43 commits to main since this release
Firecrawl v1.13.0 Changelog
- Added AU, FR, DE to Stealth Mode
- Crawl subdomains with allowSubdomains
- Google slides scraping
- Generate a PDF of the current page. See docs
- Higher res screenshots with quality param
- Weekly view for usage on the dashboard
- Fireplexity Example (Open Source Perplexity). See repo
- And more!
Features
- Support cookies in scrapeURL via safeFetch
- Add Google Slides scraping
- Add PDF actions to API, Python & JS SDKs
- Screenshot quality config
- parsePDF support in Python & JS SDKs
- Allow international URLs
- Enable subdomain crawling in SDKs
- Worker liveness check before job queueing
Improvements & Fixes
- Removed old cache systems (Redis, PDF cache, etc.)
- Enhanced error messages (TLS, DNS)
- Improved job count query & error handling
- Fixed pdf, credits scope, ignoreInvalidURLs bugs
- Reduced over-logging in concurrency limits
- Added special job expiration bypass for Faire team
- Unified fallback logic in JS SDK
What's Changed
- Remove old cache mechanisms (redis cache, PDF cache, crawl maps, etc.) (FIR-2266) by @mogery in #1667
- feat(scrapeURL): support cookies in safeFetch by @mogery in #1688
- fix(search): respect parsePDF in pricing by @mogery in #1690
- feat(scrape): support Google Slides by @meetsoni1214 in #1693
- fix(api): CI by @mogery in #1692
- fix(api): improve error logging with structured error object by @mogery in #1697
- fix(api): enhance error handler with optional ACUC data by @mogery in #1698
- fix(api): handle errors better in redis-less crawl status by @mogery in #1699
- feat(api): optimize job count query and improve error handling by @mogery in #1700
- fix(api/v1/crawl/ongoing): only crawls, no batch scrape by @mogery in #1701
- feat(api): pdf action + housekeeping by @mogery in #1702
- fix(v1): check credits variable scope collision by @mogery in #1703
- fix(api): pdf bug + testing bugs by @mogery in #1704
- fix(api/batch/scrape): maxConcurrency field support when using ignoreInvalidURLs by @mogery in #1705
- fix(api): instantiate Storage only once by @mogery in #1706
- feat(api/ci): idmux by @mogery in #1707
- fix(concurrency-limit): scan instead of taking jobs by @mogery in #1708
- fix(concurrency-limit): overlogging by @mogery in #1709
- fix(api): cached acuc didn't have the is_extract flag set (ENG-2468) by @mogery in #1712
- fix(js-sdk/extract): use same zod fallback logic by @mogery in #1711
- Add temporary exception for Faire team ID to bypass job expiration by @devin-ai-integration in #1716
- feat(python-sdk): add parsePDF parameter support by @devin-ai-integration in #1713
- Allow international URLs to pass validation by @micahstairs in #1717
- feat: Screenshot quality by @nickscamara in #1721
- feat(sdk/python): add pdf action (ENG-2515) by @mogery in #1722
- feat(scrapeURL/skipTlsVerification): improve error message by @mogery in #1723
- feat: improve DNS resolution error message by @mogery in #1724
- Fix unreachable allowSubdomains code in crawler filterURL method by @devin-ai-integration in #1719
- Add parsePDF parameter to JS SDK by @devin-ai-integration in #1720
- feat(api/worker): liveness check in queueing -- don't take jobs when the worker is dying (ENG-2519) by @mogery in #1725
- (feat/sdk) Allow Subdomains in the sdks by @nickscamara in #1726
New Contributors
- @meetsoni1214 made their first contribution in #1693
- @micahstairs made their first contribution in #1717
Full Changelog: v1.12.0...v1.13.0
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)
v1.12.0 80f7177473

Compare
Nick: bump version Stable

lrss3 released this 2025-06-20 23:05:15 +08:00 | 79 commits to main since this release
Firecrawl v1.12.0 is here!
- New Concurrency System - Specify max concurrency by request for better control. See docs.
- Crawl Entire Domain Param - Follow internal links to sibling or parent URLs, not just child paths (prev. allowBackwardLinks). See docs.
- Google Docs Scraping - We now officially support scraping Google Docs files
- Improved Activity Logs - Better support for FIRE-1 requests. See your logs here.
- /search Playground Enhanced - Location Params added. Check out the playground.
- Firestarter Example - Open Source Chatbot building platform. Repo here.
- Plus tons of performance improvements and bug fixes.
P.S. Have feedback or ideas for v1.13.0? Hit reply and let us know. We're always listening to our community to build the features you need most.

What's Changed
- feat(vertex): fix vertex ai provider bug and update model references… by @tomkosm in #1668
- Improve URL filtering error messages with specific denial reasons (FIR-2352) by @devin-ai-integration in #1676
- feat(index): store short titles and descriptions (FIR-2356) by @mogery in #1677
- feat: Test mu v3 by @tomkosm in #1678
- Add deployment type field to bug report template by @devin-ai-integration in #1681
- feat(crawl-status): refactor to work after a redis flush (FIR-2271) by @mogery in #1664
- fix(scrape): log FIRE-1 credits billed on failures properly (FIR-2331) by @mogery in #1682
- feat(scrape, extract): creditsUsed, tokensUsed fields (FIR-2336) by @mogery in #1683
- Concurrency limit refactor + maxConcurrency parameter (FIR-2191) by @mogery in #1643
- feat(scrape): support Google Docs (FIR-1365) by @mogery in #1686
- feat: add followInternalLinks parameter as semantic replacement for allowBackwardLinks by @devin-ai-integration in #1684
Full Changelog: v1.11.0...v1.12.0
Downloads
- Source Code (ZIP)
- Source Code (TAR.GZ)

3 Releases 21 Tags

1.14.0 Stable

v1.14.0 Release

What's Changed

1.13.0 Stable

Firecrawl v1.13.0 Changelog

Features

Improvements & Fixes

What's Changed

New Contributors

Nick: bump version Stable

Firecrawl v1.12.0 is here!

What's Changed