Bot Protection

Learn how to manage bot traffic to your site.

Bots generate nearly half of all internet traffic, with many originating from malicious sources. These automated threats scrape content, attempt unauthorized logins, or overload servers. Bot protection mitigates these risks by distinguishing between verified bots and potentially harmful automated traffic.

Bot protection systems analyze incoming traffic to identify whether a request originates from a real user, a trusted bot like a search engine crawler, or an unknown automated source.

  • Allowing legitimate bots that correctly identify themselves
  • Challenging suspicious traffic that behaves abnormally or does not resemble real browser activity
  • Enforcing browser-like behavior by verifying navigation patterns and cache usage

To filter out harmful bot traffic, various techniques are used, including:

  • Signature-based detection: Inspecting HTTP requests for known bot signatures
  • Rate limiting: Restricting how often certain actions can be performed to prevent abuse
  • Challenges: Using JavaScript checks to verify human presence
  • Behavioral analysis: Detecting unusual patterns in user activity that suggest automation

With Vercel, you can use:

  • Managed rulesets to challenge specific bot traffic
  • Rate limiting and challenge actions with WAF custom rules to prevent bot activity from reaching your application

Bot filter managed ruleset is available in Beta on all plans

With Vercel, you can use the bot filter managed ruleset to challenge non-browser traffic from accessing your applications. It filters out automated threats while allowing legitimate traffic.

  • It identifies clients that violate browser-like behavior and serves a javascript challenge to them.
  • It prevents requests that falsely claim to be from a browser such as a curl request identifying as Chrome.
  • It automatically excludes verified bots, such as Google's crawler, from evaluation.

To learn more about how the ruleset works, review the Challenge section of Firewall actions. To understand the details of what get logged and how to monitor your traffic, review Firewall Observability.

For trusted automated traffic, you can create custom WAF rules with bypass actions that will allow this traffic to skip the bot filter ruleset.

You can apply the ruleset to your project in log or challenge mode. Learn how to Configure bot filter managed ruleset.

Vercel maintains and continuously updates a comprehensive directory of known legitimate bots from across the internet. This directory is regularly updated to include new legitimate services as they emerge. Attack Challenge Mode and bot filter automatically recognize and allow these bots to pass through without being challenged. You can block access to some or all of these bots by writing WAF custom rules with the User Agent match condition. To learn how to do this, review WAF Examples.

Bot nameDescriptionDocumentation
AdsBot-GoogleAdsBot-Google is Google's web crawler used for quality control of Google Ads.View
AdsenseThe AdSense crawler visits participating sites in order to provide them with relevant ads.View
AhrefsBotPowers the database for both Ahrefs, a marketing intelligence platform, and Yep, an independent, privacy-focused search engine.View
AhrefsSiteAuditPowers Ahrefs’ Site Audit tool. Ahrefs users can use Site Audit to analyze websites and find both technical SEO and on-page SEO issues.View
AlgoliaThe Algolia Crawler extracts content from your site and makes it searchable.View
Amazon KendraAmazon Kendra is a managed information retrieval and intelligent search service that uses natural language processing and advanced deep learning model.View
Amazon QAmazon Q Business is a generative artificial intelligence (generative AI)-powered assistant that you can tailor to your business needs.View
AmazonbotAmazonbot is Amazon's web crawler used to improve our services, such as enabling Alexa to more accurately answer questions for customers.View
APIs-GoogleCrawling preferences addressed to the APIs-Google user agent affect the delivery of push notification messages by Google APIs.View
ApplebotApplebot is used to power search in Spotlight, Siri, and Safari.View
Artemis Web CrawlerArtemis is a calm web reader with which you can follow websites and blogs.View
Better StackBetter Stack is a platform for monitoring and alerting on your applications.View
BingbotBingbot is Microsoft's web crawler used for indexing websites for Bing Search.View
ChatGPT-UserHandles user-initiated requests in ChatGPT, accessing external content to provide real-time information; not used for automated crawling or AI training.View
ChecklyCheckly is a platform for monitoring and alerting on your applications.View
Chrome LighthousePageSpeed Insights (PSI) reports on the user experience of a page on both mobile and desktop devices, and provides suggestions on how that page may be improved.View
Chrome Privacy Preserving Prefetch ProxyChrome's Privacy Preserving Prefetch Proxy service that fetches /.well-known/traffic-advice to enable privacy-preserving prefetch hints.View
Claude-SearchBotClaude-SearchBot navigates the web to improve search result quality for users. It analyzes online content specifically to enhance the relevance and accuracy of search responses.View
Claude-UserClaude-User supports Claude AI users. When individuals ask questions to Claude, it may access websites using a Claude-User agent.View
ClaudeBotClaudeBot helps enhance the utility and safety of our generative AI models by collecting web content that could potentially contribute to their training.View
CookiebotCookiebot automates compliance with cookie laws and helps you manage your cookie consent preferences.View
Datadog Synthetic Monitoring RobotDatadog's automated monitoring service that performs synthetic tests to verify website availability and performance.View
DuckAssistBotDuckAssistBot is a web crawler for DuckDuckGo Search that crawls pages in real-time for AI-assisted answers, which prominently cite their sources. This data is not used in any way to train AI models.View
DuckDuckBotDuckDuckBot is a web crawler for DuckDuckGo. DuckDuckBot’s job is to constantly improve search results and offer users the best and most secure search experience possible.View
FacebookExternalHitFetches content for shared links on Meta platforms to generate rich previews.View
FeedfetcherFeedfetcher is used for crawling RSS or Atom feeds for Google News and PubSubHubbub.View
GitHub CamoGitHub's image proxy serviceView
GitHub HookshotGitHub's webhooks for events like push, pull request, etc.View
Google-CloudVertexBotCrawling preferences addressed to the Google-CloudVertexBot user agent affect crawls requested by the site owners' for building Vertex AI Agents. It has no effect on Google Search or other products.View
Google-ExtendedGoogle-Extended is a standalone product token that web publishers can use to manage whether their sites help improve Gemini Apps and Vertex AI generative APIs, including future generations of models that power those products. Grounding with Google Search on Vertex AI does not use web pages for grounding that have disallowed Google-Extended. Google-Extended does not impact a site's inclusion or ranking in Google Search.View
Google-InspectionToolCrawling preferences addressed to the Google-InspectionTool user agent affect Search testing tools such as the Rich Result Test and URL inspection in Search Console. It has no effect on Google Search or other products.View
Google PageRendererUpon user request, Google Page Renderer fetches and renders web pages.View
Google Publisher CenterGoogle Publisher Center fetches and processes feeds that publishers explicitly supplied for use in Google News landing pages.View
Google Read AloudUpon user request, Google Read Aloud fetches and reads out web pages using text-to-speech (TTS).View
Google-SafetyThe Google-Safety user agent handles abuse-specific crawling, such as malware discovery for publicly posted links on Google properties. As such it's unaffected by crawling preferences.View
Google Site VerifierGoogle Site Verifier fetches Search Console verification tokens.View
Google StoreBotCrawling preferences addressed to the Storebot-Google user agent affect all surfaces of Google Shopping (for example, the Shopping tab in Google Search and Google Shopping).View
GooglebotCrawling preferences addressed to the Googlebot user agent affect Google Search (including Discover and all Google Search features), as well as other products such as Google Images, Google Video, Google News, and Discover.View
GoogleOtherCrawling preferences addressed to the GoogleOther user agent don't affect any specific product. GoogleOther is the generic crawler that may be used by various product teams for fetching publicly accessible content from sites. For example, it may be used for one-off crawls for internal research and development. It has no effect on Google Search or other products.View
GPTBotCrawls web content to improve OpenAI's generative AI models; respects 'robots.txt' directives to exclude sites from training data.View
HetrixTools Uptime Monitoring BotHetrixTools Uptime Monitoring Bot is used by HetrixTools's monitoring services to perform various checks on websites, including uptime and performance monitoring.View
HookdeckA reliable Event Gateway for event-driven applicationsView
HydrozenHydrozen is a tool for monitoring availability of your websites, Cronjobs, APIs, Domains, SSL etc.View
ImagesiftBotImageSiftBot is a web crawler that scrapes the internet for publicly available images to support Hive's suite of web intelligence products.View
InngestInngest is a platform for building event-driven applications.View
LinkedInBotLinkedInBot is a bot that renders links shared on LinkedIn.View
LumarThe Lumar website intelligence platform is used by SEO, engineering, marketing and digital operations teams to monitor the performance of their site’s technical health, and ensure a high-performing, revenue-driving website.View
meta-externalagentThe Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.View
meta-externalfetcherThe Meta-ExternalFetcher crawler performs user-initiated fetches of individual links to support specific product functions. Because the fetch was initiated by a user, this crawler may bypass robots.txt rules.View
OAI-SearchBotIndexes websites for inclusion in ChatGPT's search results; does not crawl content for AI model training.View
PayPalPayPal delivers real-time event notifications for payments, subscriptions, and account updates.View
Perplexity-UserHandles user-initiated requests in Perplexity, accessing external content to provide real-time information; not used for automated crawling or AI training.View
PerplexityBotIndexes websites for inclusion in Perplexity's search results; does not crawl content for AI model training.View
Pingdom BotPingdom Bot is used by Pingdom's monitoring services to perform various checks on websites, including uptime and performance monitoring.View
Pinterest BotPinterest Bot is a bot that crawls the web and indexes images and videos.View
QStashQStash is a platform for building event-driven applications.View
Amazon Route 53 Health Check ServiceAmazon Route 53 Health Check ServiceView
SemrushSemrush is a platform for SEO, content marketing, competitor research, PPC and social media marketing.View
Sentry Uptime Monitoring BotSentry's Uptime Monitoring Bot performs health checks on configured URLs to monitor the availability and reliability of web services.View
Site24x7Site24x7 Bot is used by Site24x7's monitoring services to perform various checks on websites, including uptime and performance monitoring.View
StatusCakeStatusCake is a website monitoring service that checks the uptime and performance of your website.View
Stripe WebhooksStripe's webhook service that delivers real-time event notifications for payment processing and account updates.View
svixsvix is a webhook service for sending events to webhooks.View
TwitterbotFetches content for shared links on X/Twitter to generate rich previews.View
Uptime RobotUptime Robot is a platform for monitoring and alerting on your applications.View
v0botBot for v0 services.View
Vercel Favicon BotVercel Favicon BotView
vercelflagsvercel flagsView
Vercel Screenshot BotVercel Screenshot BotView
YandexbotYandexBot is a web crawler operated by Yandex, a major Russian search engine.View
Last updated on April 26, 2025