Question 1

What is the User-Agent for Google Scholar?

Accepted Answer

Google Scholar identifies itself with the User-Agent string "google scholar" (alternate forms: Google Scholar). Google uses several variants for different products — see developers.google.com/search/docs/crawling-indexing/overview-google-crawlers for the full list.

Question 2

Should I block Google Scholar?

Accepted Answer

No. Blocking Google Scholar removes your pages from Google search results and directly hurts your organic traffic. The only legitimate use case for blocking is on staging or development environments where you do not want indexing.

Question 3

Should I block Google Scholar on my staging or dev site?

Accepted Answer

Yes — staging environments should not be indexed. Use robots.txt with "User-agent: google scholar / Disallow: /" or apply HTTP basic auth. Better: use a noindex meta tag plus a different hostname (staging.example.com) so production is unaffected.

Question 4

Why has Google Scholar stopped visiting my site?

Accepted Answer

Common causes: robots.txt misconfiguration (accidental Disallow), server errors (5xx responses cause crawl-rate to drop), slow page load, soft 404s, or natural crawl budget adjustment. Check Search Console (or equivalent) for crawl errors first.

Question 5

How does Google Scholar decide which pages to crawl?

Accepted Answer

Google Scholar prioritizes based on perceived page importance (links, freshness, content quality), site authority, and crawl budget. Submit a sitemap and ensure your most important pages are reachable from the homepage in 2-3 clicks for best coverage.

Question 6

How can I tell if Google Scholar traffic is real and not spoofed?

Accepted Answer

User-Agent strings can be faked by scrapers pretending to be Google Scholar. For Googlebot, do reverse DNS: the hostname must end in .googlebot.com or .google.com, then forward DNS back to the same IP. BotSights flags spoofed traffic automatically and shows a verified badge per visit.

Question 7

Does Google Scholar respect Crawl-delay?

Accepted Answer

No. Googlebot ignores Crawl-delay. Use Search Console's crawl rate setting instead, or return 503 Service Unavailable temporarily if your server is overloaded.

Google Scholar

Google Scholar Traffic (Last 90 Days)

What is Google Scholar?

What Google Scholar means for your site

What should you do?

How to identify Google Scholar

How to block Google Scholar

Option 1: Block all access

Option 2: Block specific paths only

Option 3: Slow down with a crawl delay

Frequently Asked Questions

Monitor search crawlers before visibility drops