Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update robots.txt to improve crawler access #323

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

fgo
Copy link

@fgo fgo commented Dec 29, 2024

Allow search engine visibility and ensures proper crawling behavior for Google.

Revise the robots.txt file to disallow all crawlers by default while allowing specific access for Googlebot, DuckDuckBot, and Bingbot.

Currently, searching for "Rails contributors" on Google results in:

No information is available for this page.

image

The reason is the page is blocked by Robots.txt
image

Robots.txt validator shows the site is disallowed if the User Agent is Googlebot.
image

Changing the User-Agent order allows Google to crawl the site.

image

@fxn
Copy link
Member

fxn commented Dec 29, 2024

I guess the problem is that the rule says "Google" instead of "Googlebot".

The patch does unnecessary additional edits, I think, would you mind doing that small edit only?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants