I have this situation in Google Search Console:
Under
Page indexing > Crawled - currently not indexed:
https://<my website URL>/forums/<private forum name>/<private topic name>/
This was last crawled 2 days ago.
- Normal users ("Registered") have a No access Forum Permission associated with that forum.
- Guests (Google should be a Guest) have a No access Forum Permission associated with that forum.
"Crawled" means that a Google bot was able to get to that page.
How is this possible?
I believe the forum is embedded onto the page. It's possible that the page was crawled, but none of the content of the forum was shown.
If you navigate to the link for a private forum in an incognito browser you'll see the experience.
I think the question is how Google knew about the link, perhaps there's an issue where private forums are still being added to the sitemap? The would seem like a bug IMO.
I think the question is how Google knew about the link, perhaps there's an issue where private forums are still being added to the sitemap? The would seem like a bug IMO.
Thanks Vanessa, that's exactly the question I have.
I have already tested the link with a private browsing session and, of course, the site asks you to log in to view it, which is the expected behaviour.
The problem is, Google shouldn't even have known what the URL of that topic was!
Google indexes something when it can view it.
Do some tests to see how the private topics are public. sitemap? Check it.
Do some work ......
Do you have a robots.txt file on the site? If do have excluded the forum/s in there?
Example:
User-agent: * Disallow: /community/name-of-private_forum1/ Disallow: /community/name-of-private_forum2/