Your Webflow site may be blocking bots if indexing has been disabled unintentionally or if there's a problem with your robots.txt or sitemap settings. Either issue can prevent Google Search Console from reading your sitemap.
1. Check Page and Site-Wide SEO Settings
- Go to Project Settings > SEO tab and make sure “Disable Webflow Subdomain Indexing” is not selected if you're using the Webflow subdomain.
- Check the individual page settings in the Webflow Designer:
  - Open the page settings (gear icon on the page).
  - Ensure “SEO Settings > Hide this page from search engines” is not checked.
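To double-check the page-level setting from outside the Designer, you can look at the published HTML for a robots meta tag. The sketch below assumes a hidden page shows up as `<meta name="robots" content="noindex">` in the page source, which is the common convention but worth confirming on your own site. `RobotsMetaParser` and `has_noindex` are illustrative names, not part of Webflow.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives from every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            # Directives are comma-separated, e.g. "noindex, nofollow".
            self.directives += [d.strip().lower() for d in content.split(",")]


def has_noindex(html: str) -> bool:
    """Return True if the HTML asks search engines not to index the page."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives
```

Paste the page source (View Source in your browser) into `has_noindex` to see whether the published page still carries a noindex directive.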
2. Review robots.txt Settings
- In Project Settings > SEO tab, find the robots.txt field.
- Ensure it doesn't contain `Disallow: /`, which would block all bots from crawling your site.
- A typical default allows crawling:
```
User-agent: *
Allow: /
Sitemap: https://yourdomain.com/sitemap.xml
```
- Update the sitemap URL to match your actual domain (use https, and include www if your domain uses it).
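If you want to test robots.txt rules before publishing, Python's standard `urllib.robotparser` can evaluate them locally. This is a minimal sketch: `can_crawl` is an illustrative helper, `yourdomain.com` is the same placeholder used above, and the standard-library parser may not match Google's own matching rules in every edge case.

```python
from urllib.robotparser import RobotFileParser


def can_crawl(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file contents as a list of lines, so no network is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


allowed_rules = """User-agent: *
Allow: /
Sitemap: https://yourdomain.com/sitemap.xml"""

blocked_rules = """User-agent: *
Disallow: /"""

print(can_crawl(allowed_rules, "https://yourdomain.com/"))  # True
print(can_crawl(blocked_rules, "https://yourdomain.com/"))  # False
```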
3. Confirm Sitemap Availability
- Visit https://yourdomain.com/sitemap.xml in your browser.
- If it returns a 404 or any error, then the sitemap isn't published or the domain is misconfigured.
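The same check can be scripted. The sketch below assumes the sitemap follows the standard sitemaps.org `urlset` format; `fetch_sitemap` and `sitemap_urls` are illustrative helper names, and `yourdomain.com` is a placeholder.

```python
import xml.etree.ElementTree as ET
from typing import List, Union
from urllib.request import urlopen

# Namespace used by standard sitemap files.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def fetch_sitemap(url: str, timeout: float = 10.0) -> bytes:
    """Download the sitemap; urlopen raises HTTPError on a 404 or 403."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def sitemap_urls(xml_data: Union[str, bytes]) -> List[str]:
    """Return every <loc> entry found in the sitemap XML."""
    root = ET.fromstring(xml_data)
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]
```

Calling `sitemap_urls(fetch_sitemap("https://yourdomain.com/sitemap.xml"))` should list your published pages. An `HTTPError` or an empty list suggests the sitemap isn't published or the domain is misconfigured.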
4. Make Sure Your Domain Is Properly Connected
- In Project Settings > Hosting, confirm your domain is connected and published without errors.
- Your DNS A records should point to both 75.2.70.75 and 99.83.190.102.
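A quick way to compare your DNS against the two addresses listed in this step is sketched below. The expected IPs are copied from this step, so confirm them against your own Hosting tab. `resolve_a_records` and `missing_a_records` are illustrative helpers, and results can lag behind recent DNS changes because of caching.

```python
import socket
from typing import Iterable, Set

# A record targets as listed in this step; confirm them in your Hosting tab.
EXPECTED_A_RECORDS = {"75.2.70.75", "99.83.190.102"}


def resolve_a_records(hostname: str) -> Set[str]:
    """Look up the IPv4 addresses the hostname currently resolves to."""
    _, _, addresses = socket.gethostbyname_ex(hostname)
    return set(addresses)


def missing_a_records(resolved: Iterable[str]) -> Set[str]:
    """Return the expected addresses that are absent from the resolved set."""
    return EXPECTED_A_RECORDS - set(resolved)
```

For example, `missing_a_records(resolve_a_records("yourdomain.com"))` returns an empty set when both records are in place.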
5. Test and Resubmit Sitemap in Google Search Console
- In Google Search Console, use the URL Inspection Tool to test a live URL from your site to confirm that it is crawlable.
- Go to the Sitemaps section and resubmit sitemap.xml.
- If you still receive errors, check if Google reports a crawl error (e.g., blocked by robots.txt or 403 Forbidden).
Summary
Your site might be blocking bots because of a misconfigured robots.txt, disabled indexing settings, or a broken sitemap link. Make sure page-level and project-level SEO settings allow indexing, your robots.txt is not overly restrictive, and your sitemap is accessible and correctly submitted in Google Search Console.