Webflow’s auto-generated sitemap can sometimes include unwanted URLs, causing SEO or indexing issues. Here are better workarounds to manage and prevent the inclusion of these URLs without constant manual cleanup.
1. Use Webflow’s Page Settings to Disable Indexing
- Open each page in Webflow Designer or from the Pages panel.
- Under Page Settings, enable “Hide this page from search engines” to add
noindex to the page. - This prevents the page from being included in your sitemap.
- Works well for utility pages (thank-you, login redirects, test pages, etc.)
2. Redirect Old or Incorrect URLs
- Go to Project Settings > Hosting tab.
- Scroll to the 301 Redirects section.
- Add redirect rules for outdated or incorrect URLs to valid ones.
- While not preventing entries in the sitemap directly, this ensures traffic goes to the right place until you rebuild or clean the structure.
3. Disable Auto-Generated Sitemap and Manually Upload a Correct Sitemap
- In Project Settings > SEO, toggle “Auto-generate Sitemap” to Off.
- Manually create a sitemap using a tool (e.g., Screaming Frog, XML-sitemaps.com, or a static text editor).
- Host your custom sitemap (e.g., in the Webflow Assets folder or via external hosting).
- Submit this custom sitemap to Google Search Console.
4. Exclude CMS Template Pages
- CMS Collection template pages auto-generate URLs for any CMS item published.
- If a CMS item shouldn’t be indexed:
- Unpublish it by setting a filter or status that excludes it.
- Or use page-level noindex meta tags through conditional visibility.
- Add custom code in the Page Settings > Before </head> tag:
- Use conditional logic via CMS fields to insert
<meta name="robots" content="noindex">.
5. Structure Your Site to Prevent Unwanted URLs
- Avoid creating blank or test CMS items/pages.
- Use naming and slug conventions that make it clear which items are draft or staging content.
- Use Webflow-specific user roles and Editor permissions to prevent collaborators from publishing test entries.
Summary
To avoid managing bad sitemap entries daily, disable auto-sitemap generation and upload a clean custom one, unindex specific pages in Webflow, redirect outdated URLs, and apply CMS filters to keep unneeded content from surfacing. This ensures control over what search engines crawl without constant manual edits.