What are better workarounds for managing and correcting the auto generated sitemap in Webflow without having to manually remove incorrect URLs multiple times per day?

TL;DR
  • Disable auto-generated sitemap and upload a custom one via Webflow's settings and Google Search Console.  
  • Use page settings to add noindex to utility or test pages, redirect outdated URLs, filter or unpublish unnecessary CMS items, and structure content to avoid unwanted pages.

Webflow’s auto-generated sitemap can sometimes include unwanted URLs, causing SEO or indexing issues. Here are better workarounds to manage and prevent the inclusion of these URLs without constant manual cleanup.

1. Use Webflow’s Page Settings to Disable Indexing

  • Open each page in Webflow Designer or from the Pages panel.
  • Under Page Settings, enable “Hide this page from search engines” to add noindex to the page.
  • This prevents the page from being included in your sitemap.
  • Works well for utility pages (thank-you, login redirects, test pages, etc.)

2. Redirect Old or Incorrect URLs

  • Go to Project Settings > Hosting tab.
  • Scroll to the 301 Redirects section.
  • Add redirect rules for outdated or incorrect URLs to valid ones.
  • While not preventing entries in the sitemap directly, this ensures traffic goes to the right place until you rebuild or clean the structure.

3. Disable Auto-Generated Sitemap and Manually Upload a Correct Sitemap

  • In Project Settings > SEO, toggle “Auto-generate Sitemap” to Off.
  • Manually create a sitemap using a tool (e.g., Screaming Frog, XML-sitemaps.com, or a static text editor).
  • Host your custom sitemap (e.g., in the Webflow Assets folder or via external hosting).
  • Submit this custom sitemap to Google Search Console.

4. Exclude CMS Template Pages

  • CMS Collection template pages auto-generate URLs for any CMS item published.
  • If a CMS item shouldn’t be indexed:
  • Unpublish it by setting a filter or status that excludes it.
  • Or use page-level noindex meta tags through conditional visibility.
  • Add custom code in the Page Settings > Before </head> tag:
  • Use conditional logic via CMS fields to insert <meta name="robots" content="noindex">.

5. Structure Your Site to Prevent Unwanted URLs

  • Avoid creating blank or test CMS items/pages.
  • Use naming and slug conventions that make it clear which items are draft or staging content.
  • Use Webflow-specific user roles and Editor permissions to prevent collaborators from publishing test entries.

Summary

To avoid managing bad sitemap entries daily, disable auto-sitemap generation and upload a clean custom one, unindex specific pages in Webflowredirect outdated URLs, and apply CMS filters to keep unneeded content from surfacing. This ensures control over what search engines crawl without constant manual edits.

Rate this answer

Other Webflow Questions