This page crawls the Origin you provide (same‑origin) and builds a live sitemap.
It works when served over HTTP(S). If you open it from file://, start a tiny local server first (see instructions below).
You’re viewing this via file://
Browsers block crawling local files for security. Serve your folder and open this page through http://localhost:
- Python (already on most machines):
py -m http.server 8000 - Node:
npx http-server -p 8000 - Then open
http://localhost:8000/legal/sitemap/ (example local dev URL)
All URLs
Grouped by Folder
Notes
- Same‑origin only. If a page blocks cross‑origin fetches or requires JS routing, it may be skipped.
- We ignore non‑HTML assets by extension (images, fonts, CSS/JS, PDFs, ZIPs, etc.).
- To keep things fast, use a reasonable depth (2–4 is typical for static sites).
- When you’re ready for production, you can export XML and submit it to Search Console.
How to use
-
Serve your site root (same as you do now):
py -m http.server 8000 --directory "D:\site_reorg_v1" --bind 127.0.0.1 -
Put this file in
/legal/sitemap/(replacing the old one) and open:http://rrthalore.local:8000/legal/sitemap/ - Click Rescan — you should see the full URL list. Use Download XML to export.