Crawls and exports Classic Google Sites into a .docx file, preserving recipe titles and bullet lists. Written in Python.
-
Updated
May 29, 2025 - Python
Crawls and exports Classic Google Sites into a .docx file, preserving recipe titles and bullet lists. Written in Python.
A bash script for mirroring and creating recursive snapshots of static and semi-static websites using `wget`. Includes dynamic proxy and user agent rotation, asset filtering and download, adjustable concurrency, multi-domain support, depth control, optional offline link conversion, archive creation, and detailed logging.
Add a description, image, and links to the site-archiving topic page so that developers can more easily learn about it.
To associate your repository with the site-archiving topic, visit your repo's landing page and select "manage topics."