Description: ArchiveBox is an open-source, self-hosted web archiving tool that saves web pages and their media assets. It aims to create local, browsable copies of sites from the internet.
Type: software
Pricing: Open Source
Description: SiteCrawler is a website crawler and scraper. It lets users crawl websites to extract data, mine content, monitor sites for changes, and perform SEO analysis. Features include visual point-and-click configuration, flexible crawling rules, and data exports.
Type: software