Heritrix vs lsdisk

A side-by-side look at Heritrix and lsdisk. For an in-depth review of either product, follow the links below.

Heritrix

Development

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler, written in Java. It is designed to archive periodic captures of content from the web and large intranets.

archiving, web-crawler, open-source

lsdisk

Os & Utilities

lsdisk is a command-line tool for Linux systems that lists available disk drives and their partitions. It gives a simple overview of disk usage and availability.

disk, partition, storage, utility

Related Comparisons

Google Custom Search Engine
Apache Nutch
Expertrec Search Engine