Skip to content

Heritrix vs PDFOptim

A side-by-side look at Heritrix and PDFOptim. For an in-depth review of either product, follow the links below.

Heritrix

Heritrix

Development

Heritrix is an open-source, extensible, web-scale, archival-quality web crawler project built on the Apache stack. It is designed for archiving periodic captures of content from the web and large intranets.

archivingweb-crawleropen-source
PDFOptim

PDFOptim

Office & Productivity

PDFOptim is a free open source software that optimizes PDF files by reducing the file size. It works by analyzing the internal structure of the PDF, removing unnecessary metadata, compressing images and fonts, and performing other size reduction optimizations.

pdfoptimizercompression

Related Comparisons

StormCrawler
ApowerCompress
Online PDF Compressor
Compress PDF (by SmallPDF)