Skip to content

Heritrix vs redsocks

A side-by-side look at Heritrix and redsocks. For an in-depth review of either product, follow the links below.

Heritrix

Heritrix

Development

Heritrix is an open-source, extensible, web-scale, archival-quality web crawler project built on the Apache stack. It is designed for archiving periodic captures of content from the web and large intranets.

archivingweb-crawleropen-source
redsocks

redsocks

Network & Admin

Redsocks is an open source software that allows redirecting TCP connections through proxy servers like SOCKS or HTTPS. It works at low level of operating system kernel, so all TCP connections can go through proxies transparently without any configuration in applications.

proxysockstcpredirection

Related Comparisons

Google Custom Search Engine
Apache Nutch
Expertrec Search Engine
Easy-Hide-IP