Skip to content

Apache Cassandra vs Heritrix

A side-by-side look at Apache Cassandra and Heritrix. For an in-depth review of either product, follow the links below.

Apache Cassandra

Apache Cassandra

Databases

Apache Cassandra is a free, open-source, distributed NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

distributedscalablehigh-availabilityfault-tolerantwide-column-store
Heritrix

Heritrix

Development

Heritrix is an open-source, extensible, web-scale, archival-quality web crawler project built on the Apache stack. It is designed for archiving periodic captures of content from the web and large intranets.

archivingweb-crawleropen-source