Skip to content

Apache HBase vs Heritrix

A side-by-side look at Apache HBase and Heritrix. For an in-depth review of either product, follow the links below.

Apache HBase

Apache HBase

Development

Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable. It is written in Java and provides fast random access to large amounts of structured data.

distributednonrelationalbig-datahadoop
Heritrix

Heritrix

Development

Heritrix is an open-source, extensible, web-scale, archival-quality web crawler project built on the Apache stack. It is designed for archiving periodic captures of content from the web and large intranets.

archivingweb-crawleropen-source

Related Comparisons