Apache Nutch vs dirty.us
A side-by-side look at Apache Nutch and dirty.us. For an in-depth review of either product, follow the links below.
Apache Nutch
Development
Apache Nutch is an open-source web crawler written in Java. It is used to build web search engines and web archiving systems. Nutch can crawl websites and index page content and metadata.
web-crawler, search-engine, java
dirty.us
Online Services
dirty.us is a website that provides recommendations for alternative software. It allows you to search for software you currently use and suggests free, open source alternatives with similar features and capabilities.
open-source, alternatives, recommendations, free-software
Related Comparisons
Scrapy
SaidIt.net
Slashdot
Crawlbase
StormCrawler
Heritrix