Description: Dirpy is an open-source web crawler and web scraper software. It allows users to efficiently crawl websites and extract data from web pages. Key features include recursive crawling, export of scraped data to CSV/JSON, handling of Javascript-heavy sites, and customization of crawling behavior.
Type: software
Pricing: Open Source
Description: Docparser is a document parsing API that can extract data from invoices, receipts, resumes and more. It uses machine learning to identify and extract key-value pairs, tables and other structured data from documents.
Type: software