Description: Dirpy is an open-source web crawler and web scraper software. It allows users to efficiently crawl websites and extract data from web pages. Key features include recursive crawling, export of scraped data to CSV/JSON, handling of Javascript-heavy sites, and customization of crawling behavior.
Type: software
Pricing: Open Source
Description: Tabula is an open source software tool that allows users to extract data tables from PDF files. It provides a graphical user interface that lets users visually select parts of a PDF they want to extract into a spreadsheet or CSV file.
Type: software
Pricing: Open Source