Description: Apache Oozie is an open source workflow scheduling and coordination system for managing Hadoop jobs. It allows users to define workflows that describe multi-stage Hadoop jobs and then execute those jobs in a dependable, repeatable fashion.
Type: software
Pricing: Free
Description: Spider Jack is a web scraping and data extraction tool. It allows users to easily scrape data from websites without needing to write code. Spider Jack has a graphical interface where users can point and click to extract data.
Type: software