Xidel
Xidel is an open-source command line tool and library for parsing and extracting data from HTML/XML documents. It allows users to query documents using XPath and extract data to formats like JSON and CSV.
What is Xidel?
Xidel is an open-source command line tool, with an accompanying Pascal library, for filtering, transforming, and querying HTML/XML documents. Key features include:
- Supports XPath 3.1 for powerful querying of XML/HTML documents
- Can extract and output data to various formats like JSON, CSV, XML, HTML
- Command line interface for automation and scripting
- Pascal library (Internet Tools) for integrating into applications
- Fast processing of large files
- SSL/TLS support for fetching documents
- Modular filter pipeline for transforming documents
Xidel is useful for web scraping, data processing pipelines, testing APIs, and more. It provides a simple but flexible way to extract and transform data from HTML and XML documents from the command line or by integrating the library into an application.
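To make the "query with XPath, then export to JSON or CSV" workflow concrete, here is a minimal Python sketch of the same idea. It does not call Xidel; it uses only the Python standard library, whose `ElementTree` module supports a limited XPath subset rather than full XPath 3.1. The sample document, the field names, and the helper functions `extract_books`, `to_json`, and `to_csv` are all hypothetical and chosen for illustration.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical sample document (an assumption for this sketch, not Xidel output).
SAMPLE_XML = """
<catalog>
  <book id="b1"><title>XPath Basics</title><price>12.50</price></book>
  <book id="b2"><title>Scraping 101</title><price>8.00</price></book>
</catalog>
"""


def extract_books(xml_text):
    """Return one dict per <book> element found in the document."""
    root = ET.fromstring(xml_text)
    rows = []
    # ".//book" selects every <book> descendant; ElementTree only
    # understands a small subset of XPath, unlike a full XPath engine.
    for book in root.findall(".//book"):
        rows.append({
            "id": book.get("id"),
            "title": book.findtext("title"),
            "price": float(book.findtext("price")),
        })
    return rows


def to_json(rows):
    """Serialize the extracted rows as a JSON array."""
    return json.dumps(rows, indent=2)


def to_csv(rows):
    """Serialize the extracted rows as CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["id", "title", "price"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    books = extract_books(SAMPLE_XML)
    print(to_json(books))
    print(to_csv(books))
```

A dedicated tool such as Xidel collapses these steps into a single command, which is the main reason to reach for it in shell pipelines; a script like this one is mostly useful when the extraction has to live inside a larger program.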
Xidel Features
- XPath queries
- CSS selectors
- JSON/CSV/TSV output
- Scripting interface
- HTTP client
- XML/HTML parser
Pricing
- Open Source
Pros
- Powerful XPath engine
- Fast performance
- Lightweight
- Cross-platform
- Open source
Cons
- Command line only
- Steep learning curve
The Best Xidel Alternatives
Top development and web scraping apps similar to Xidel
Xmllint
xmllint is a useful command line utility for working with XML files. It comes standard with most Linux distributions and is helpful for developers or anyone who needs to parse, transform, query, or validate XML documents. Some key features of xmllint include:
- Checking XML files for well-formedness and validating against DTDs or...
XMLStarlet
XMLStarlet is a popular open source command line XML toolkit for Unix/Linux systems. It provides a set of versatile utilities and commands for processing, querying, transforming, validating, and editing XML documents and data from the command line. Some of the key features and capabilities of XMLStarlet include:
- Filtering and querying XML...
TXR
TXR is a programming language and interpreter designed for text processing, data munging, and information extraction tasks. Some key features and capabilities:
- Powerful built-in string handling functions and operations
- Native regular expression support integrated into the language
- Ability to concisely parse, analyze, transform and aggregate textual data
- Lightweight interpreted implementation for scripting needs
- Functional programming...