Xidel

Xidel

Xidel is an open-source command line tool and library for parsing and extracting data from HTML/XML documents. It allows users to query documents using XPath and extract data to formats like JSON and CSV.
Xidel image
xpath html xml json csv parsing extracting

Xidel: Open-Source Command Line Tool for Parsing HTML/XML Data

An open-source command line tool and library for parsing and extracting data from HTML/XML documents, allowing users to query documents using XPath and extract data to formats like JSON and CSV.

What is Xidel?

Xidel is an open-source command line tool and C/C++ library for filtering, transforming and querying HTML/XML documents. Some key features of Xidel include:

  • Supports XPath 3.1 for powerful querying of XML/HTML documents
  • Can extract and output data to various formats like JSON, CSV, XML, HTML
  • Command line interface for automation and scripting
  • C/C++ library for integrating into applications
  • Fast processing of large files
  • SSL/TLS support for fetching documents
  • Modular filter pipeline for transforming documents

Xidel is useful for web scraping, data processing pipelines, testing APIs, and more. It provides a simple but flexible way to extract and transform data from HTML and XML documents from the command line or by integrating the library into an application.

Xidel Features

Features

  1. XPath queries
  2. CSS selectors
  3. JSON/CSV/TSV output
  4. Scripting interface
  5. HTTP client
  6. XML/HTML parser

Pricing

  • Open Source

Pros

Powerful XPath engine

Fast performance

Lightweight

Cross-platform

Open source

Cons

Command line only

Steep learning curve


The Best Xidel Alternatives

Top Development and Web Scraping and other similar apps like Xidel

Here are some alternatives to Xidel:

Suggest an alternative ❐

Xmllint icon

Xmllint

xmllint is a useful command line utility for working with XML files. It comes standard with most Linux distributions and is helpful for developers or anyone who needs to parse, transform, query, or validate XML documents.Some key features of xmllint include:Checking XML files for well-formedness and validating against DTDs or...
Xmllint image
XMLStarlet icon

XMLStarlet

XMLStarlet is a popular open source command line XML toolkit for Unix/Linux systems. It provides a set of versatile utilities and commands for processing, querying, transforming, validating, and editing XML documents and data on the command line interface.Some of the key features and capabilities of XMLStarlet include:Filtering and querying XML...
XMLStarlet image
TXR icon

TXR

TXR is a programming language and interpreter designed for text processing, data munging, and information extraction tasks. Some key features and capabilities:Powerful built-in string handling functions and operationsNative regular expression support integrated into the languageAbility to concisely parse, analyze, transform and aggregate textual dataLightweight interpreted implementation for scripting needsFunctional programming...
TXR image