Datamash: Command-Line Program for Tabular Data Operations
A command-line program performing basic numeric, textual, and statistical operations on tabular data, useful for calculations, sorting, and summarizations on CSV files.
What is Datamash?
GNU Datamash is an open-source command-line program that performs basic numeric, textual, and statistical operations on tabular data files. It handles tasks such as calculations, grouping, and summarization on tab-separated text, simple CSV files, and other delimited formats.
Some key features and capabilities of datamash include:
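- Performing basic statistics such as mean, median, max, min, count, sum, and standard deviation (sstdev for sample, pstdev for population) on numeric columns
- Grouping rows by one or more columns (groupby) and applying text-friendly operations such as count, unique, and collapse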
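- Sorting the input on the grouping columns before grouping (the -s/--sort option)
- Removing rows with duplicate key values (rmdup); condition-based row filtering is better handled by a tool such as awk
- Working in pipelines alongside standard tools such as sort and join; datamash itself reads its input from standard input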
- Handling large data files with good performance
- Easy to use syntax, even for those without programming experience
- Writing plain-text results to standard output, which can be redirected to a file or piped to other tools
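To make the grouping and summary features above concrete, here is a minimal Python sketch of the computation behind a grouped sum and mean. It is an illustration only, not datamash's implementation. The sample data and the function name grouped_sum_mean are hypothetical. With datamash itself, a comparable result would come from a command along the lines of `datamash -t, -s -g 1 sum 2 mean 2 < sales.csv`, where sales.csv is an assumed two-column file.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical sample data: a two-column CSV of (region, amount).
SAMPLE = """north,10
south,5
north,20
south,7
"""


def grouped_sum_mean(text):
    """Group rows by column 1 and return {key: (sum, mean)} of column 2."""
    groups = defaultdict(list)
    for key, value in csv.reader(io.StringIO(text)):
        groups[key].append(float(value))
    # Sort the keys so output order is deterministic, similar in spirit
    # to sorting the input on the group column before grouping.
    return {key: (sum(vals), mean(vals)) for key, vals in sorted(groups.items())}


if __name__ == "__main__":
    for key, (total, avg) in grouped_sum_mean(SAMPLE).items():
        # One output line per group: key, sum, mean.
        print(f"{key},{total:g},{avg:g}")
```

The datamash one-liner covers the same ground without any script, which is the main appeal of the tool for quick summaries.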
datamash can help with exploratory data analysis and data cleaning tasks in data science, analysis, and reporting workflows. It is packaged in the repositories of most major Linux distributions. With its focus on tabular data transformations, datamash can be a lightweight, faster alternative to R, Python, or Excel for some use cases.
Datamash Features
- Perform basic calculations on data
- Sort data
- Summarize data
- Operate on CSV files and tabular data
Pricing
- Open Source
The Best Datamash Alternatives
Top Office & Productivity, Data Processing, and other apps similar to Datamash
Here are some alternatives to Datamash:
R (programming language)
Gawk
Mawk
Surveytagger