A command-line program performing basic numeric, textual, and statistical operations on tabular data, useful for calculations, sorting, and summarizations on CSV files.
datamash is an open-source command-line program used to perform basic numeric, textual and statistical operations on tabular data files. It allows you to easily do tasks like calculations, sorting, and summarizations on data in text files, CSVs, and other tabular data formats.
Some key features and capabilities of datamash include:
datamash can help with exploratory data analysis and data cleaning tasks in data science, analysis and reporting workflows. It's included by default in many Linux distributions. With its focus on tabular data transformations, datamash can be a lightweight and faster alternative to other solutions like R, Python or Excel for some use cases.
Here are some alternatives to Datamash:
Suggest an alternative ❐