Installation
Requirements
Python 3.9+ (required by datawrangler v0.3.0 and later)
Compatible with NumPy 2.0+ and pandas 2.0+
Optional: HuggingFace transformers for advanced text processing
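Before installing, it can help to confirm that the active environment matches these requirements. The following is a minimal sketch using only the standard library. The helper name `environment_report` is illustrative and not part of datawrangler, and it only reports the NumPy and pandas versions it finds rather than enforcing a minimum.

```python
import sys
from importlib import metadata

MIN_PYTHON = (3, 9)  # minimum Python version listed in the requirements


def environment_report(packages=("numpy", "pandas")):
    """Report whether Python is new enough and which package versions are installed.

    Illustrative helper, not part of datawrangler.
    """
    report = {"python_ok": sys.version_info[:2] >= MIN_PYTHON}
    for name in packages:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = None  # not installed in this environment
    return report


if __name__ == "__main__":
    for key, value in environment_report().items():
        print(f"{key}: {value}")
```

A value of `None` for a package means only that it is not installed in the current environment.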
Stable release
Basic Installation
To install datawrangler, run this command in your terminal:
$ pip install pydata-wrangler
This installs the core functionality, including scikit-learn-based text processing.
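To confirm that the installation succeeded, one option is to look up the installed distribution and check that the package can be found. This sketch assumes the distribution name `pydata-wrangler` from the command above and assumes the import name is `datawrangler`. The helper `installation_status` is hypothetical and not part of the library.

```python
from importlib import metadata
from importlib.util import find_spec


def installation_status(distribution="pydata-wrangler", module="datawrangler"):
    """Return (installed_version_or_None, module_is_importable).

    Both default names are assumptions: the distribution name comes from the
    pip command, and the import name is assumed to be "datawrangler".
    """
    try:
        version = metadata.version(distribution)
    except metadata.PackageNotFoundError:
        version = None  # pip has no record of this distribution
    return version, find_spec(module) is not None


if __name__ == "__main__":
    version, importable = installation_status()
    print(f"installed version: {version}, importable: {importable}")
```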
Full Installation with ML Libraries
For advanced text processing with sentence-transformers models:
$ pip install "pydata-wrangler[hf]"
This includes sentence-transformers, transformers, and related HuggingFace libraries.
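To see whether the optional HuggingFace libraries are available in the current environment, a small check like the following may be useful. It is a sketch that relies only on the standard library and on the usual import names `sentence_transformers` and `transformers`. The helper `missing_optional_modules` is illustrative and not part of datawrangler.

```python
from importlib.util import find_spec

# Import names of the optional libraries named above.
OPTIONAL_MODULES = ("sentence_transformers", "transformers")


def missing_optional_modules(modules=OPTIONAL_MODULES):
    """Return the names of top-level modules that cannot be found."""
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_optional_modules()
    if missing:
        print("Missing optional modules:", ", ".join(missing))
    else:
        print("All optional HuggingFace modules are available.")
```

If anything is reported missing, installing the `[hf]` extra as shown above is the route this guide describes for adding it.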
Upgrade from Previous Versions
If you are upgrading from v0.2.x, first make sure you are running Python 3.9+, then run:
$ pip install --upgrade "pydata-wrangler[hf]"
Installing with pip is the preferred way to install datawrangler, as it always installs the most recent stable release.
If you don’t have pip installed, the Python installation guide can walk you through the process.
From sources
The sources for datawrangler can be downloaded from the GitHub repo.
You can either clone the public repository:
$ git clone https://github.com/ContextLab/data-wrangler
Or download the tarball:
$ curl -OJL https://github.com/ContextLab/data-wrangler/tarball/master
Once you have a copy of the source, you can install it from the root of the source directory with:
$ pip install .