Installation

Requirements

  • Python 3.9+ (v0.3.0+ requires modern Python versions)

  • NumPy 2.0+ and pandas 2.0+ compatible

  • Optional: HuggingFace transformers for advanced text processing

Stable release

Basic Installation

To install datawrangler, run this command in your terminal:

$ pip install pydata-wrangler

This installs the core functionality including sklearn-based text processing.

Full Installation with ML Libraries

For advanced text processing with sentence-transformers models:

$ pip install "pydata-wrangler[hf]"

This includes sentence-transformers, transformers, and related HuggingFace libraries.

Upgrade from Previous Versions

If upgrading from v0.2.x, ensure you have Python 3.9+:

$ pip install --upgrade "pydata-wrangler[hf]"

This is the preferred method to install datawrangler, as it will always install the most recent stable release.

If you don’t have pip installed, this Python installation guide can guide you through the process.

From sources

The sources for datawrangler can be downloaded from the Github repo.

You can either clone the public repository:

$ git clone git://github.com/ContextLab/data-wrangler

Or download the tarball:

$ curl -OJL https://github.com/ContextLab/data-wrangler/tarball/master

Once you have a copy of the source, you can install it with:

$ python setup.py install