Assembling

fuzzy_join(), Joining tables on non-normalized categories with approximate matching. Example

Joiner, AggJoiner, transformers for joining multiple tables together. Example

Encoding

TableVectorizer: turn a pandas dataframe into a numerical array for machine learning. Example

GapEncoder, OneHotEncoder but robust to typos or non-normalized categories. Example

Cleaning

deduplicate(): merge categories of similar morphology (spelling). Example