DeltaPy: A Framework for Tabular Data Augmentation in Python

3 Pages Posted: 19 May 2020

See all articles by Derek Snow

Derek Snow

The Alan Turing Institute

Date Written: April 22, 2020

Abstract

A range of data abstractions have come to the fore since the re-emergence of machine learning. This includes procedures like feature engineering, extraction, transformation, and selection, as well as data pre-processing, generation, synthesisation, and augmentation. This report attempts to unify some of this terminology with the development of a bare-bones Python package, DeltaPy.

Keywords: Tabular Data, Augmentation Methods, Machine Learning, Data Science, Feature Engineering, Synthetic Data, Colab Notebook

JEL Classification: C02, C13, C21, C38, C53, C87

Suggested Citation

Snow, Derek, DeltaPy: A Framework for Tabular Data Augmentation in Python (April 22, 2020). Available at SSRN: https://ssrn.com/abstract=3582219 or http://dx.doi.org/10.2139/ssrn.3582219

Derek Snow (Contact Author)

The Alan Turing Institute ( email )

British Library, 96 Euston Rd
London, NW1 2DB
United Kingdom

HOME PAGE: http://www.turing.ac.uk/

Do you want regular updates from SSRN on Twitter?

Paper statistics

Downloads
847
Abstract Views
2,586
rank
39,525
PlumX Metrics