Synthetic minority oversampling technique

In statistics, synthetic minority oversampling technique (SMOTE) is a method for oversampling samples when dealing with imbalanced classification categories within a dataset. The problem with doing statistical inference and modelling on imbalanced datasets is that the inferences and results from those analyses will be biased towards the majority class. Other solutions undersample the majority class to be equivalently represented in the data with the minority class. Instead of undersampling the majority class, SMOTE oversamples the minority class. However, this technique has been shown to yield poorly calibrated models, with an overestimated probability to belong to the minority class.