SMOTE implementation in Python

machine_learning
python
smote

#1

Hi,
I am working on imbalanced dataset in Python.I am referring to SMOTE example from this link
http://contrib.scikit-learn.org/imbalanced-learn/generated/imblearn.over_sampling.SMOTE.html

Can you please explain me the example ?Does X and y corresponds to features and labels?Can you please explain me with some other example of smote implementation in python?

Thanks


#2

For data imbalance problems, mostly handled in three steps.

  1. Over-sample the minority class.
  2. Under-sample the majority class.
  3. Synthesize new minority classes.

SMOTE (Synthetic Minority Over-sampling TEchnique) is coming under the third step. It’s the process of creating a new minority classes from the datasets.

The process in SMOTE is mentioned below.

SMOTE are available in R in the unbalanced package and in Python in the UnbalancedDataset package.

Limitation of SMOTE:

It can only generate examples within the body of available examples—never outside. Formally, SMOTE can only fill in the convex hull of existing minority examples, but not create new exterior regions of minority examples.

Refer - Learning from Imbalanced Classes by TOM FAWCETT for more understanding about imbalanced classes.

Thanks.


#3

Thanks Bhuvaneshwaran for the explanation