In the tutorial -"Twitter sentiment analysis" How can we solve the problem of data imbalance?

As I’m learning from this provided course.

I found that there is huge data imbalance between the provided dataset. This will affect the model in future. How can we solve this problem?

Hi @chaitralip

There are multiple techniques to deal with class imbalance. Some of them I have listed below:

  1. Undersampling the majority class
  2. Oversampling the minority class
  3. SMOTE

For more techniques you can refer these articles:

Prateek Joshi

Thanks @pjoshi15

© Copyright 2013-2019 Analytics Vidhya