I have a biased Data Set -
It contains more than 100 independent variables, with one dependent variable (Yes or No).
I have unbalanced classes, e.g. 98% ‘Yes’ labels vs 2% ‘No’ labels.
On Real Production : For New testing data , Accuracy for ‘Yes’ is : 99% and for ‘No’ it is 30%.
I have made my model on Logistic Regression.
With help of which statistical model/method , i can increase my accuracy of ‘No’ to 50%.
Any Suggestion will be appreciated!!!