Can anyone give any pointers on how to proceed further for improving accuracy for this hackathon (Mine is only 0.54)
Steps followed by me so far:
- Joined Internship data to train data.
2 Created features for student data such as distinct profiles got by a student, and distinct work exp for student , and then made my data frame to contain only unique student info
- Joined my student data with the data in first step
Once I had the complete data, I tried developing models( LR, RF, Xgboost) by considering intuitive features which can help in a candidate getting shortlisted. Additionally I created variables such as duration check (Duration given by Student < Duration required by Company), part-time check etc. But all this effort could not help me in improving my accuracy.
Further I used SMOTE package to remove the imbalance of classes, which also didn’t help much
Am I going somewhere wrong!! please assist