Currently I’m working on a human resources project where I need to predict attrition.
I have the active list of employees as on Sep’17 and the list of churned employees for different years starting 2012. I need to predict which employees will leave from Oct’17 to Mar’18.
My approach has been:
- Tag all active employees as on Sep’17 as active
- Tag all churned employees in 2017 as inactive
- Split data into train and validation
- Build model and check performance
Is this approach correct? Are there any other alternative approaches in predicting employee churn?