•How do you validate that the data received from sources is suitable for modelling?
@tillutony- Each time you get data from any source, you should check whether it is tidy data or not.
Tidy data- 1- Variables in columns, observations in rows, and one type of observational unit per dataset
2- Data that is easy to visualise and aggregate (i.e. it works well with lm, ggplot, and ddply)
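To make the first point a bit more concrete, here is a rough sketch of such a check. It is only an illustration, not a standard routine: it is written in plain Python rather than R, it assumes the table is held as a list of dicts (one dict per observation), and the name check_tidy is made up for this example. It only looks at three things: every row has the same columns, no values are missing, and each column holds a single type.

```python
def check_tidy(rows):
    """Return a list of problems found in a list-of-dicts table.

    Hypothetical helper for illustration: an empty list means none of
    the three checks below found anything, not that the data is
    guaranteed to be fit for modelling.
    """
    if not rows:
        return ["no rows"]

    problems = []

    # Check 1: every observation (row) has the same variables (columns).
    expected = set(rows[0])
    for i, row in enumerate(rows):
        if set(row) != expected:
            problems.append(f"row {i}: columns differ from row 0")

    for col in sorted(expected):
        values = [row[col] for row in rows if col in row]

        # Check 2: no missing values (None stands in for NA here).
        missing = sum(v is None for v in values)
        if missing:
            problems.append(f"column {col!r}: {missing} missing value(s)")

        # Check 3: one type per column. This is strict, so a column
        # mixing int and float is reported as well.
        types = {type(v).__name__ for v in values if v is not None}
        if len(types) > 1:
            problems.append(f"column {col!r}: mixed types {sorted(types)}")

    return problems


if __name__ == "__main__":
    sample = [
        {"id": 1, "height": 1.7},
        {"id": 2, "height": "tall"},  # wrong type on purpose
        {"id": 3, "height": None},    # missing value on purpose
    ]
    for problem in check_tidy(sample):
        print(problem)
```

If the list comes back empty, the table at least has a consistent shape; whether it is actually suitable for your model is still something you need to judge yourself.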
Hope this helps!
Thanks a ton