For the BNP Kaggle competition, we are given an anonymized dataset containing both categorical and numeric variables; that is, the columns have no meaningful names, only the data. How do you go about understanding a dataset like this?
Also, domain knowledge is often said to be key to a good machine learning model. How do you bring it to bear when the data is anonymized like this?
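To make the question concrete, here is a minimal sketch of the kind of first-pass profiling I have in mind, assuming the data is loaded into a pandas DataFrame. The function name `profile_columns`, the column names `v1`..`v3`, and the toy values are all hypothetical and are not taken from the competition data.

```python
import pandas as pd


def profile_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Summarize each column using only its values, not its name."""
    rows = []
    for col in df.columns:
        s = df[col]
        rows.append({
            "column": col,
            # Treat any numeric dtype as numeric, everything else as categorical.
            "kind": "numeric" if pd.api.types.is_numeric_dtype(s) else "categorical",
            # Fraction of missing values in the column.
            "missing_frac": float(s.isna().mean()),
            # Number of distinct non-missing values.
            "n_unique": int(s.nunique(dropna=True)),
        })
    return pd.DataFrame(rows).set_index("column")


# Tiny made-up frame standing in for the anonymized data (illustrative only).
toy = pd.DataFrame({
    "v1": [0.5, 1.2, None, 3.3],
    "v2": ["A", "B", "A", None],
    "v3": [1, 1, 0, 1],
})
summary = profile_columns(toy)
print(summary)
```

A summary like this at least separates likely identifiers, flags, and continuous measurements, but it does not tell me what the variables mean, which is the part I am asking about.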