Let says this ,

I’m working on a Machine learning project and i’m working on a dataset with a 4250,13 as shape and it is already group in 7 categories!

Note that those categories can’t be considered as prectictor,

here is how my data is grouped in the categories

```
A 32.852598 % of the dataset
B 19.151644 % of the dataset
c 19.003181 % of the dataset
D 16.076352 % of the dataset
E 5.132556 % of the dataset
F 4.814422 % of the dataset
G 2.969247 % of the dataset
```

I have a continuos output that I want to predict , so the task is a regression,

my goal is to predict it in each category , and the final decision will be the category where the predicted output is maximized.

My approach to deal with this problem is to sample my dataset into 7 sub-dataset and train the model in all that 7 dataset. and for a new input predict the output in each category and the final category will be where the predicted output is maximal.

Now I want to know is there any way to do it in one dataset and automatically predict the category where my output is max? with a single model instead of 7??

PS: I’m using python ans scikit learn

Sound like random-forrest but not sure that is it …;

can someone help? Any help will be appreciated…