Based on my reading on clustering and euclidean distance, my understanding was on similar line only. But when not changing the products into numbers and doing the clustering, I will get one with more products and others with very very few like 1/2/10 so I was not able to make much sense out of it. Including the numeric form of the products, 2 clusters will be kind of fine.
I get your explanation and my understanding also on the same line.
Some, dummy data as below:
sampledata.csv (259 Bytes)
volatility is the standard deviation of the booking quantity divided by mean over 2 years of data.
my need is to segment the clusters quarterly basis and see if any patterns exist across the 4 quarters to take any decision as which products to give priority for manufacturing.
all feedback and suggestions welcome.
Thanks and Regards