how can I randomly partition a dataset into smaller subsets using a mean based splitting approach with python. Thanjs