Cleanlab 2.0: Automatically Find Errors in ML Datasets

We just open-sourced the new cleanlab 2.0 Python package for automatically finding errors in datasets and machine learning/analytics with real-world, messy data and labels. tl;dr - cleanlab provides a framework to streamline data-centric AI.

Engineers have used cleanlab at Google to clean and train robust models on speech data, at Amazon to estimate how often the Alexa device doesn’t wake, at Wells Fargo to train reliable financial prediction models, and at Microsoft, Tesla, Facebook, etc.

Full blog-post with details here: cleanlab 2.0: Automatically Find Errors in ML Datasets