I am working on large dataset where there are lots of spelling error while recording the data manually or electronically. I am facing issues like for a unique id there could be different providers but for a same provider, first name last name varies due to spelling error. For example
provider id first name last name
12345 arun rastogi
12345 arrun rastogi
12345 arun raastogi
1234 aruun rastoge
Although the names are same but have been spelled incorrectly. We have millions of data like this. Please suggest how to deal with the spell error and treat them as one.