I had a similar problem a year ago or so, and if I remember well, I decided (more or less) to use similar_text and soundex , as other people said in the comments. Something like that:
<?php $str1 = "Store 1 for you"; $str2 = "Store One 4 You"; similar_text(soundex($str1), soundex($str2), $percent); if ($percent >= 66){ echo "Equal"; //Send an email for review }else{ echo "Different"; //Proceed to insert in database } ?>
In my case, use a percentage of 66% to determine that the companies are the same (in this case, do not paste into the database, but send me an email to check and see if this is correct.)
After a few months with the help of these solutions, I decided to use some unique code for companies (CIF in my case, because it is unique to the company here in Spain).
source share