Given a populated table, how can I change the collation from utf8_bin to utf8_unicode_ci? A plain ALTER statement fails with "duplicate entry" errors. For example, there are two entries:
David Hussa
and
David Hußa
I know that they are the same. Is there an elegant way to tell MySQL to "merge" the records? I should mention that the record ID is referenced from other tables, so those references have to be respected too. Or do I have to do it the long and annoying way: merge each duplicate manually and then change the collation?
The table looks like this:
delimiter $$

CREATE TABLE `authors` (
  `id` int(11) NOT NULL AUTO_INCREMENT,
  `name` varchar(100) COLLATE utf8_bin NOT NULL,
  `count` int(11) NOT NULL DEFAULT '1',
  PRIMARY KEY (`id`),
  UNIQUE KEY `name_UNIQUE` (`name`),
  FULLTEXT KEY `name_FULLTEXT` (`name`)
) ENGINE=MyISAM AUTO_INCREMENT=930710 DEFAULT CHARSET=utf8 COLLATE=utf8_bin COMMENT='Stores all authors from dblp.xml.'$$
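To show the scale of the problem, a query along these lines should list the names that collide under the target collation. This is only a sketch, assuming utf8_unicode_ci is the collation I want; the column aliases (keep_id, ids, names, n) are made up for illustration. It only finds the duplicates, it does not merge anything or touch the referencing tables.

```sql
-- Group names by how they compare under utf8_unicode_ci instead of utf8_bin,
-- and keep only the groups that would violate the UNIQUE key after the change.
SELECT MIN(id)                                        AS keep_id, -- candidate id to keep (assumption: lowest id wins)
       GROUP_CONCAT(id ORDER BY id)                   AS ids,     -- all ids in the colliding group
       GROUP_CONCAT(name ORDER BY id SEPARATOR ' | ') AS names,   -- the spellings that collide
       COUNT(*)                                       AS n        -- size of the group
FROM authors
GROUP BY name COLLATE utf8_unicode_ci
HAVING COUNT(*) > 1;
```

For the two entries above, I would expect one row containing both spellings, since utf8_unicode_ci treats "ß" and "ss" as equal.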