I have an application which is mostly based in czech language, that is why we uft8_czech_ci.
Given this example:
WHERE `firstName` = 'ales' collate utf8_czech_ci
I am unable to find result aleš
(which is common czech name). When I try this:
WHERE `firstName` = 'ales' collate utf8_general_ci
It successfuly finds aleš
. Is the utf8_czech_ci
definition in MySQL incorrect? I don't want to just blindly start using general_ci.
Thanks.
The reason is that š and s are two different letters in CZ alphabet, so that's that's why it's not found when using
utf8_czech_ci
collation.See also http://collation-charts.org/mysql60/mysql604.utf8_general_ci.european.html and http://collation-charts.org/mysql60/mysql604.utf8_czech_ci.html
Another exhibit of the ordering for utf8 : utf8_czech_ci
Versus utf8 : utf8_general_ci