I'm trying to compare two string lets say Émilie and Zoey. Well 'E' comes before 'Z' but on the ASCII chart Z comes before É so a normal if ( str1 > str2 )
Won't work.
I tried with if (strcmp(str1,str2) > 0)
still don't work. So i'm looking into a native way to compare string with UTF-8 characters.
There is no native way to do this, however a PECL extension: http://php.net/manual/de/class.collator.php
IMPORTANT
This answer is meant for situations where it's not possible to run/install the 'intl' extension, and only sorts strings by replacing accented characters to non-accented characters. To sort accented characters according to a specific locale, using a Collator is a better approach -- see the other answer to this question for more information.
Sorting by non-accented characters in PHP 5.2
You may try converting both strings to ASCII using iconv() and the //TRANSLIT option to get rid of accented characters;
Then do the comparison
See the documentation here:
http://www.php.net/manual/en/function.iconv.php
[updated, in response to @Esailija's remark] I overlooked the problem of //TRANSLIT translating accented characters in unexpected ways. This problem is mentioned in this question: php iconv translit for removing accents: not working as excepted?
To make the 'iconv()' approach work, I've added a code sample below that strips all non-word characters from the resulting string using preg_replace().
Here's something that works for me although I'm not sure if it will serve the purpose of comparing the special characters other languages have.
I'm just using the
mb_strpos
function and looking at the results. I guess that would be as close as you can get to a native comparing of UTF8 strings: