I want to compare two strings that contain some non-English characters:
String1 = debarquer
String2 = débárquér
When I compare these two strings, they should be reported as equal.
There is a way to compare two string values like this in Java.
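To do this you can use Java's Normalizer class. Normalize each string to a decomposed form (NFD), so that every accented letter becomes a base letter followed by combining marks, then strip out the diacritical marks.
Once both strings have been stripped this way, you can compare them with equals() and the accents no longer matter.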
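A minimal sketch of the approach described above. The class name and the helper names `stripAccents` and `equalsIgnoringAccents` are made up for illustration, and the regex `\p{M}` (all Unicode combining marks) is one reasonable choice for "diacritical marks", not necessarily the only one:

```java
import java.text.Normalizer;

public class AccentInsensitiveCompare {

    // Hypothetical helper: decompose to NFD so that an accented letter becomes
    // base letter + combining mark(s), then delete the combining marks.
    public static String stripAccents(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    // Hypothetical helper: compare two strings after removing their accents.
    public static boolean equalsIgnoringAccents(String a, String b) {
        return stripAccents(a).equals(stripAccents(b));
    }

    public static void main(String[] args) {
        // The accented spelling from the question, written with Unicode escapes
        // so the source file does not depend on its encoding.
        String accented = "d\u00e9b\u00e1rqu\u00e9r";
        System.out.println(equalsIgnoringAccents("debarquer", accented));
    }
}
```

Note that this comparison is still case-sensitive; if case should be ignored too, `equalsIgnoreCase` could be used in place of `equals`.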
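Use the Collator class. It lets you choose a locale and a strength, and then compares strings according to that locale's rules. At Collator.PRIMARY strength only differences in the base letters count, so accents (and case) are ignored.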
It should be something similar to this (NOTE: I have not tested the program)
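As an illustration of the Collator suggestion, here is a minimal sketch. The class and method names are hypothetical, and `Locale.FRENCH` is an assumption based on the example word; pick whichever locale matches your data:

```java
import java.text.Collator;
import java.util.Locale;

public class CollatorCompare {

    // Hypothetical helper: true when the two strings differ only in
    // accents or case according to the given locale's collation rules.
    public static boolean equalsAtPrimaryStrength(String a, String b, Locale locale) {
        Collator collator = Collator.getInstance(locale);
        // PRIMARY: only base-letter differences are significant.
        collator.setStrength(Collator.PRIMARY);
        return collator.compare(a, b) == 0;
    }

    public static void main(String[] args) {
        // The accented spelling from the question, written with Unicode escapes.
        String accented = "d\u00e9b\u00e1rqu\u00e9r";
        System.out.println(equalsAtPrimaryStrength("debarquer", accented, Locale.FRENCH));
    }
}
```

Because PRIMARY strength also ignores case, "Debarquer" would compare equal as well. If case should still matter, the Normalizer approach may be a better fit.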
UPDATE: A point to note is that, strictly speaking, "débárquér" and "debarquer" are different strings and should not be considered equal. If you are sorting them, however, you do not want them ordered by their raw character values. Take for example "Joao" and "João": sorted by character value, you might get Joao, John, João, because "ã" has a higher code point than every unaccented ASCII letter. This is obviously not good. The Collator class handles this correctly.
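The sorting difference can be seen in a small sketch like the following. The class and method names are invented for the example, and `Locale.ENGLISH` is an arbitrary assumption; the default collator strength is left unchanged so that accents still act as a tie-breaker:

```java
import java.text.Collator;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class CollatorSortDemo {

    // The three names from the example; "Jo\u00e3o" is "Joao" with a tilde on the "a".
    static final List<String> NAMES = Arrays.asList("John", "Jo\u00e3o", "Joao");

    // Sort by raw char values (String's natural ordering).
    public static List<String> sortByCodePoint() {
        List<String> copy = new ArrayList<>(NAMES);
        copy.sort(null); // null comparator means natural ordering
        return copy;
    }

    // Sort with a locale-aware Collator (Collator implements Comparator<Object>).
    public static List<String> sortWithCollator() {
        List<String> copy = new ArrayList<>(NAMES);
        copy.sort(Collator.getInstance(Locale.ENGLISH));
        return copy;
    }

    public static void main(String[] args) {
        System.out.println("By char value: " + sortByCodePoint());
        System.out.println("With Collator: " + sortWithCollator());
    }
}
```

With the natural ordering the accented name lands after "John", while the collator keeps the two spellings of the name next to each other.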