I'm trying to convert a UTF-8 string
to a ISO-8859-1 char*
for use in legacy code. The only way I'm seeing to do this is with iconv
.
I would definitely prefer a completely string
-based C++ solution then just call .c_str()
on the resulting string.
How do I do this? Code example if possible, please. I'm fine using iconv
if it is the only solution you know.
Alfs suggestion implemented in C++11
First convert UTF-8 to 32-bit Unicode.
Then keep the values that are in the range 0 through 255.
Those are the Latin-1 code points, and for other values, decide if you want to treat that as an error or perhaps replace with code point 127 (my fav, the ASCII "del") or question mark or something.
The C++ standard library defines a
std::codecvt
specialization that can be used,C++11 §22.4.1.4/3: “the specialization
codecvt <char32_t, char, mbstate_t>
converts between the UTF-32 and UTF-8 encoding schemes”I'm going to modify my code from another answer to implement the suggestion from Alf.
Invalid UTF-8 input results in dropped characters.