Unicode output on windows console

2019-06-05 17:29发布

问题:

The article Unicode apps in the MinGW-w64 wiki explains the following example for an Unicode application, e.g. _main.c_:

#define UNICODE
#define _UNICODE
#include <tchar.h>

int _tmain(int argc, TCHAR * argv[])
{
  _tprintf(argv[1]);
  return 0;
}

The above code makes use of tchar.h mapping, which allows it to both compile in Unicode and non-Unicode mode. [...] The -municode option is still required when linking if Unicode mode is used.

So I used

C:\> i686-w64-mingw32-gcc main.c -municode -o hello
_tmain.c:1:0: warning: "UNICODE" redefined
#define UNICODE
^
<command-line>:0:0: note: this is the location of the previous definition

to compile a Unicode application. But, when I run it, it returns

C:\> hello Süßer
S³▀er

So the Unicode string is wrong. I used the latest version 4.9.2 of MinGW-w64, i686 architecture and tried the Win32 and POSIX theads variants, both result in the same error. My operating system is 32-bit German Windows 7. When I used the Unicode codepage (chcp 65001), I have to use the font "Lucida Console". With this setting I get a similar error:

C:\> hello Süßer
S��er

I want to use a parameter with "ü" or "ß" in a Windows C++ program.

Solution

nwellnhof is right: The problem is the output on the console. This problem is explained in Unicode part 1: Windows console i/o approaches und Unicode part 2: UTF-8 stream mode. The latter gives a solution for Visual C++ - it worked also with Intel C++ 15. This blog post does "not yet consider the g++ compiler. All this code is Visual C++ specific. However, [the blog author has] done generally the same with g++, and [he] will probably discuss that in a third installment."

I want to open a file, which name is given by a parameter. This works simple, e. g. main.c:

#include <iostream>
#include <fstream>

using namespace std;

int main(int argc, char* argv[])
{

    if ( argc > 1 ) {
        // The output will be wrong, ...
        cout << argv[1] << endl;

        // but the name of this file will be right:
        fstream fl_rsl(argv[1], ios::out);
        fl_rsl.close();
    }
   return 0;
} 

and the compilation without unicode mode

C:\> g++ main.cpp -o hello && hello Süßer

It's console output is still wrong, but the created filename is right. This is okay for me.