I created a new project with the following code segment:
char* strange = "(Strange??)";
cout << strange << endl;
resulting in the following output:
(Strange]
Thus translating '??)' -> ']'
Debugging it shows that my char* string literal is actually that value and it's not a stream translation. This is obviously not a meta-character sequence I've ever seen. Some sort of Unicode or wide char sequence perhaps? I don't think so however... I've tried disabling all related project settings to no avail.
Anyone have an explanation?
- search : 'question mark, question mark, close brace' c c++ string literal
Trigraphs are the reason. The talk about C in the article also applies to C++
While trying to cross-compile on GCC it picked my sequence up as a trigraph:
So all I need to do now is figure out how to disable this in projects by default since I can only see it creating problems for me. (I'm using a US keyboard layout anyway)
The default behavior on GCC is to ignore but give a warning, which is much more sane and is indeed what Visual Studio 2010 will adopt as the standard as far as I know.
Easy way to avoid the trigraph surprise: split a "??" string literal in two:
Edit
gcc has an option to warn about trigraphs:
-Wtrigraphs
(enabled with-Wall
also)end edit
Quotes from the Standard
That's trigraph support. You can prevent trigraph interpretation by escaping any of the characters:
It's a Trigraph!
What you're seeing is called a trigraph.
In written language by grown-ups, one question mark is sufficient for any situation. Don't use more than one at a time and you'll never see this again.
GCC ignores trigraphs by default because hardly anyone uses them intentionally. Enable them with the
-trigraph
option, or tell the compiler to warning you about them with the-Wtrigraphs
option.Visual C++ 2010 also disables them by default and offers
/Zc:trigraphs
to enable them. I can't find anything about ways to enable or disable them in prior versions.