I was experimenting with regular expression in trying to make an answer to this question, and found that while regex_match
finds a match, regex_search
does not.
The following program was compiled with g++ 4.7.1:
#include <regex>
#include <iostream>
int main()
{
const std::string s = "/home/toto/FILE_mysymbol_EVENT.DAT";
std::regex rgx(".*FILE_(.+)_EVENT\\.DAT.*");
std::smatch match;
if (std::regex_match(s.begin(), s.end(), rgx))
std::cout << "regex_match: match\n";
else
std::cout << "regex_match: no match\n";
if (std::regex_search(s.begin(), s.end(), match, rgx))
std::cout << "regex_search: match\n";
else
std::cout << "regex_search: no match\n";
}
Output:
regex_match: match regex_search: no match
Is my assumption that both should match wrong, or might there a problem with the library in GCC 4.7.1?
Your regex works fine (both match, which is correct) in VS 2012rc.
In g++ 4.7.1
(-std=gnu++11)
, if using:".*FILE_(.+)_EVENT\\.DAT.*"
,regex_match
matches, butregex_search
doesn't.".*?FILE_(.+?)_EVENT\\.DAT.*"
, neitherregex_match
norregex_search
matches (O_o).All variants should match but some don't (for reasons that have been pointed out already by betabandido). In g++ 4.6.3
(-std=gnu++0x)
, the behavior is identical to g++ 4.7.1.Boost (1.50) matches everything correctly w/both pattern varieties.
Summary:
Regarding your pattern, if you mean a dot character
'.'
, then you should write so ("\\."
). You can also reduce backtracking by using non-greedy modifiers (?
):Looking through the latest libstdc++ source code for
regex_search
, you will find:Unfortunately this is not the only TODO item left. GCC's
<regex>
implementation is currently incomplete. I recommend using Boost or Clang and#ifdef
the code until GCC has caught up.(This has neither been fixed in the 4.8 branch.)
I tried to use the regex library in C++11 and I ran into many problems (both using g++ 4.6 and 4.7). Basically, the support is either not there or there is only partial support. That is true even for the SVN version. Here you have a link describing the current status for the SVN version of libstdc++.
So, for the time being, I guess the best option is to continue using Boost.Regex.
Alternatively, you can try to use libc++. According to this document, support for regular expressions is complete.
Assuming that C++ and Boost Regex have a similar structure and functionality, the difference between
regex_match
andregex_search
is explained here: