regex greedy problem (C#)

2019-06-20 02:55发布

I've a input string like "===text=== and ===text===" and I want to replace wiki syntax with the corresponding html tag.

input:

===text=== and ===text===

desirable output:

<h1>text</h2> and <h1>text</h2>

but with the following code I get this output:

var regex = new Regex("---(.+)---");
var output = regex.Replace("===text=== and ===text===", "<h1>$1</h1>");

<h1>text=== and ===text</h1>

I know the problem is that my regex matches greedy. But how to make them non greedy.

Thank you and kind regards. Danny

5条回答
ら.Afraid
2楼-- · 2019-06-20 03:41

To make a Regex not greedy you use ?

So the expression "===(.+?)===" would have two matches for you - so should allow you to generate <h1>text</h1> and <h1>text</h1>

查看更多
兄弟一词,经得起流年.
3楼-- · 2019-06-20 03:41

Simply dd a ? maybe?

===.+?===
查看更多
smile是对你的礼貌
4楼-- · 2019-06-20 03:50

I'll add another variant: ===((?:(?!===).)*)=== (stop catching any character when you encounter ===)... Oh... and for the . problem suggested by WiseGuyEh, I suggest RegexOptions.SingleLine, so that the . match even the newline.

查看更多
祖国的老花朵
5楼-- · 2019-06-20 03:55

And just for info if others have the same issue then I had - to avoid matching also ====Text==== instead of ===Text=== I've extended the pattern like this: (?<!=)===([^=]+)===(?!=)

查看更多
Lonely孤独者°
6楼-- · 2019-06-20 03:56

Add the question mark to your regex: ===(.+?)===

A better alternative would be to have a regex of the following form: ===([^\=]+)===. See this guide on the dot character for an explanation of using the dot sparingly. When benchmarking my supplied regex, it is approx. 50% faster than your regex.

查看更多
登录 后发表回答