Regexp Greek chars by number

2020-04-19 04:39发布

I deal with strings that contain Greek and English (Latin) text. I'd like to use a regex to catch all the Greek words that contain 4 or more characters on them.

Using regexp manual I figure out that I can use \p{Greek} to grab all Greek words and \w{4,} in order to grab 4+ character words. However, these two don't work together, from various tests I made.

Is there any way to do what I want using 1 regexp expression? Strings are UTF-8 and come out of tweets.

Regards

标签： ruby regex utf-8

1条回答

可以哭但决不认输i

2楼-- · 2020-04-19 04:59

Are you using the UTF-8 pattern modifier?

/\p{Greek}{4,}/u

0人赞添加讨论(0) 举报

Regexp Greek chars by number

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间