Anyone know how to use Regex in notepad++ to find

Posted 2019-02-22 20:17

I am trying to detect Arabic characters in a webpage's HTML using Notepad++ Ctrl+F with regular expressions. I am entering the following as my search term, and it is returning all characters.

[\u0600-\u06FF]

Sample block of random text I'm working with -

awr4tgagas
بqa4tq4twْq4tw4twtfwd
awfasfrw34جَ4tw4tg
دِيَّة عَرqaw4trawfَبِيَّ

Any ideas why this Regular Expression won't detect the Arabic characters properly and how I should go about this? I have the document encoded as UTF-8.

Thanks!

Tags: regex unicode notepad++ arabic

2 answers

我命由我不由天

#2 · 2019-02-22 20:35

This happens because Notepad++'s regex engine (Boost.Regex, which uses Perl-style syntax) does not support the \uNNNN notation you have provided.

To match a Unicode code point you have to use \x{NNNN}, so your regular expression becomes:

[\x{0600}-\x{06FF}]
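
To check what this range covers outside Notepad++, here is a minimal Python sketch. It assumes Python's built-in re module, which (unlike Notepad++) does accept \uNNNN inside a pattern and does not accept \x{NNNN}. The function name find_arabic and the sample string are made up for illustration.

```python
import re

# U+0600..U+06FF is the same range as the Notepad++ pattern
# [\x{0600}-\x{06FF}]. Python's re module spells it with \uNNNN instead.
ARABIC_BLOCK = re.compile(r"[\u0600-\u06FF]")


def find_arabic(text):
    """Return every character of text that falls in U+0600..U+06FF."""
    return ARABIC_BLOCK.findall(text)


# Hypothetical sample: Latin letters and digits with two Arabic code
# points (U+062C and U+064E) embedded, written as escapes for clarity.
sample = "awfasfrw34\u062C\u064E4tw4tg"
print(find_arabic(sample))
```

Only the two embedded Arabic code points should come back. A line made up entirely of Latin letters and digits should give an empty list.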


beautiful°

#3 · 2019-02-22 20:49

Because Notepad++'s implementation of Regular Expressions requires that you use the

\x{NNNN}

notation to match Unicode characters.

In your example,

\x{0628}

can be used to match the ب (bāʾ, bet, beth, vet) character.

In Notepad++, \u matches an uppercase letter; it does not introduce a Unicode code point.

See http://sourceforge.net/apps/mediawiki/notepad-plus/index.php?title=Regular_Expressions#Ranges_or_kinds_of_characters

for an explanation of Notepad++'s regex syntax.
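
If you need the \x{NNNN} form for some other character, it can be derived from the character's code point. The following is a small Python sketch of that conversion. The helper name to_npp_escape is hypothetical and is not part of Notepad++ or any library.

```python
def to_npp_escape(ch):
    """Format one character as a \\x{NNNN} escape (hypothetical helper).

    ord() gives the Unicode code point; %04X renders it as at least
    four uppercase hexadecimal digits.
    """
    return "\\x{%04X}" % ord(ch)


# U+0628 is the Arabic letter beh used in the answer above.
print(to_npp_escape("\u0628"))
```

Pasting the printed escape into the Notepad++ Find dialog, with Search Mode set to "Regular expression", should match that one character.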
