Finding a DOI in a document or page-第2页回答

Finding a DOI in a document or page

2019-03-07 18:40发布

The DOI system places basically no useful limitations on what constitutes a reasonable identifier. However, being able to pull DOIs out of PDFs, web pages, etc. is quite useful for citation information, etc.

Is there a reliable way to identify a DOI in a block of text without assuming the 'doi:' prefix? (any language acceptable, regexes preferred, and avoiding false positives a must)

标签： regex doi

7条回答

我只想做你的唯一

2楼-- · 2019-03-07 19:09

This is a really old and answered question, but here's another potential substitute.

\b10\.(\d+\.*)+[\/](([^\s\.])+\.*)+\b

This assumes that white space is not part of the DOI.

Haven't tested this for false positives, but it seems to be able to find all the edge cases mentioned in this page.

0人赞添加讨论(0) 举报

上一页 1 2

Finding a DOI in a document or page

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间