Head-finding rules for noun phrases [closed]

2019-02-16 03:15发布

The Penn Treebank format does not annotate the internal structure of a noun phrase, e.g.

(NP (JJ crude) (NN oil) (NNS prices))

(NP
    (NP (DT the) (JJ big) (JJ blue) (NN house))
    (SBAR
      (WHNP (WDT that))
      (S
        (VP (VBD was)
          (VP (VBN built)
            (PP (IN near)
              (NP (DT the) (NN river)))))))

I would like to extract the heads (prices and house). Do you know of any tool that can do this?

标签： parsing nlp

3条回答

在下西门庆

2楼-- · 2019-02-16 03:32

Michael Collins dissertation (Appendix A) includes head-finding rules for the Penn Treebank that work reasonably well and are not difficult to implement. They're far from perfect, though, since it's not the easiest task.

The work by David Vadas and James Curran on NP structure in the Penn Treebank could also be relevant:

David Vadas's website with additional NP annotation:
Papers:
- Adding Noun Phrase Structure to the Penn Treebank
- Parsing Noun Phrases in the Penn Treebank

0人赞添加讨论(0) 举报

贪生不怕死

3楼-- · 2019-02-16 03:41

As aab suggested, simple deterministic head-finding rules can work quite well (also see references to Magerman or Charniak head-finding rules for similar approaches).

You might also look at extracting dependency structure from the constituent trees. The Stanford toolset does this quite well: See http://nlp.stanford.edu/software/stanford-dependencies.shtml

0人赞添加讨论(0) 举报

趁早两清

4楼-- · 2019-02-16 03:45

You can also find head finding rules of English in Dan Bikel 's thesis (if you need source code, you can find in his homepage in parser software)

0人赞添加讨论(0) 举报

Head-finding rules for noun phrases [closed]

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间