Sentence segmentation tools to use when input sent

2019-04-10 12:12发布

问题:

Suppose there is a sentence like "find me some jazz music and play it", where all the text is normalized and there are no punctuation marks (output of a speech recognition library).

What online/offline tools can be used to do "sentence segmentation" other than the naive approach of splitting on conjunctions ?

Input:

find me some jazz music and play it

Output:

find me some jazz music
play it

回答1:

A dependence parser should help.



回答2:

You can use a semantic role tagger like mate tools etc... for this. It will extract the predicates and the related arguments in prop bank style.