Does anyone know what is a chunker in the context of text processing and what is it's usage?
相关问题
- How to get a list of antonyms lemmas using Python,
- How to match dependency patterns with spaCy?
- LUIS - Can we use phrases list for new values in t
- How to initialize a `Doc` in textacy 0.6.2?
- How to initialize second glove model with solution
相关文章
- What's the difference between WordNet 3.1 and
- How should I vectorize the following list of lists
- What created `maxent_treebank_pos_tagger/english.p
- How to determine if a sentence is talking about a
- Triple extraction from a sentance
- How to use Mallet for NER [closed]
- How to translate words in NTLK swadesh corpus rega
- In Keras elmo embedding layer has 0 parameters? is
I don't personally disagree with the other answers, but Jurafsky and Martin give a slightly different definition. For them, chunking is specifically the type of shallow parsing in which there are no recursive phrases.
One example they give is the phrase "the flight from Denver". One parse that would not be generated by a chunker is "[NP the flight [PP from [NP Denver]]]" because it implies a grammar with NP-recursivity.
According to these slides, chunking is an alternative to parsing that provides a partial syntactic structure of a sentence, with a limited tree depth, as opposed to full on parsing.
It is more limited than full parsing, but is sufficient when it comes to extracting or ignoring information, and is thus many times used, as it's faster and more robust than parsing.
Much more information is available in the slides.
Further links:
It's a very simplistic type of parsing, called shallow parsing. The OpenNLP project has a chunker module available, and you can see its documentation for an example of chunking in action