I'm interested in learning more about Natural Language Processing (NLP) and am curious if there are currently any strategies for recognizing proper nouns in a text that aren't based on dictionary recognition? Also, could anyone explain or link to resources that explain the current dictionary-based methods? Who are the authoritative experts on NLP or what are the definitive resources on the subject?
相关问题
- How to get a list of antonyms lemmas using Python,
- How to match dependency patterns with spaCy?
- LUIS - Can we use phrases list for new values in t
- How to initialize a `Doc` in textacy 0.6.2?
- How to initialize second glove model with solution
相关文章
- What's the difference between WordNet 3.1 and
- How should I vectorize the following list of lists
- What created `maxent_treebank_pos_tagger/english.p
- How to determine if a sentence is talking about a
- Triple extraction from a sentance
- How to use Mallet for NER [closed]
- How to translate words in NTLK swadesh corpus rega
- In Keras elmo embedding layer has 0 parameters? is
It depends on what you mean by dictionary-based.
For example, one strategy would be to take things that aren't in a dictionary and try to proceed on the assumption that they're proper nouns. If this leads to a sensible parse, consider the assumption provisionally validated and keep going, otherwise conclude that they aren't.
Other ideas:
-- MarkusQ
some toolkits suggested: 1. Opennlp: there is a Named Entity Recognition component for your task 2. LingPipe: also a NER component for it 3. Stanford NLP package: excellent package for academic usage, maybe not commercial friendly. 4. nltk: a Python NLP package
Though this is for Bengali language, but it can draw a common procedure identified proper noun. So I hope this will be helpful for you. Please check the folowing link: http://www.mecs-press.org/ijmecs/ijmecs-v6-n8/v6n8-1.html
Besides the dictionary-based approach, two others come to my mind:
The field is mostly called named-entity extraction and often considered a subfield of information extraction. A good starting point for the different fields of NLP is usually the according chapter in the Oxford Handbook of Computational Linguistics:
Oxford Handbook of Computational Linguistics http://ukcatalogue.oup.com/images/en_US/covers/medium/9780198238829_140.jpg
If you're interested in the implementation of natural language processing and python is your programming language, then this can be a very informative resource: http://www.youtube.com/watch?v=kKe4M4iSclc
Try searching for "named entity recognition"--that's the term that's used in the NLP literature for this sort of thing.