we are trying to implement name entity recognition on millions of comments/feedback and the process appears to be slow. We are thinking of removing stop words/frequent words from the texts and apply ner on them. Does removing stop words affect the accuracy of ner?
标签:
stanford-nlp
相关问题
- Stanford NLP - VP vs NP
- NLTK tokenizer and Stanford corenlp tokenizer cann
- Extracting clause from a Penn Treebank-formatted t
- Extracting triplets from a sentence in Python
- How can I differentiate between a person's nam
相关文章
- Stanford Parser and NLTK windows
- Triple extraction from a sentance
- Why is stanford corenlp gender identification nond
- Stanford Universal Dependencies on Python NLTK
- Issue using Stanford CoreNLP parsing models
- NLTK was unable to find the java file! for Stanfor
- how to run stanford corenlp server on google colab
- Running Stanford corenlp server with custom models
I think it's plausible you will get respectable F1 scores if you run on a sentence with the stop words removed. Ultimately you will have to experiment with it and see if the quality is acceptable for your needs.