I'm researching how to extract keyphrases from document for my thesis.
In my research, I used Naive Bayes classifier machine learning for creating a training model of the candidate term features. One of features is PoS tag, I think this feature is important for specifying a term is keyphrase or not.
But the input of Naive Bayes (NB) classifier is numbers and the PoS tag is a string.
So I don't know the way to represent PoS tag feature as a number in order to become a input feature for NB classifier.
Please help me to give your advice.
Thanks and regards, Hien Su