I just downloaded a Spanish Wordnet from the project GRIAL, the format is XML. How can I use it in Python NLTK?
Besides that, in the same page you can download a tagged corpus in Spanish. How can I incorporate it as well?
I just downloaded a Spanish Wordnet from the project GRIAL, the format is XML. How can I use it in Python NLTK?
Besides that, in the same page you can download a tagged corpus in Spanish. How can I incorporate it as well?
Use XMLCorpusReader to load XML data as corpus
Here's the code to do that
from nltk.corpus.reader import XMLCorpusReader
reader = XMLCorpusReader(dir, file)
A fully working example which uses XMLCorpusReader is given here