How to use the Spanish Wordnet in NLTK?

2020-07-22 19:18发布

问题:

I just downloaded a Spanish Wordnet from the project GRIAL, the format is XML. How can I use it in Python NLTK?

Besides that, in the same page you can download a tagged corpus in Spanish. How can I incorporate it as well?

回答1:

Use XMLCorpusReader to load XML data as corpus

Here's the code to do that

from nltk.corpus.reader import XMLCorpusReader
reader = XMLCorpusReader(dir, file)

A fully working example which uses XMLCorpusReader is given here