I put the Spanish text with the Stanford POS Tagger (via NLTK in Python).
Here is my code:
import nltk from nltk.tag.stanford import POSTagger spanish_postagger = POSTagger('models/spanish.tagger', 'stanford-postagger.jar') spanish_postagger.tag('esta es una oracion de prueba'.split())
Result:
[(u'esta', u'pd000000'), (u'es', u'vsip000'), (u'una', u'di0000'), (u'oracion', u'nc0s000'), (u'de', u'sp000'), (u'prueba', u'nc0s000')]
I want to know where I can find what exactly means pd000000, vsip000, di0000, nc0s000, sp000?
source share