I run POS tags with the Stanford POS Tagger . The tagger returns only one possible tag for the input sentence. For example, when an input clause cries is provided, the POS marker generates (erroneous) βThe_DT clown_NN weeps_NNS._.β.
However, my application will try to analyze the result and may reject the POS marking because there is no way to analyze it. Therefore, in this example, he would reject "The_DT clown_NN weeps_NNS._". but will accept "The_DT clown_NN weeps_VBZ._". which I assume is a low confidence tag for the parser.
Therefore, I would like the POS tag to provide several hypotheses for labeling each word annotated with some kind of confidence value. In this way, my application can select a POS tag with a high degree of certainty that achieves valid parsing for its purposes.
I did not find a way to ask Stanford POS Tagger to create several (n-best) tags for each word (or even for the whole sentence). Is there any way to do this? (Alternatively, I am also fine using another POS tagger with comparable performance that will support this.)
source share