I would like to use TensorFlow to tag a sequence, namely for part-of-speech tagging. I tried to use the model described here: http://tensorflow.org/tutorials/seq2seq/index.md (which describes a model for translating English into French).
Since in tagging the input and output sequences have the same length, I configured the buckets so that input and output sequences are equal in length, and tried to learn a POS tagger with this model on CoNLL 2000.
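For reference, this is roughly how I set up the model (a sketch; the bucket sizes and hyperparameters are my own choices, and seq2seq_model is the module from the tutorial's translate example):

    from tensorflow.models.rnn.translate import seq2seq_model

    # Equal-length buckets, since each input token gets exactly one tag
    # (unlike translation, where the target may be longer than the source).
    _buckets = [(10, 10), (20, 20), (40, 40)]

    model = seq2seq_model.Seq2SeqModel(
        source_vocab_size=40000,  # word vocabulary
        target_vocab_size=50,     # POS tag vocabulary
        buckets=_buckets,
        size=256,
        num_layers=2,
        max_gradient_norm=5.0,
        batch_size=64,
        learning_rate=0.5,
        learning_rate_decay_factor=0.99)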
However, the decoder sometimes outputs a tagged sequence that is shorter than the input sequence (the EOS tag appears prematurely).
For example:
He believes that the current account deficit will be reduced to 1.8 billion in September.
The above sentence has 18 tokens, which are padded up to 20 (due to bucketing).
When decoding the sentence above, the decoder produces the following:
PRP VBD DT JJ JJ NN MD VB TO VB DT NN IN NN . _EOS . CD CD
So here it ends the sequence (EOS) after 15 tokens, not 18.
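As far as I can tell, the truncation comes from the greedy decode step in the tutorial's translate.py, which cuts the output at the first EOS, roughly like this:

    # This is a greedy decoder - outputs are just argmaxes of output_logits.
    outputs = [int(np.argmax(logit, axis=1)) for logit in output_logits]
    # If there is an EOS symbol in outputs, cut them at that point.
    if data_utils.EOS_ID in outputs:
        outputs = outputs[:outputs.index(data_utils.EOS_ID)]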
How can I make the model aware that the decoded sequence should be the same length as the encoded one?
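One workaround I can think of is to skip the EOS cut entirely and take the argmax of exactly as many step logits as there are input tokens (a sketch; token_ids and output_logits are the names used in the tutorial's decode path):

    # Force the output to the input length: decode exactly num_tokens steps
    # and never cut at EOS. (One could also mask the EOS logit at each step.)
    num_tokens = len(token_ids)  # encoder input length, before padding
    outputs = [int(np.argmax(logit, axis=1))
               for logit in output_logits[:num_tokens]]

But that feels like a post-processing hack; ideally the model itself would learn to emit exactly one tag per input token.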