TopKDecoder #177

Hongzl1996 · 2018-11-05T12:02:44Z

Hi,
I wonder if rnn.forward_step changes the order of (batch_size*self.k) dimension ?
With the code about initializing sequence_scores：

and in each step:

It seems like sequence_scores is updated as (assume that selk.k = 3):

If hidden and inflated_encoder_outputs should be calculated as follow?

inflated_encoder_outputs =  _inflate(encoder_outputs,self.k,1).view(batch_size*self.k, -1)  
hidden_shape = encoder_hidden.size()
hidden = _inflate(encoder_hidden, self.k, 2).view(hidden_shape[0], batch_size*self.k, hidden_shape[2])

The text was updated successfully, but these errors were encountered:

KwanWaiChung · 2019-05-29T15:43:25Z

hi, I am studying the code and have similar doubts. However, can you be clear what you mean by decoder_output? do you actually mean log_softmax_output?

Hongzl1996 · 2019-05-30T03:52:47Z

@JojoFisherman Yeah, I mean the output probability of decoder, i.e. log_softmax_output.

KwanWaiChung · 2019-05-30T06:34:45Z

I have the same question. It surprised me that no one has answered this. If theres really something wrong in the beam search, surely it will output some weird sequence. Do you have any conclusion about this?

Hongzl1996 · 2019-05-30T06:59:40Z

It seems some issues have referred that beam search doesn't work correctly. Unfortunately, maybe this repo is not active maintained now. Currently, I use fairseq (pytorch version) to conduct some related experiments.

GZJAS · 2019-06-11T14:45:03Z

I studied the codes these days, and I thought you can use the torch.repeat_interleave. Such as follow:
hidden = tuple([torch.repeat_interleave(h, self.k, dim=1) for h in encoder_hidden])
inflated_encoder_outputs = torch.repeat_interleave(encoder_outputs, self.k, dim=0)

muncok · 2020-02-21T08:50:27Z

I studied the codes these days, and I thought you can use the torch.repeat_interleave. Such as follow:
hidden = tuple([torch.repeat_interleave(h, self.k, dim=1) for h in encoder_hidden])
inflated_encoder_outputs = torch.repeat_interleave(encoder_outputs, self.k, dim=0)

I had the problem with batch_size > 1, but after applying this comment, then it works now.

Thank you!!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TopKDecoder #177

TopKDecoder #177

Hongzl1996 commented Nov 5, 2018 •

edited

Loading

KwanWaiChung commented May 29, 2019

Hongzl1996 commented May 30, 2019

KwanWaiChung commented May 30, 2019

Hongzl1996 commented May 30, 2019

GZJAS commented Jun 11, 2019

muncok commented Feb 21, 2020

TopKDecoder #177

TopKDecoder #177

Comments

Hongzl1996 commented Nov 5, 2018 • edited Loading

KwanWaiChung commented May 29, 2019

Hongzl1996 commented May 30, 2019

KwanWaiChung commented May 30, 2019

Hongzl1996 commented May 30, 2019

GZJAS commented Jun 11, 2019

muncok commented Feb 21, 2020

Hongzl1996 commented Nov 5, 2018 •

edited

Loading