Questions about Beam search algorithim #90

gabbyzyk · 2024-10-01T08:27:47Z

Thansk for your great work,
how ever I have a question about the use of beam search
During the inference , the generative model M sequentially generates textual outputs y, which consist of multiple segments y = [y1, ..., yT]. At each time step t, I want to understand if beam search generates b candidate sequences [y1, y2, ..., yt] based on their scores. After the generation is complete, the sequence with the highest score is selected.
I would greatly appreciate it if you could find the time to answer my question.

fate-ubw · 2024-10-06T06:51:15Z

First of all, if you want to understand the logic of beam search in the selfrag algorithm, then you need to first understand the beam search algorithm based on tokens.
The difference between the beam search algorithm used in the long form is that sentence is used as the basic unit.
For your question, I have to admit that it is still very difficult to understand the longform beam search in the selfrag algorithm, I think the best way is to compare the code and the paper at the same time, and use debug to understand the reasoning process of the beam search step by step, it is really difficult to explain accurately by language and can be easily understood.
There are some bugs in the official selfrag code longform reasoning that can confuse beginners. I fixed some bugs in the selfrag code in raglab, and rewrote this part of the algorithm according to the algorithm proposed in the paper, and implemented the real sentence level beam search, selfrag longform beam search
Hopefully, the above answers can help you ~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about Beam search algorithim #90

Questions about Beam search algorithim #90

gabbyzyk commented Oct 1, 2024

fate-ubw commented Oct 6, 2024

Questions about Beam search algorithim #90

Questions about Beam search algorithim #90

Comments

gabbyzyk commented Oct 1, 2024

fate-ubw commented Oct 6, 2024