-
Well, I think if we really need to sample from the beam_size outputs, this can always be done as a post-processing step: let the user sample or rerank over the returned hypotheses. That being said, if we really want this embedded as an option, we might want to add some form of random selection over the finished beams.
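For illustration, a minimal post-processing sketch (hypothesis strings, scores, and variable names are made up, not the eole API): given the n_best hypotheses and their log probabilities returned for one example, sample one hypothesis instead of always taking the top-scoring one.

```python
import math
import random

# Hypothetical output of the predictor for one example: n_best hypotheses
# together with their log-probabilities (placeholder values).
hypotheses = ["hypothesis A", "hypothesis B", "hypothesis C"]
log_probs = [-1.2, -1.5, -2.3]

# Instead of taking the argmax, sample one hypothesis proportionally to exp(log_prob).
weights = [math.exp(lp) for lp in log_probs]
sampled = random.choices(hypotheses, weights=weights, k=1)[0]
print(sampled)
```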
-
My point is not really relevant, in fact. When using beam_size = 20 with greedy search, it automatically returns 20 hypotheses for each example. The code sorts those according to log prob, but when using an estimator just afterwards we can rerank them.
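A minimal sketch of what that reranking could look like, assuming the hypotheses come back sorted by log prob and `quality_estimator` is a hypothetical scoring callable (not an eole function):

```python
def rerank(hypotheses, quality_estimator):
    """Re-sort hypotheses by an external estimator score instead of log prob."""
    scored = [(quality_estimator(hyp), hyp) for hyp in hypotheses]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [hyp for _, hyp in scored]

# Toy usage: here the "estimator" is just the hypothesis length.
reranked = rerank(["a short one", "a somewhat longer hypothesis"], quality_estimator=len)
print(reranked)
```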
-
@francoishernandez @funboarder13920
When we perform topk / nucleus sampling in greedy_search, we do the following:
For a given batch_size / beam_size pair, we start by tiling each example beam_size times.
Then we advance step by step, sampling a "random" token at each step.
At the end, when all beams are finished, we do this:
https://github.com/eole-nlp/eole/blob/main/eole/predict/greedy_search.py#L286-L295
which means we retain the n_best or beam_size beams of each example in the batch according to "scores" (log prob).
The question is:
Should we instead pick n_best / beam_size beams at random from the pool of finished beams, to make sure we keep some diversity?
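To make the two options concrete, here is a small sketch (the data and variable names are made up, not the actual greedy_search.py code):

```python
import random

# Pool of finished beams for one example: (logprob_score, token_ids) pairs.
finished = [(-0.9, [5, 7, 2]), (-1.4, [5, 9, 2]), (-2.1, [6, 3, 2]), (-2.8, [8, 3, 2])]
n_best = 2

# Current behaviour: keep the n_best highest-scoring beams.
top_by_score = sorted(finished, key=lambda beam: beam[0], reverse=True)[:n_best]

# Alternative under discussion: pick n_best beams uniformly at random, so the
# diversity introduced by top-k / nucleus sampling is preserved in the output.
random_pick = random.sample(finished, k=n_best)

print(top_by_score)
print(random_pick)
```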