so, there's a synergy with top_k and num_beams, where you could set
top_k and num_beams equal -- [but it requires do_sample=True

[i'm not sure what the machine learning lingo is for simply making a
beam for each of the top_k candidates
[[but of course they can't do that anyway because they only run
num_beams beams which leaves nothing to swap out {{curious what
algorithm it uses
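
[a toy sketch of the idea, to make the question concrete. this is plain textbook beam search with a per-beam top_k cut, and it is not taken from any library's source, so the real generate() code may well differ. toy_log_probs is a made-up stand-in for a model and beam_search is a hypothetical helper; it is also deterministic, so it leaves out the sampling that do_sample=True would add.

```python
import math


def toy_log_probs(prefix):
    """Made-up stand-in for a model: log-probs over a 4-token vocabulary."""
    weights = [4.0, 3.0, 2.0, 1.0]
    # Rotate the weights by prefix length so each step prefers different tokens.
    shift = len(prefix) % len(weights)
    weights = weights[shift:] + weights[:shift]
    total = sum(weights)
    return [math.log(w / total) for w in weights]


def beam_search(num_beams, top_k, steps):
    """Textbook beam search where each beam only proposes its top_k next tokens."""
    beams = [((), 0.0)]  # (tokens so far, cumulative log-prob)
    for _ in range(steps):
        candidates = []
        for tokens, score in beams:
            log_probs = toy_log_probs(tokens)
            # Per-beam top_k cut: keep only the k most likely next tokens.
            best = sorted(range(len(log_probs)),
                          key=lambda t: log_probs[t],
                          reverse=True)[:top_k]
            for t in best:
                candidates.append((tokens + (t,), score + log_probs[t]))
        # Prune the pooled candidates back down to num_beams survivors.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:num_beams]
    return beams


if __name__ == "__main__":
    for tokens, score in beam_search(num_beams=3, top_k=3, steps=2):
        print(tokens, round(score, 3))
```

with top_k == num_beams the first step starts exactly one beam per top_k candidate. after that, each surviving beam proposes top_k continuations, so there are num_beams * top_k candidates competing for num_beams slots.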
