Allow users to generate texts longer than 1024 tokens #2
Comments
Why half of the previous text and not all?
Would this be something you'd be willing to accept a PR on? I'd be willing to give it a go next week.
Is there some work on this? I'd like to see this feature implemented.
I guess that the length is computed on the generated text, including the prefix. If you feed the whole previous text as a prefix, then you cannot generate anything more once the input is already at the max length. By feeding only half of the previous text, you are guaranteed to have space(*) left for the rest of the generation. This lets you circumvent the length constraint by iterating. (*) at least half the maximum length
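The length arithmetic in that comment can be sketched as follows. This is a minimal illustration, assuming a hypothetical model limit `MAX_LEN` of 1024 tokens; `space_left` is an invented helper, not part of any library:

```python
# Sketch of the length budget described above. MAX_LEN is the assumed
# hard limit on (prefix + generated) tokens.
MAX_LEN = 1024

def space_left(prefix_tokens):
    """Tokens still available for generation after feeding a prefix."""
    return MAX_LEN - len(prefix_tokens)

# Feeding the full previous output can leave no room at all:
full = list(range(MAX_LEN))          # a previous output already at the limit
assert space_left(full) == 0

# Feeding only the second half always guarantees at least half the budget:
half = full[len(full) // 2:]
assert space_left(half) >= MAX_LEN // 2
```

This shows why half (rather than the whole text) is used: it keeps some context while guaranteeing room for new tokens on every iteration.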
I'm currently not working on this; if there's a PR, I'll merge it. The hard part is that the 1024 limit is enforced at the tensor level; I'm not sure what's needed to shift it and handle it efficiently, especially in the batch case.
Yeah, the batch case is where I fail to get it working. I already have code that works for a single batch, but I can't figure out how to make it work properly and efficiently for multiple batches, so I can't create a PR right now. Still, I'd be glad if someone created a PR that solves this problem.
It likely isn't possible to do this at the generation level (like other frameworks do), but we can hack around it by feeding the tail of the previously generated text back in as the prefix and iterating.
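The iterative hack discussed in this thread can be sketched as below. Everything here is hypothetical: `generate` is a stub standing in for a single bounded-length generation call (a real model would produce actual continuations), and `generate_long` shows the sliding-window loop, keeping the second half of each output as the next prefix:

```python
# Hypothetical sketch of iterating past a fixed length limit by feeding
# the second half of each output back in as the prefix.
MAX_LEN = 1024

def generate(prefix, max_len=MAX_LEN):
    # Stub: a real model would continue `prefix` with up to
    # (max_len - len(prefix)) sampled tokens. Here we pad with zeros
    # purely to make the length bookkeeping visible.
    budget = max_len - len(prefix)
    return prefix + [0] * budget

def generate_long(prefix, target_len):
    """Generate `target_len` tokens by repeated bounded-length calls."""
    text = generate(prefix)
    while len(text) < target_len:
        window = text[len(text) // 2:]       # second half as the new prefix
        continuation = generate(window)
        text += continuation[len(window):]   # append only the new tokens
    return text[:target_len]

out = generate_long([1, 2, 3], 3000)
assert len(out) == 3000
```

Note the trade-off: each iteration only sees the last half of the text, so long-range coherence degrades, but every call stays within the tensor-level limit.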