Skip to content

Conversation

@ajaysaini725
Copy link
Contributor

@ajaysaini725 ajaysaini725 commented Jun 21, 2023

Opening this for review but do not merge

# This is what the user request will contain
INPUT_GENERATE_KWARGS = {
'max_new_tokens': 256,
'max_tokens': 256,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did we end up on this? It is kind of confusing because it doesn't say whether it's generated or prompt + generated.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

imo it should be max_new_tokens, max_tokens is not very helpful, because your prompt can be longer than 256 and then you just get a 500 from the server.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants