Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement function summarize(text) #1

Open
emcf opened this issue Apr 4, 2023 · 4 comments
Open

Implement function summarize(text) #1

emcf opened this issue Apr 4, 2023 · 4 comments
Labels
bug Something isn't working

Comments

@emcf
Copy link
Owner

emcf commented Apr 4, 2023

Engshell fails with text data larger than the maximum prompt size. Need a non-GPT based summarize(text) function for processing large text data.

@emcf emcf added the bug Something isn't working label Apr 4, 2023
@AzureDominus
Copy link
Contributor

Why must it be non GPT based? Wouldn't recursively summarizing chunks of the conversation using the gpt3.5 chat endpoint be sufficient?

@tondeaf
Copy link

tondeaf commented Apr 4, 2023

Why must it be non GPT based? Wouldn't recursively summarizing chunks of the conversation using the gpt3.5 chat endpoint be sufficient?

It's much better to vectorize large texts using the ADA model and then process it from there. Much cheaper and faster.

"This model's maximum context length is 4097 tokens. However, your messages resulted in 685229 tokens. Please reduce the length of the messages."

@AzureDominus
Copy link
Contributor

Getting the embeddings of the large text doesn't help to summarize it as you can't usefully convert the embeddings back to text. What do you mean by "process it from there"?

@emcf
Copy link
Owner Author

emcf commented Apr 4, 2023

Why must it be non GPT based? Wouldn't recursively summarizing chunks of the conversation using the gpt3.5 chat endpoint be sufficient?

I believe chunking the text and then doing this would work as well, yes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants