Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About processed data #34

Closed
Z-Diviner opened this issue May 22, 2024 · 2 comments
Closed

About processed data #34

Z-Diviner opened this issue May 22, 2024 · 2 comments

Comments

@Z-Diviner
Copy link

Hello, it's an honor to read your paper, which has inspired me deeply. When downloading your preprocessed data, I found the following four files, representing examples of how they were processed. Can you help me explain them?
image

@Z-Diviner
Copy link
Author

And could you provide some information about the data processed on the Bird dataset?

@BeachWang
Copy link
Owner

Hi, thank you for your interest in our paper.

The EUCDISQUESTIONMASK implies that the application of selector with only masked question similarity. The EUCDISMASKPRESKLSIMTHR considers the both similarities of masked question and the skeleton of pre-predicted SQL. The QA says we represent examples with questions and queries, and without database schemas. The 150 is the limit of output length of LLM, while 10000 and 4096 are the limit of total length.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants