Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

从头到尾训练自己的模型 #23

Open
Huahuoy opened this issue Nov 29, 2024 · 1 comment
Open

从头到尾训练自己的模型 #23

Huahuoy opened this issue Nov 29, 2024 · 1 comment

Comments

@Huahuoy
Copy link

Huahuoy commented Nov 29, 2024

请问如果我想从头训练一个自己的模型,从构建数据开始,我看到你的readme里面包括data_construct的介绍,那examples_ctx.json里面有good_res和bad_res,请问这种是如何生成的呢,对于model training 是只使用了类似于finetune_train_examples.jsonl里面的格式的数据吗?所以请问你testset里面的数据集是为了验证你的模型的优越性吗?请问你可以详细说一下bpo_test.json这个文件的作用吗?这个文件是如何生成的呢?

@Huahuoy
Copy link
Author

Huahuoy commented Nov 29, 2024

整个流程其实我大概清楚了,但是我如何得到类似于examples_ctx.json这样的数据呢?bpo数据集没有instruction和context字段

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant