Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FEAT: Support RWKV Pile #535

Draft
wants to merge 7 commits into
base: main
Choose a base branch
from
Draft

Conversation

Bojun-Feng
Copy link
Contributor

Resolve #533

Add support for rwkv-4-pile models. Successfully tested the 169m variant on a local Mac setup.

I would like some guidance on setting the model size. Historically, we've rounded up the sizes for models with parameters fewer than 1 billion. However, with rwkv-4-pile, we encounter two smaller models: 169m and 430m.

I am leaning towards simply excluding the 169m variant and setting the smallest size threshold at 430m (for 1b). However, alternative suggestions are welcomed.

@XprobeBot XprobeBot added this to the v0.5.2 milestone Oct 13, 2023
@XprobeBot XprobeBot modified the milestones: v0.5.2, v0.6.0 Oct 16, 2023
@Bojun-Feng
Copy link
Contributor Author

It seems that all the tests are actually passing, but somehow there is a segmentation fault from Stable Diffusion test. Not sure if this is just a probabilistic error or do I need to fix something.

@UranusSeven
Copy link
Contributor

Hi, could you please also update the docs? Here's an example: #587

@XprobeBot XprobeBot modified the milestones: v0.6.0, v0.6.1, v0.6.2, v0.6.3 Nov 3, 2023
@XprobeBot XprobeBot modified the milestones: v0.6.3, v0.6.4, v0.6.5 Nov 21, 2023
@XprobeBot XprobeBot modified the milestones: v0.6.5, v0.6.6, v0.7.0 Dec 1, 2023
@XprobeBot XprobeBot modified the milestones: v0.7.1, v0.7.2 Dec 12, 2023
@XprobeBot XprobeBot modified the milestones: v0.10.2, v0.10.3, v0.11.0 Apr 19, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.0, v0.11.1, v0.11.2 May 11, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.3, v0.11.4, v0.12.0, v0.12.1 May 31, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.1, v0.12.2 Jun 14, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.2, v0.12.4, v0.13.0, v0.13.1 Jun 28, 2024
@XprobeBot XprobeBot modified the milestones: v0.13.1, v0.13.2 Jul 12, 2024
@XprobeBot XprobeBot modified the milestones: v0.13.2, v0.13.4 Jul 26, 2024
@XprobeBot XprobeBot modified the milestones: v0.14, v0.15 Sep 3, 2024
@XprobeBot XprobeBot modified the milestones: v0.15, v0.16 Oct 30, 2024
@XprobeBot XprobeBot modified the milestones: v0.16, v1.x Nov 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

FEAT: added support for rwkv
3 participants