alibaba / ChatLearn Public

Notifications You must be signed in to change notification settings
Fork 24
Star 312

Code
Issues 10
Pull requests 8
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: alibaba/ChatLearn

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

10 Open 20 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

while self.generation_config.eos_token_id is list type, self.tokenizer.eos_token_id wiil be set to None

#263 opened Feb 27, 2025 by haiasd

[BUG] Resource temporarily unavailable

#262 opened Feb 26, 2025 by lxy-nlp

[BUG] env var ENABLE_VLLM is not natively a bool

#261 opened Feb 14, 2025 by slimfrkha

[BUG] fails to utilize all instances when num_replica of next model is less than the number of output batch_size

#209 opened Jan 17, 2025 by haolin-nju

megatron_utils.py in load_checkpoint: args.iteration = state_dict['iteration'] TypeError: 'NoneType' object is not subscriptable

#198 opened Jan 2, 2025 by carrot0117

[train_rlhf_llama] When using vllm，Time-consuming optimization is not obvious as expectation.

#197 opened Dec 31, 2024 by tkqie

[BUG] Context parallelism not enabled in SFT

#189 opened Dec 24, 2024 by slimfrkha

[Alignment training (DPO)] Megatron support for QWEN2.5 series models

#183 opened Dec 18, 2024 by Hevans123

[Feature]Use Megatron-core dist_checkpointing to load checkpoint with different parallel strategies

#169 opened Dec 5, 2024 by SeaOfOcean

[Feature]support to parameter sync when trainer_tp divides inference_tp for megatron core model

#105 opened Sep 25, 2024 by charles9304

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly