-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Issues: Lightning-AI/pytorch-lightning
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Model Checkpointing + FSDP causes Cuda OOM
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
#20312
opened Oct 1, 2024 by
profPlum
Save save_hyperparameters no longer respects linked arguments.
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
ver: 2.4.x
#20311
opened Sep 30, 2024 by
Erotemic
hparams
not loaded when loading checkpoint via LightningCLI
bug
#20310
opened Sep 30, 2024 by
YouRik
The problem shows: version incompatibility from v1.3.x to v2.4
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20308
opened Sep 27, 2024 by
sunhan3787
Trainer
's .init_module()
context does not initialize model on target device
bug
#20307
opened Sep 27, 2024 by
jin-zhe
NCCL backend fails during multi-node, multi-GPU training
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20306
opened Sep 26, 2024 by
raketenolli
the example that shows "The LightningModule also has access to the Hyperparameters" is not correct
docs
Documentation related
needs triage
Waiting to be triaged by maintainers
#20303
opened Sep 26, 2024 by
XinleiRen
RichProgressBar: refresh_rate doesn't affect metric_component
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20300
opened Sep 24, 2024 by
marios1861
Incosistant memory usage comparing to huggingface trainer when using deepspeed
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20299
opened Sep 24, 2024 by
mickeysun0104
Error encountered while using multiple optimizers inside a loop.
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
#20296
opened Sep 23, 2024 by
RAraghavarora
Mid-epoch resume causes a single unwanted validation step (which is not a sanity check)
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.2.x
#20288
opened Sep 19, 2024 by
Youyoun
NeptuneCallback
produces lots of X-coordinates (step) must be strictly increasing
errors
bug
#20281
opened Sep 14, 2024 by
iirekm
SLURM resubmission crashes because of multiprocessing error
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20280
opened Sep 13, 2024 by
antonzub99
Bug Report: Incorrect URI Prefix Stripping in MLflowLogger
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20279
opened Sep 13, 2024 by
awindmann
WandbLogger will cause error on TPU v3-8
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20278
opened Sep 13, 2024 by
buoyancy99
Validation is incorrectly run on resume
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20277
opened Sep 12, 2024 by
PiotrDabkowski
strict = False
does not work when the checkpoint is distributed
bug
#20274
opened Sep 11, 2024 by
NathanGodey
MLFlow logger returns None when MLFlow server is used
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20273
opened Sep 11, 2024 by
lilruwu
_atomic_save with transaction cause "Invalid cross-device link" error
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20270
opened Sep 10, 2024 by
RichardChe
rich progress bar shows v_num as 0.000
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20268
opened Sep 9, 2024 by
npuichigo
_update_dataloader
improperly copies state of subclassed dataloader with attribute names that differ from __init__
parameters.
bug
#20265
opened Sep 8, 2024 by
spenceforce
Registered buffers not moved to correct device when using DeepSpeed Stage 3
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20258
opened Sep 6, 2024 by
amorehead
Weights are misshappen when using model's forward in on_fit_end() hook with FSDP
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
#20255
opened Sep 6, 2024 by
QuentinAndre11
Cannot turn off sampler injection at inference time.
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.1.x
#20253
opened Sep 6, 2024 by
ovavourakis
Mixed precision, ddp and torch.no_grad()
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.1.x
#20251
opened Sep 6, 2024 by
tomsons22
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.