Issues: Lightning-AI/pytorch-lightning
ModelCheckpointCallback is triggered by mistake after every validation stage when using manual optimization
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20459
opened Nov 29, 2024 by
silverbulletmdc
Jax Support within Lightning
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20458
opened Nov 28, 2024 by
ludwigwinkler
[BUG] (DataLoader) sanity check fails due to Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor)
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20456
opened Nov 27, 2024 by
MathiasBaumgartinger
Ask a Question and Chat with us are not working.
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20455
opened Nov 27, 2024 by
sorenwacker
"FileNotFoundError: The provided path is not a valid DeepSpeed checkpoint" when using strategy='deepspeed_stage_2'
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20453
opened Nov 26, 2024 by
ShiweiWu98
Error: Invalid value for '--accelerator': 'auto' is not one of 'cpu', 'gpu', 'cuda', 'mps', 'tpu'.
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20451
opened Nov 26, 2024 by
wkd88
Make sure the upcoming change in the default for weights_only from False to True is handled correctly
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20450
opened Nov 26, 2024 by
lantiga
BatchSizeFinder safety margin
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20447
opened Nov 25, 2024 by
edmcman
Add on_validation_model_train/eval to Callback API as well
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20441
opened Nov 22, 2024 by
yundai424
Slurm multi-node work fine but multi-gpu doesn't
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20438
opened Nov 22, 2024 by
atifkhanncl
Multi-gpu training with slurm times out
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.3.x
#20434
opened Nov 19, 2024 by
nightingal3
Make save_hyperparameters consistent for CLI and hardcoded training for custom python objects
feature
Is an improvement or enhancement
needs triage
Waiting to be triaged by maintainers
#20432
opened Nov 19, 2024 by
cgebbe
When interrupting a run with Ctrl+C, sometimes the WandbLogger does not upload a checkpoint artifact
bug
Something isn't working
needs triage
Waiting to be triaged by maintainers
ver: 2.4.x
#20425
opened Nov 16, 2024 by
edmcman
Why only one GPU is getting used in the kaggle kernel
waiting on author
Waiting on user action, correction, or update
#20424
opened Nov 16, 2024 by
KeesariVigneshwarReddy
Weird error while training a model with tabular data!!!! Some problem related self.log_dict
bug
Something isn't working
ver: 2.4.x
#20423
opened Nov 16, 2024 by
KeesariVigneshwarReddy
Log default metrics
feature
Is an improvement or enhancement
logger
Related to the Loggers
#20418
opened Nov 13, 2024 by
ierezell
seed_everything(..., workers=True) causes the Dataloader to apply exactly the same augmentations each epoch if they sample values from torch.distributions
bug
#20412
opened Nov 12, 2024 by
nan-dre
Update dataset at "on_train_epoch_start", but "training_step" still gets old data
bug
Something isn't working
loops
Related to the Loop API
waiting on author
Waiting on user action, correction, or update
#20407
opened Nov 8, 2024 by
Yak1m4Sg
FSDP full state dict mangles fsspec path
bug
Something isn't working
ver: 2.4.x
ver: 2.5.x
#20406
opened Nov 8, 2024 by
oceanusxiv
How to deal with uneven inputs in DDP with sharded data without hanging
discussion
In a discussion stage
#20404
opened Nov 7, 2024 by
ssharpe42
PytorchStreamReader failed reading zip archive: not a ZIP archive
bug
Something isn't working
checkpointing
Related to checkpointing
strategy: deepspeed
ver: 2.4.x
#20398
opened Nov 6, 2024 by
Crazy-LittleBoy
put the monitor metric into default filename for ModelCheckpoint
feature
Is an improvement or enhancement
#20397
opened Nov 5, 2024 by
VDFaller
Light / dark mode for documentation
bug
Something isn't working
docs
Documentation related
ver: 2.5.x
#20396
opened Nov 5, 2024 by
nbrosse
Gradient checkpointing and ddp do not work together
bug
Something isn't working
repro needed
The issue is missing a reproducible example
ver: 2.4.x
#20395
opened Nov 4, 2024 by
rubenweitzman
Error if SLURM_NTASKS != SLURM_NTASKS_PER_NODE
ver: 2.4.x
working as intended
Working as intended
#20391
opened Nov 4, 2024 by
guarin