Utkarsh/continual learning benchmark upgrade by 1Utkarsh1 · Pull Request #3 · 1Utkarsh1/Continual-Learning

1Utkarsh1 · 2026-05-25T17:24:45Z

No description provided.

Copilot

Pull request overview

This PR upgrades the continual-learning benchmark by adding ER-ACE and GDumb strategies, expanding smoke/integration coverage, and enhancing reporting artifacts to include memory-budget context in leaderboard outputs.

Changes:

- Added ER-ACE (asymmetric CE masking + replay) and GDumb (class-balanced memory + retrain-from-scratch) strategies and wired them into strategy creation + CLI method selection.
- Introduced a class-balanced replay buffer utility to support exemplar-only baselines like GDumb.
- Updated reporting/README/docs assets to include memory-budget information and refreshed generated benchmark artifacts.
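
To make "asymmetric CE masking + replay" concrete, here is a minimal pure-Python sketch of the idea as it is usually described for ER-ACE: incoming examples are scored with a softmax restricted to the classes present in the current batch, while replayed examples use the ordinary softmax over all classes. This is an illustration only, not the code in `src/cl_bench/strategies/er_ace.py`; the function names `cross_entropy` and `er_ace_loss` and the equal weighting of the two terms are assumptions.

```python
import math


def cross_entropy(logits, target, allowed=None):
    """Cross-entropy of one example, normalising only over `allowed` classes."""
    classes = list(range(len(logits))) if allowed is None else sorted(allowed)
    if target not in classes:
        raise ValueError("target class must be among the allowed classes")
    peak = max(logits[c] for c in classes)  # subtract the max for numerical stability
    log_norm = peak + math.log(sum(math.exp(logits[c] - peak) for c in classes))
    return log_norm - logits[target]


def _mean(values):
    values = list(values)
    return sum(values) / len(values) if values else 0.0


def er_ace_loss(new_logits, new_labels, replay_logits, replay_labels):
    """Hypothetical helper: masked loss on incoming data plus plain loss on replay."""
    present = set(new_labels)  # classes that appear in the incoming batch
    incoming = _mean(
        cross_entropy(row, label, allowed=present)
        for row, label in zip(new_logits, new_labels)
    )
    replayed = _mean(
        cross_entropy(row, label)
        for row, label in zip(replay_logits, replay_labels)
    )
    return incoming + replayed
```

Because the incoming term never normalises over absent classes, gradients from new data do not push down the logits of older classes; only the replay term touches them.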

Reviewed changes

Copilot reviewed 18 out of 21 changed files in this pull request and generated 4 comments.

File	Description
tests/test_strategies.py	Adds unit tests for ER-ACE, GDumb, and the balanced replay buffer.
tests/test_integration_smoke.py	Extends smoke suite to cover `er_ace` and `gdumb`.
tests/test_config.py	Asserts `gdumb_epochs` is loaded from config.
src/cl_bench/strategies/replay.py	Adds `BalancedReplayBuffer` alongside reservoir replay buffer.
src/cl_bench/strategies/gdumb.py	Implements GDumb strategy and factory.
src/cl_bench/strategies/er_ace.py	Implements ER-ACE strategy and factory.
src/cl_bench/strategies/__init__.py	Exports new strategies and adds `er_ace` / `gdumb` to `create_strategy`.
src/cl_bench/reporting.py	Adds memory budget aggregation + reporting column; improves retention-curve alignment.
src/cl_bench/experiments.py	Logs `gdumb_epochs` into run summary metadata.
src/cl_bench/config.py	Adds `gdumb_epochs` to config schema + parsing defaults.
src/cl_bench/cli.py	Adds new methods and expands allowed model choices.
README.md	Documents ER-ACE/GDumb and adds a GDumb command + updated leaderboard format.
docs/BENCHMARK_CARD.md	Updates benchmark card to include ER-ACE and GDumb descriptions.
docs/assets/split_cifar10_headline/summary.json	Updates generated report summary with GDumb runs + memory field.
docs/assets/split_cifar10_headline/README.md	Updates generated report README with GDumb and memory column.
docs/assets/split_cifar10_headline/leaderboard.csv	Updates generated leaderboard CSV schema + GDumb row.
configs/split_cifar10_headline.yaml	Adds `strategy.gdumb_epochs`.
configs/smoke.yaml	Adds `strategy.gdumb_epochs` for smoke runs.
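
The `BalancedReplayBuffer` added to `src/cl_bench/strategies/replay.py` is not shown in this excerpt. As a rough illustration of what a class-balanced memory can look like, the sketch below follows the greedy balancing rule commonly associated with GDumb: a sample is admitted only while its class is below an equal share of the budget, and when the memory is full a random sample from the currently largest class is evicted. The class name `BalancedBufferSketch` and its methods are hypothetical and may differ from the PR's implementation.

```python
import random


class BalancedBufferSketch:
    """Hypothetical class-balanced memory with a fixed total capacity."""

    def __init__(self, capacity, seed=0):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self._by_class = {}  # label -> list of stored examples
        self._rng = random.Random(seed)

    def __len__(self):
        return sum(len(bucket) for bucket in self._by_class.values())

    def add(self, example, label):
        """Try to store `example`; return True if it was admitted."""
        seen = set(self._by_class) | {label}
        quota = self.capacity / len(seen)  # equal share per class seen so far
        if len(self._by_class.get(label, [])) >= quota:
            return False  # this class already holds its share
        if len(self) >= self.capacity:
            # Memory is full: evict a random sample from the largest class.
            largest = max(self._by_class, key=lambda c: len(self._by_class[c]))
            bucket = self._by_class[largest]
            bucket.pop(self._rng.randrange(len(bucket)))
        self._by_class.setdefault(label, []).append(example)
        return True

    def class_counts(self):
        return {c: len(b) for c, b in self._by_class.items() if b}
```

With a budget of 6 and three classes arriving one after another, the memory ends up holding two examples per class, which is the property an exemplar-only baseline like GDumb relies on before retraining from scratch.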

+            task_classes=[
+                [int(label) for label in task.classes]
+                for task in config.tasks
+                if task.classes != "all"
+            ],


        forgetting = [_metric(record, "average_forgetting") for record in method_records]
        backward_transfer = [_metric(record, "backward_transfer") for record in method_records]
        runtimes = [record.runtime_seconds for record in method_records]
+        memory_budgets = [_metric(record, "replay_buffer_size") for record in method_records]
        seeds = ",".join(


+        self._fit_memory(task_id)
+        val_metrics = self.evaluate(val_loader)
+        return [
+            {
+                "task_id": task_id,
+                "epoch": 1,
+                "train_loss": 0.0,
+                "train_accuracy": 0.0,
+                "train_examples": float(example_count),
+                **{f"val_{key}": value for key, value in val_metrics.items()},
+            }
+        ]


-    parser.add_argument("--model", choices=["linear", "mlp", "small_cnn", "cnn"])
+    parser.add_argument(
+        "--model",
+        choices=["linear", "mlp", "small_cnn", "cnn", "cifar_convnet", "resnet18_cifar"],


1Utkarsh1 added 2 commits May 26, 2026 02:04

Add high-memory continual learning baselines

eea83cb

Report high-memory GDumb benchmark

01a961c

Copilot AI review requested due to automatic review settings May 25, 2026 17:24

Copilot started reviewing on behalf of 1Utkarsh1 May 25, 2026 17:24

Copilot AI reviewed May 25, 2026

Add research paper benchmark scaffold

7a97f70

1Utkarsh1 merged commit ad1d3e5 into main May 25, 2026
6 checks passed

1Utkarsh1 merged 3 commits into main from Utkarsh/continual-learning-benchmark-upgrade

2 participants
