draft for rollin implementation #12
base: development
Conversation
rollin_samples = random.sample(sample_ids, nr_samples)
with torch.no_grad():
    # restore
    for id_, sample in sample_stack:
@peter-makarov My idea here was to roll in on a subset that grows in size each epoch, sampling a different subset every time (and restoring the previous subset to its original state, i.e. teacher forcing).
Makes sense.
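The epoch-wise schedule discussed above could be sketched as follows. This is only an illustration of the idea, not the PR's code: `rollin_schedule` and `step` are hypothetical names.

```python
import random

def rollin_schedule(sample_ids, epoch, step=100):
    """Roll-in subset scheduler (hypothetical helper, not from the PR):
    each epoch, draw a fresh random subset whose size grows linearly,
    capped at the full training set."""
    nr_samples = min(len(sample_ids), (epoch + 1) * step)
    return random.sample(sample_ids, nr_samples)
```

The previously rolled-in subset would then be restored to its original (teacher-forced) targets before the new subset is rolled in.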
trans/transducer.py (Outdated)
@@ -539,6 +539,99 @@ def continue_decoding():
        return Output(action_history, self.decode_encoded_output(input_, action_history),
                      log_p, None)
    def roll_in(self, sample: utils.Sample, rollin: int) -> None:
The implementation currently handles a single sample; the decoder steps could probably be batched (with sampling / exploration kept as a loop), which would likely speed things up.
rollin should probably be a float representing a probability?
# expert prediction
expert_actions = self.expert_rollout(sample.input, sample.target,
                                     current_alignment.item(), output)
optimal_actions.append(expert_actions)
The targets are always generated by the expert.
optimal_actions.append(expert_actions)

# update states
if np.random.rand() <= rollin:
The next state is either sampled from the expert or from the model itself. Is this the idea? Alternatively, one could always execute the model (and only rely on the expert for the targets).
Yes, that's the idea!
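The mixing step discussed here could look roughly like this. It is only a sketch: `choose_action` is a hypothetical helper, and `log_probs_np` is assumed to be indexable by action id.

```python
import numpy as np

def choose_action(sample_action, expert_actions, log_probs_np, rollin):
    """DAgger-style roll-in mix (hypothetical helper, not the PR's code):
    with probability `rollin`, execute the model's sampled action;
    otherwise, execute the expert action the model currently favours."""
    if np.random.rand() <= rollin:
        return sample_action  # explore: follow the model's own sample
    # exploit: follow the highest-scoring expert action under the model
    return expert_actions[
        int(np.argmax([log_probs_np[a] for a in expert_actions]))]
```

Picking the argmax among the expert actions (rather than a fixed one) is what keeps the update from fighting the model when several actions are optimal.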
@peter-makarov So I've drafted an implementation. I've tested it a bit and it seems to work to some extent; however, I think I am still missing something...
for id_ in rollin_samples:
    sample_stack.append((id_, dataclasses.replace(training_data.samples[id_])))
    transducer_.roll_in(training_data.samples[id_], rollin)

j = 0
for j, batch in enumerate(training_data_loader):
With the current setup, roll-in is performed before each epoch. A problem with this approach could be that the model is updated during the epoch, so the rolled-in target sequences are no longer representative of the current model. It might therefore make more sense to roll in after every training step with some probability.
Agreed, it would be more sound and more useful to sample for each batch (this addresses the errors of the current model checkpoint, not errors that may already have resolved themselves due to the recent parameter updates).
LGTM
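The per-batch variant suggested above could be sketched like this. All names here (`train_epoch`, `rollin_freq`, `roll_in_fn`, `train_step_fn`) are illustrative, not from the PR.

```python
import random

def train_epoch(batches, rollin, rollin_freq, roll_in_fn, train_step_fn):
    """Per-batch roll-in (sketch, not the PR's code): before each update,
    with probability `rollin_freq`, re-generate the batch's targets under
    the current parameters, so they reflect the latest checkpoint."""
    for batch in batches:
        if random.random() < rollin_freq:
            roll_in_fn(batch, rollin)  # targets now match current model
        train_step_fn(batch)
```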
    action = sample_action
else:
    action = expert_actions[
        int(np.argmax([log_probs_np[a] for a in expert_actions]))
So this does not over-correct the model if it already predicts an optimal action, in case multiple actions are optimal.
if char != "":
    output.append(char)

alignment_history = torch.cat(
It doesn't seem like this has to be done via concatenation in a loop. Can this not be rewritten using list append?
]

action_history = torch.cat(
    (action_history, torch.tensor([[[action]]], device=self.device)),
Same comment regarding concatenation in the loop.
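One way to avoid the repeated copies is to collect the per-step tensors in a Python list and call `torch.cat` once after the loop, e.g. (a minimal sketch with stand-in action values):

```python
import torch

# Collect steps in a list, then concatenate once, instead of calling
# torch.cat inside the loop (which copies the whole tensor every step).
actions = []
for action in [3, 1, 4]:  # stand-in for the decoder's per-step actions
    actions.append(torch.tensor([[[action]]]))
action_history = torch.cat(actions, dim=-1)  # single concatenation
```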
Is there no improvement? Test it on a small dataset (e.g. morphological inflection with 100 samples) and batch size 1. Try with and without roll-in; you should see a consistent improvement.