
Added implementation for MAE [work in progress - help appreciated] #19

Draft
ariguiba wants to merge 2 commits into master
Conversation

@ariguiba commented Dec 9, 2024

Hi 👋 I'm new here and trying to learn more about SSL techniques. This dataset seemed like a great place to start!

First try at implementing a masked autoencoder
The idea was to extend the SimCLR approach and implement a masked autoencoder (MAE) for self-supervised learning. For now the masking mechanism is turned off, so this is effectively a plain reconstruction autoencoder.

I followed a similar architecture design to SimCLR, using only CNN and linear layers, and similar hyperparameters.
The main setup is as follows (a code sketch follows the list):

  • mask_ratio: 0%
  • Encoder architecture (decoder: symmetric)
    • CNN: num_channels: 25, kernel: 5, stride: 2
    • CNN: num_channels: 25, kernel: 3, stride: 2
    • Linear projection head: 12 neurons (optional)
  • Learning rate: 0.01
  • Num epochs: 100
  • Batch size: 100
  • Optimizer: Adam
  • Loss function: MSE
  • Seed: 42
  • Training on both train + test data
  • t-SNE: on all data, in latent space
  • kNN & linReg: fit on train data, evaluated on test data, in latent space
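
To make this concrete, the model boils down to roughly the following (a minimal PyTorch sketch; the 1×28×28 input shape and the padding values are my assumptions here, adjust to the actual data):

```python
import torch
import torch.nn as nn

class ConvAutoencoder(nn.Module):
    """Reconstruction autoencoder with a SimCLR-style conv encoder.

    Shapes assume 1-channel 28x28 inputs; adjust for the actual dataset.
    """
    def __init__(self, latent_dim=12):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 25, kernel_size=5, stride=2, padding=2),   # 28 -> 14
            nn.ReLU(),
            nn.Conv2d(25, 25, kernel_size=3, stride=2, padding=1),  # 14 -> 7
            nn.ReLU(),
        )
        self.project = nn.Linear(25 * 7 * 7, latent_dim)   # optional projection head
        self.unproject = nn.Linear(latent_dim, 25 * 7 * 7)
        self.decoder = nn.Sequential(                       # symmetric decoder
            nn.ConvTranspose2d(25, 25, kernel_size=3, stride=2,
                               padding=1, output_padding=1),  # 7 -> 14
            nn.ReLU(),
            nn.ConvTranspose2d(25, 1, kernel_size=5, stride=2,
                               padding=2, output_padding=1),  # 14 -> 28
        )

    def encode(self, x):
        h = self.encoder(x)
        return self.project(h.flatten(start_dim=1))

    def forward(self, x):
        z = self.encode(x)
        h = self.unproject(z).view(-1, 25, 7, 7)
        return self.decoder(h)

model = ConvAutoencoder(latent_dim=12)
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)
criterion = nn.MSELoss()  # reconstruction loss
```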

⚠️ Current issues:

  • Linear regression accuracy rises above the baseline but doesn't improve further with training
  • kNN accuracy drops significantly below the baseline
  • No visible cluster structure in the latent space (see the t-SNE projection)
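
For reference, the latent-space evaluation looks roughly like this (a scikit-learn sketch; random placeholder latents stand in for the real `model.encode()` outputs, and the linear probe is written as logistic regression since I'm reporting accuracy):

```python
import numpy as np
from sklearn.manifold import TSNE
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression

# Placeholder latents + labels; in practice these come from model.encode()
rng = np.random.default_rng(42)
Z_train, y_train = rng.normal(size=(1000, 12)), rng.integers(0, 10, 1000)
Z_test, y_test = rng.normal(size=(200, 12)), rng.integers(0, 10, 200)

# kNN probe: fit on train latents, score on test latents
knn = KNeighborsClassifier(n_neighbors=10).fit(Z_train, y_train)
print("kNN accuracy:", knn.score(Z_test, y_test))

# Linear probe
lin = LogisticRegression(max_iter=1000).fit(Z_train, y_train)
print("linear accuracy:", lin.score(Z_test, y_test))

# t-SNE on all latents for the cluster plot
emb = TSNE(n_components=2).fit_transform(np.vstack([Z_train, Z_test]))
```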

My intuition: autoencoders are optimized for reconstruction rather than similarity, which would explain the absence of class clusters. Still, I would expect some structure in the latent space (clusters of features, say), and that structure becomes essential once this is extended to a masked autoencoder.

Any insights? Opinions on this?
Has anyone tried it before?

Any feedback is appreciated 🎉

@dkobak (Collaborator) commented Dec 9, 2024

I haven't tried it.

I haven't looked at your code in detail, but it seems your network is not really training: the loss per batch is constant over epochs, and the reconstruction quality is very poor...

@ariguiba (Author) commented Dec 9, 2024

> I haven't tried it.
>
> I haven't looked at your code in detail, but it seems your network is not really training: the loss per batch is constant over epochs, and the reconstruction quality is very poor...

Yes, that's also my worry: the model isn't learning at all, and I don't understand why. I've tried smaller and bigger models (each time adding 1-2 conv layers or playing around with the projector), but that didn't have any effect.
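
One sanity check I still want to run is overfitting a single fixed batch: a working autoencoder should drive the MSE to near zero there, so if it doesn't, the bug is in the model/optimizer wiring rather than in model capacity. A sketch, reusing the `ConvAutoencoder` from above with random data standing in (also worth trying Adam's default lr of 1e-3, since 0.01 can be too high):

```python
import torch

# Overfit one fixed batch; the loss should approach zero within a few hundred steps.
x = torch.randn(100, 1, 28, 28)  # random stand-in for one real batch
model = ConvAutoencoder(latent_dim=12)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = torch.nn.MSELoss()

for step in range(500):
    optimizer.zero_grad()
    loss = criterion(model(x), x)
    loss.backward()
    optimizer.step()
    if step % 100 == 0:
        print(step, loss.item())
```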

Thank you in advance if you check it out!
