Denser CLIP representations

Max Woolf has some shallow resuls suggesting that we might have more powerful CLIP representations by incorporating the activations from additional penultimate layers: https://github.com/minimaxir/imgbeddings/blob/main/DESIGN.md

I think this merits looking into. really wouldn't be that hard, basically just need to add this representational strategy to a CLIP evaluation harness.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

denser-CLIP.md

denser-CLIP.md

Denser CLIP representations

Files

denser-CLIP.md

Latest commit

History

denser-CLIP.md

File metadata and controls

Denser CLIP representations