You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have been reviewing the architecture described in Section 3.1 ("Macro Architecture") of the paper, where the downsampler is described as a "batch normalized 3×3 CNN layer with stride 2" that reduces the image resolution by half.
However, in the Downsample class in mamba_vision.py (227-254), I noticed that the Batch Normalization step appears to be omitted, with only the 3×3 convolution (stride 2) implemented:
Could you please clarify whether the Batch Normalization is intended to be part of the downsampler, as described in the paper, or if the omission in the code is deliberate?
Thank you very much for your time and assistance!
Best regards,
SeunghanYu
The text was updated successfully, but these errors were encountered:
Thank you so much for your attention to the details. This is indeed a typo and we have already removed the word "batch normalized" to avoid any confusions in the next iteration (camera-ready version) of the manuscript.
Thank you so much for the clarification.
I’m glad to hear the updated manuscript addresses this point, and I look forward to reading the newly updated version.
Thank you again for your prompt response and for sharing these updates!
Hi @johnnynunez and @ahatamiz !
Thank you for your excellent work on MambaVision!
I have been reviewing the architecture described in Section 3.1 ("Macro Architecture") of the paper, where the downsampler is described as a "batch normalized 3×3 CNN layer with stride 2" that reduces the image resolution by half.
However, in the
Downsample
class inmamba_vision.py (227-254)
, I noticed that the Batch Normalization step appears to be omitted, with only the 3×3 convolution (stride 2) implemented:Could you please clarify whether the Batch Normalization is intended to be part of the downsampler, as described in the paper, or if the omission in the code is deliberate?
Thank you very much for your time and assistance!
Best regards,
SeunghanYu
The text was updated successfully, but these errors were encountered: