Missing Batch Normalization in Downsample Class #68

SeunghanYu · 2025-03-25T06:06:00Z

Thank you for your excellent work on MambaVision!

I have been reviewing the architecture described in Section 3.1 ("Macro Architecture") of the paper, where the downsampler is described as a "batch normalized 3×3 CNN layer with stride 2" that reduces the image resolution by half.
However, in the Downsample class in mamba_vision.py (227-254), I noticed that the Batch Normalization step appears to be omitted, with only the 3×3 convolution (stride 2) implemented:

class Downsample(nn.Module):
    """
    Down-sampling block"
    """

    def __init__(self,
                 dim,
                 keep_dim=False,
                 ):
        """
        Args:
            dim: feature size dimension.
            norm_layer: normalization layer.
            keep_dim: bool argument for maintaining the resolution.
        """

        super().__init__()
        if keep_dim:
            dim_out = dim
        else:
            dim_out = 2 * dim
        self.reduction = nn.Sequential(
            nn.Conv2d(dim, dim_out, 3, 2, 1, bias=False),
        )

    def forward(self, x):
        x = self.reduction(x)
        return x

Could you please clarify whether the Batch Normalization is intended to be part of the downsampler, as described in the paper, or if the omission in the code is deliberate?

Thank you very much for your time and assistance!

Best regards,
SeunghanYu

The text was updated successfully, but these errors were encountered:

ahatamiz · 2025-03-25T06:20:30Z

Hi @SeunghanYu

Thank you so much for your attention to the details. This is indeed a typo and we have already removed the word "batch normalized" to avoid any confusions in the next iteration (camera-ready version) of the manuscript.

Kind Regards,
Ali Hatamizadeh

ahatamiz · 2025-03-26T02:04:18Z

Hi @SeunghanYu please see the update manuscript that reflects this change.

Thanks again for your attention to details !

Regards

SeunghanYu · 2025-03-26T03:52:29Z

Hi @ahatamiz!

Thank you so much for the clarification.
I’m glad to hear the updated manuscript addresses this point, and I look forward to reading the newly updated version.
Thank you again for your prompt response and for sharing these updates!

Best regards,
SeunghanYu

ahatamiz closed this as completed Mar 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing Batch Normalization in Downsample Class #68

Missing Batch Normalization in Downsample Class #68

SeunghanYu commented Mar 25, 2025

ahatamiz commented Mar 25, 2025 •

edited

Loading

ahatamiz commented Mar 26, 2025

SeunghanYu commented Mar 26, 2025

Missing Batch Normalization in Downsample Class #68

Missing Batch Normalization in Downsample Class #68

Comments

SeunghanYu commented Mar 25, 2025

ahatamiz commented Mar 25, 2025 • edited Loading

ahatamiz commented Mar 26, 2025

SeunghanYu commented Mar 26, 2025

ahatamiz commented Mar 25, 2025 •

edited

Loading