
What exactly is object pointer? #312

Open
jeezrick opened this issue Sep 13, 2024 · 3 comments

Comments

@jeezrick

Hi, I would appreciate it if someone could shed some light on the object pointer feature introduced in SAM2.

As mentioned in the paper:
In addition to the spatial memory, we store a list of object pointers as lightweight vectors for high-level semantic information of the object to segment, based on mask decoder output tokens of each frame.

However, in the (inference) code, I just don't see the benefit of this particular token. Moreover, the tracking still seems to work if you disable it. I wonder if someone could share their insights on this.

@jeezrick
Author

[image]
And of course, I have seen the ablation experiment.

@13512263278

The object pointer is a vector for the output mask, extracted from the output token of the SAM mask decoder.
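To make that description concrete, here is a minimal sketch in plain Python of how such a pointer could be derived. It assumes the decoder returns one output token and one predicted IoU score per candidate mask, and that the pointer is a learned linear projection of the token belonging to the best-scoring mask. The names `object_pointer` and `linear` are hypothetical and are not taken from the SAM2 code base, and an identity matrix stands in for the learned weights.

```python
import random


def linear(x, weight, bias):
    """Compute y = W x + b for a single vector x (plain Python lists)."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def object_pointer(output_tokens, iou_scores, weight, bias):
    """Project the output token of the best-scoring candidate mask.

    output_tokens: one vector per candidate mask (assumed decoder output)
    iou_scores:    predicted mask quality per candidate
    """
    best = max(range(len(iou_scores)), key=lambda i: iou_scores[i])
    return linear(output_tokens[best], weight, bias)


random.seed(0)
dim = 4  # toy size; the real token dimension is much larger
tokens = [[random.random() for _ in range(dim)] for _ in range(3)]
ious = [0.2, 0.9, 0.5]

# Identity projection as a stand-in for learned weights (assumption).
identity = [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]
ptr = object_pointer(tokens, ious, identity, [0.0] * dim)
print(len(ptr))
```

The point of the sketch is only that the pointer is a single small vector per object per frame, which is why the paper can call it lightweight compared with the spatial memory features.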

@Lr-2002

Lr-2002 commented Nov 7, 2024

I'm now trying to extract the object pointer from the SAM2 model and return it.
However, the number of object pointers is always 3, even when I input 5 points into SAM with multimask_output=True.
Has anyone faced this problem before?

I've already noticed that the number of pointers is hard-coded in _build_sam_heads. Does it have no relationship to the decoded masks?
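
One possible reading, not confirmed by the maintainers: in SAM-style decoders, multimask output produces a fixed number of candidate masks (three) to handle prompt ambiguity, and each candidate has its own output token. All points in a call are treated as one prompt for one object, so the count would follow the decoder configuration rather than the number of points. The toy decoder below only illustrates that relationship. `toy_decode` and `NUM_MULTIMASK_OUTPUTS` are hypothetical names, and the "encoding" is a placeholder sum.

```python
NUM_MULTIMASK_OUTPUTS = 3  # assumption: fixed when the decoder is built


def toy_decode(points, multimask_output):
    """Return one placeholder output token per candidate mask.

    All points are consumed together as a single prompt for one object,
    so the number of candidates depends on the decoder configuration,
    not on len(points).
    """
    prompt_sum = float(sum(x + y for x, y in points))  # stand-in for prompt encoding
    n = NUM_MULTIMASK_OUTPUTS if multimask_output else 1
    return [[prompt_sum, float(k)] for k in range(n)]


five_points = [(10, 20), (30, 40), (50, 60), (70, 80), (90, 100)]
one_point = [(10, 20)]

print(len(toy_decode(five_points, True)))
print(len(toy_decode(one_point, True)))
print(len(toy_decode(five_points, False)))
```

If this reading is right, the three values would be pointer candidates tied to the three candidate masks, with the one matching the selected mask being the pointer that is kept for tracking.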
