Support Unified Multimodal Model #33368

Open
KevinZeng08 opened this issue Sep 7, 2024 · 1 comment

@KevinZeng08

Feature request

Hi, I am wondering whether this repository can support unified multimodal models such as Show-o: https://github.com/showlab/Show-o

Motivation

Unified multimodal models may become a trend in multimodal research.

Your contribution

I would like to try working on the integration.

@KevinZeng08 added the "Feature request" label on Sep 7, 2024
@zucchini-nlp
Member

Hey @KevinZeng08!

Indeed, multimodal models are exciting to have! We currently have a PR to integrate Chameleon (#32013) with interleaved text-image generation capabilities. Although we aren't currently working on adding Show-o, there is an issue for people interested in such models (#32926). Feel free to comment on that issue and open a PR for Show-o if you want to give it a try.
