`AutoModel` class for `image-text-to-text` models #32042

merveenoyan · 2024-07-18T08:44:29Z

Feature request

It would be nice to get a standard AutoModel class for image-text-to-text models (since @molbap is standardizing the processor)

Motivation

@NielsRogge noticed that in model repositories the automatic snippets fallback to AutoModelForPreTraining because these models don't exist in PIPELINE_TAGS_AND_AUTO_MODELS (due to lack of AutoClass) More importantly it would be nice to load it to a single class.

Your contribution

I haven't checked what it takes to implement an AutoClass when model classes exist in different names for the same task but if decided I don't mind looking into it and taking a stab.

The text was updated successfully, but these errors were encountered:

NielsRogge · 2024-07-18T08:55:36Z

First attempt was at #29572, but is awaiting standardization of processors which is tracked at #31911

amyeroberts · 2024-07-18T10:03:47Z

cc @zucchini-nlp re VLMs

yonigozlan · 2024-07-18T18:02:05Z

Another blocker was that some models need custom processing code to be moved into their processors. I started #32059 and will get to work on checking which models need additional processing to standardize the inputs and outputs :).

merveenoyan added the Feature request Request for a new feature label Jul 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`AutoModel` class for `image-text-to-text` models #32042

`AutoModel` class for `image-text-to-text` models #32042

merveenoyan commented Jul 18, 2024 •

edited

Loading

NielsRogge commented Jul 18, 2024

amyeroberts commented Jul 18, 2024

yonigozlan commented Jul 18, 2024

AutoModel class for image-text-to-text models #32042

AutoModel class for image-text-to-text models #32042

Comments

merveenoyan commented Jul 18, 2024 • edited Loading

Feature request

Motivation

Your contribution

NielsRogge commented Jul 18, 2024

amyeroberts commented Jul 18, 2024

yonigozlan commented Jul 18, 2024

`AutoModel` class for `image-text-to-text` models #32042

`AutoModel` class for `image-text-to-text` models #32042

merveenoyan commented Jul 18, 2024 •

edited

Loading