Paligemma Workflows Block #399
Conversation
Nice work getting it working! 🔥
Looks like the tests aren't passing; not sure if that's also the case on main or not (didn't look into it yet).
"block_type": "model", | ||
} | ||
) | ||
type: Literal["Paligemma"] |
I think this should be PaliGemmaModel (all model blocks should end in Model, IMO).
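For illustration, a rough sketch of the suggested rename; a plain pydantic BaseModel stands in for WorkflowBlockManifest here so the snippet is self-contained, and the field layout mirrors the hunk above:

from typing import Literal

from pydantic import BaseModel, ConfigDict


class BlockManifest(BaseModel):  # stand-in for WorkflowBlockManifest
    model_config = ConfigDict(
        json_schema_extra={
            "block_type": "model",
        }
    )
    # Suggested: end the type literal in "Model", matching the other model blocks.
    type: Literal["PaliGemmaModel"]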
)

response = await self._model_manager.infer_from_request(
    paligemma_model_id, inference_request
Does this require a server to be running?
This looks more like what I'd expect: https://colab.research.google.com/drive/1_q09OjR2Ldl1FZnvfqwckvxrW_FIYclC?usp=sharing
No, I don't think it requires a server; this is how we load all the models in workflow blocks.
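As a rough sketch of the pattern being described (the class wrapper and argument handling below are simplified placeholders; only infer_from_request comes from the diff in this PR):

class PaliGemmaBlock:
    def __init__(self, model_manager):
        # The shared model manager owns weight loading and inference,
        # so the block never has to talk to a separate inference server.
        self._model_manager = model_manager

    async def run_locally(self, paligemma_model_id, inference_request):
        # Inference runs in-process via the model manager.
        response = await self._model_manager.infer_from_request(
            paligemma_model_id, inference_request
        )
        return response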
def get_manifest(cls) -> Type[WorkflowBlockManifest]:
    return BlockManifest

async def run_locally(
We may want this one to be set up both for local & async execution since it can't run without an NVIDIA GPU. You may want to be doing realtime video for your workflow (e.g. on a Jetson) but occasionally call out to a beefy server somewhere for an LLM response.
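A sketch of what supporting both paths could look like, loosely following the run_locally / run_remotely split used by other model blocks in this repo (method names and bodies here are assumptions, not the final API):

class PaliGemmaBlock:
    async def run_locally(self, image, prompt):
        # In-process execution: needs torch + an NVIDIA GPU on this machine.
        raise NotImplementedError("requires local GPU execution")

    async def run_remotely(self, image, prompt):
        # Remote execution: forward the request to a beefier inference server,
        # e.g. from a Jetson doing realtime video that only occasionally needs
        # a PaliGemma response.
        raise NotImplementedError("requires a reachable inference server")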
Just FYI - unit tests are failing as we are dragging the torch dependency into the test environment.
Should we add torch as a dep for the tests?
@yeldarby - we could, but the same would apply for people installing the package.
To fix the tests maybe we just have to remove the import here, since we are actually using the model manager to load the model? https://github.com/roboflow/inference/pull/399/files#diff-f8d393a3b2cc29b1852b9c9bcedd10a1f4682607a3fb57bb30dcc110954fb0abR26
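If removing the module-level import turns out to be the fix, one low-effort variant (sketch only, helper name is hypothetical) is deferring the import so that importing the block module during test collection never pulls in torch:

def _load_paligemma_class():
    # Deferred import: torch is only required when this function is actually
    # called, not when the block module is imported by the test suite.
    from inference.models.paligemma.paligemma import PaliGemma

    return PaliGemma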
    WorkflowBlock,
    WorkflowBlockManifest,
)
from inference.models.paligemma.paligemma import PaliGemma
exactly
I think before we merge this we should figure out whether:
A lot has changed in the EE since, so this probably cannot be merged as-is now.
We'll certainly need this ability for other blocks (even if we eventually host Paligemma). @EmilyGavrilenko this seems aligned with your work. @hansent, are there any other blocks which cannot run on the Hosted API currently?
Not in this repo yet, I think. The enterprise blocks repo that we want to add blocks from, however, contains some that can't because they are stateful, e.g. ByteTrackerBlock.
Don't we currently support CogVLM in the UI via the LMM block?
@EmilyGavrilenko you're right, the LMM block will error on hosted execution via this code path: https://github.com/roboflow/inference/blob/main/inference/core/workflows/core_steps/models/foundation/lmm.py#L373

Seems like a third class in terms of the LMM block: it can run on hosted and local, but depending on configuration it can only run local / we won't know until runtime (unless the manifest would somehow expose that certain config values make it non-executable on hosted). Another approach would be to split it up into separate blocks, one for GPT-4 and one for CogVLM?

Not sure it's worth handling / worrying about now? CogVLM hasn't really been something I've heard requested or come up much in examples we want to enable for users.
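For context, a hypothetical sketch of the "won't know until runtime" situation described above (helper name and error message are illustrative, not the actual lmm.py code):

def check_hosted_compatibility(lmm_type):
    # The manifest alone cannot express that certain config values are
    # local-only, so the incompatibility can only surface at run time.
    if lmm_type == "cog_vlm":
        raise ValueError(
            "CogVLM cannot be executed against the hosted API; "
            "run this block in a self-hosted deployment instead."
        )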
stale |
Description
This adds a workflow block to use the new Paligemma model
Type of change
How has this change been tested? Please provide a testcase or example of how you tested the change.
Locally using a notebook