Issue with DIFY When Orchestrating Workflows Involving Visual VL Models and Standard Text Models #8824

svcvit · 2024-09-27T02:38:11Z

Self Checks

This is only for bug report, if you would like to ask a question, please head to Discussions.
I have searched for existing issues search for existing issues, including closed ones.
I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
[FOR CHINESE USERS] 请务必使用英文提交 Issue，否则会被关闭。谢谢！:）
Please do not modify this template :) and fill in all the required fields.

Dify version

0.8.3

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

I'm trying to implement a branching logic in the workflow where regular text content is processed by a standard large language model (LLM), and image content is processed by a visual VL model. However, the system parameter sys.files cannot be referenced within the if branch.

To work around this, I added a keyword "image" to sys.query. If this keyword is present, the workflow will use the visual VL model.

The workflow functions correctly during the first round of conversation. However, in the second round, which only involves text, an error occurs: Run failed: 'TextPromptMessageContent' object has no attribute 'text'.

Reference image for the first round of conversation:

Reference image for the second round of conversation:

✔️ Expected Behavior

Enhancement Request: I would like the if condition to support sys.files so that image-related tasks can be processed using different models.
Bug Fix: I'm also seeking a fix for the current bug.

❌ Actual Behavior

No response

dosubot · 2024-09-27T02:39:52Z

I found a similar unsolved discussion that might be relevant to your issue: Error in item: Invalid variable, which was last updated on September 23, 2024 ^[1].

Addressing Your Issues:

Enhancement Request: Support for sys.files in IF Conditions
- Currently, the sys.files parameter is used to store images uploaded by users and requires enabling the image upload function in the 'Features' section of the application orchestration page ^[2]. However, it seems that sys.files cannot be directly referenced within the IF condition. This enhancement would indeed be useful for workflows involving different content types.
Bug Fix: 'TextPromptMessageContent' object has no attribute 'text'
- The error 'TextPromptMessageContent' object has no attribute 'text' occurs because the TextPromptMessageContent class uses the data attribute to store text data, not text. Ensure that you are using the data attribute to access the text content.

Here is the correct way to handle TextPromptMessageContent:

from your_module import TextPromptMessageContent, PromptMessageContentType

# Correct initialization
text_content = TextPromptMessageContent(type=PromptMessageContentType.TEXT, data="Your text here")

# Accessing the text data
print(text_content.data)  # This will print "Your text here"

In your workflow, make sure to use content.data instead of content.text when dealing with TextPromptMessageContent objects ^[3]^[4].

Implementing Branching Logic Based on Content Type:

To implement branching logic based on content type, you can use the IF/ELSE node to check for the presence of images or specific keywords in sys.query. Here is an example of how to set up such a workflow:

IF/ELSE Node: Use this node to check if sys.query contains the keyword "image".
Branch for Image Content: If the condition is met, process the content using the visual VL model.
Branch for Text Content: If the condition is not met, process the content using the standard LLM.

This approach allows you to handle different types of content within the same workflow ^[5]^[6].

By addressing these points, you should be able to enhance your workflow and resolve the current bug.

To continue talking to Dosu, mention @dosu.

crazywoola · 2024-09-27T03:07:08Z

Link #8777, not sure if is resolved.

svcvit · 2024-09-27T04:34:16Z

I tried pulling the latest merged code and switched to developer mode, but the error still persists.

If I first send an image with text, it follows the visual processing branch and responds correctly. However, when I then send a text-only message, it throws this error.

Additionally, there's another issue—sys.files can't be used as a condition in an if statement. Is there any way to optimize this?

free-cx · 2024-09-27T06:05:33Z

This bug is related to enabling memory in the node.

laipz8200 · 2024-09-27T08:53:32Z

We'll provide full file-type support in future versions.

dosubot bot added 🐞 bug Something isn't working 💪 enhancement New feature or request labels Sep 27, 2024

svcvit linked a pull request Sep 27, 2024 that will close this issue

Added sys.files support to the if-else condition, fixed the issue whe… #8837

Open

6 tasks

free-cx linked a pull request Sep 27, 2024 that will close this issue

fix: When Orchestrating Workflows Involving Visual VL Models and Stan… #8838

Open

12 tasks

laipz8200 mentioned this issue Sep 27, 2024

feat/enhance the multi-modal support #8818

Draft

11 tasks

laipz8200 self-assigned this Sep 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with DIFY When Orchestrating Workflows Involving Visual VL Models and Standard Text Models #8824

Issue with DIFY When Orchestrating Workflows Involving Visual VL Models and Standard Text Models #8824

svcvit commented Sep 27, 2024

dosubot bot commented Sep 27, 2024

crazywoola commented Sep 27, 2024

svcvit commented Sep 27, 2024

free-cx commented Sep 27, 2024 •

edited

Loading

laipz8200 commented Sep 27, 2024

Issue with DIFY When Orchestrating Workflows Involving Visual VL Models and Standard Text Models #8824

Issue with DIFY When Orchestrating Workflows Involving Visual VL Models and Standard Text Models #8824

Comments

svcvit commented Sep 27, 2024

Self Checks

Dify version

Cloud or Self Hosted

Steps to reproduce

✔️ Expected Behavior

❌ Actual Behavior

dosubot bot commented Sep 27, 2024

Addressing Your Issues:

Implementing Branching Logic Based on Content Type:

crazywoola commented Sep 27, 2024

svcvit commented Sep 27, 2024

free-cx commented Sep 27, 2024 • edited Loading

laipz8200 commented Sep 27, 2024

free-cx commented Sep 27, 2024 •

edited

Loading