Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] update document with _bulk fails to generate embeddings in inference processors #17494

Open
will-hwang opened this issue Mar 1, 2025 · 0 comments
Labels
bug Something isn't working _No response_ untriaged

Comments

@will-hwang
Copy link

will-hwang commented Mar 1, 2025

Describe the bug

Bug Description:

embeddings are not generated when documents are updated with /index/_bulk operation, when successfully generated with insert /index/_doc/ operation. Is there any change made in Opensearch that skips embedding generation for bulk operation only?

How to reproduce the error:

Can you please assist with this issue?

Related component

No response

To Reproduce

  1. deploy model

POST /_plugins/_ml/models/{model_id}/_deploy

  1. configure pipeline
"processors": [
    {
      "text_embedding": {
        "model_id": {model_id},
        "field_map": {
          "text": "passage_embedding"
        }
      }
    }
  1. ingest doc
PUT /my-nlp-index/_doc/1
{
  "text": "hello world"
}
  1. update doc with _bulk
PUT /my-nlp-index/_bulk
{ "update": { "_index": "my-nlp-index", "_id": "1" } }
{ "doc" : { "text": "bye world" } }

Expected behavior

Embeddings are created for the initial ingest for "text":"hello world", but not updated with bulk operation
{ "doc" : { "text": "bye world" } }

Embeddings should be re-generated for bulk operation by calling text_embedding_processor

Additional Details

Plugins
Please list all plugins currently enabled.

Screenshots
If applicable, add screenshots to help explain your problem.

Host/Environment (please complete the following information):

  • OS: [e.g. iOS] MAC OS
  • Version [e.g. 22] Sequoia 15.3

Additional context
Add any other context about the problem here.

@will-hwang will-hwang added bug Something isn't working untriaged labels Mar 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working _No response_ untriaged
Projects
None yet
Development

No branches or pull requests

1 participant