generateContent throws 500 Internal Server Error when attempting to ground with VertexAISearch #378

jwilson-tower · 2024-07-12T14:30:01Z

All requests using the RetrievalTool fail with the following error...

[VertexAI.GoogleGenerativeAIError]: got status: 500 Internal Server Error. {"error":{"code":500,"message":"Internal error encountered.","status":"INTERNAL"}}"

The same requests, or rather requests with the same parameters, work from the console.

Requests using the GoogleSearchRetrievalTool work in the same code but I am specifically looking to ground my responses against my own data set in Vertex AI Search

jwilson-tower · 2024-07-12T16:02:33Z

const vertexAI = new VertexAI({
	project: 'xxx',
	location: 'xxx',
	googleAuthOptions: {
		keyFile: 'xxx'
	}
});
const.generativeModel = this.vertexAI.preview.getGenerativeModel({
	model: 'gemini-1.5-flash-001',
	generationConfig: {
		"max_output_tokens": 8192,
		"temperature": 0,
		"top_p": 1,
	},
	tools: [
		{
			retrieval: {
				vertexAiSearch: {
					datastore: 'projects/xxx/locations/global/collections/default_collection/dataStores/xxx',
				},
				disableAttribution: false,
			},
		}
	]
});
const result = await generativeModel.generateContent({
    contents: [{
        role: 'user',
        parts: [{
            text: 'xxx'
        }]
    }]
});

keepforever · 2024-07-24T19:08:00Z

This is happening to me too, using the node sdk

GoogleGenerativeAIError: [VertexAI.GoogleGenerativeAIError]: got status: 500 Internal Server Error. {"error":{"code":500,"message":"Internal error encountered.","status":"INTERNAL"}}
    at throwErrorIfNotOK (/Users/xxx/Desktop/projects/xxx/node_modules/@google-cloud/vertexai/build/src/functions/post_fetch_processing.js:34:15)
    at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
    at async generateContent (/Users/xxx/Desktop/projects/xxx/node_modules/@google-cloud/vertexai/build/src/functions/generate_content.js:51:5)

import { invariantResponse } from '@epic-web/invariant'
import { Part } from '@google-cloud/vertexai'
import {
  ActionFunctionArgs,
  json,
  unstable_createMemoryUploadHandler,
  unstable_parseMultipartFormData,
} from '@remix-run/node'
import { MAX_BRAND_LOGO_UPLOAD_SIZE } from '~/constants/constants'
import { deleteFile, uploadFile } from '~/utils/cloud-storage.server'
import { generativeModelForPdf, parseAndConcatText } from '~/utils/vertex.server'

export async function action(args: ActionFunctionArgs) {
  const { request } = args

  const formData = await unstable_parseMultipartFormData(
    request,
    unstable_createMemoryUploadHandler({ maxPartSize: MAX_BRAND_LOGO_UPLOAD_SIZE }),
  )

  const guidelinesFile = formData.get('pdf') as File
  invariantResponse(guidelinesFile, 'Guidelines file is required')

  const brandId = formData.get('brandId') as string
  invariantResponse(brandId, 'Brand ID is required')

  const { uri, fileName } = await uploadFile(parseInt(brandId), guidelinesFile)
  console.log('\n', `uri = `, uri, '\n')

  const filePart: Part = {
    fileData: {
      fileUri: uri,
      mimeType: 'application/pdf',
    },
  }

  const textPart = {
    text: `
    You are a very professional document summarization specialist.
    Please summarize the given document.`,
  }

  const vertexRequest = {
    contents: [{ role: 'user', parts: [filePart, textPart] }],
  }

  let resp
  try {
    resp = await generativeModelForPdf.generateContent(vertexRequest)
  } catch (error) {
    console.log('\n', `error = `, error, '\n')
    throw new Error('Failed to generate content')
  }

  const contentResponse = resp.response
  console.log('Generated content response: ', JSON.stringify(contentResponse))

  const concatenatedText = parseAndConcatText(contentResponse)

  console.log('\n', `concatenatedText = `, concatenatedText, '\n')

  // delete file after analysis
  try {
    const resp = await deleteFile(fileName)
    console.log('\n', `resp = `, resp, '\n')
  } catch (error) {
    console.log('\n', `failed to delete file ${fileName} `, '\n')
  }

  return json({ resp: concatenatedText })
}

export const generativeModelForPdf = vertexAI.getGenerativeModel({
  model: 'gemini-1.5-flash-001',
  generationConfig: { maxOutputTokens: 256 },
})

neoromantic · 2024-07-29T17:57:35Z

Have same problem with very minimal setup.

I'm using demo project from Vercel and trying to use it with Gemini.

As soon as I enable tools (with @ai/tools from vercel) every request fails with 500.

import { createVertex } from "@ai-sdk/google-vertex"

import { convertToCoreMessages, streamText, tool } from "ai"

const vertex = createVertex({
  project: process.env.GOOGLE_VERTEX_PROJECT,
  location: "us-east1",
})

export const maxDuration = 30

export async function POST(req: Request) {
  const { messages } = await req.json()

  const result = await streamText({
    model: vertex("gemini-1.5-pro"),
    messages: convertToCoreMessages(messages),
    system: `You are a helpful assistant.`,
    // tools: {
    // getInformation: tool({
    //   description: `get information from your knowledge base to answer questions.`,
    //   parameters: z.object({
    //     question: z.string().describe("the users question"),
    //   }),
    //   execute: async ({ question }) => findRelevantContent(question),
    // }),
  })

  return result.toAIStreamResponse()
}

Works with tools commented out, fails if enable tools.

happy-qiao · 2024-08-29T23:09:00Z

Cannot reproduce 500 error now. I use the following script and it returns 200

import {
    VertexAI
  } from '@google-cloud/vertexai';

(async function() {
    const project = 'vertexsdk';
    const datastoreId = 'hello_1724863936046';
    const vertexAI = new VertexAI({
        project: project,
        location: 'us-central1'
    });
    const generativeModel = vertexAI.preview.getGenerativeModel({
        model: 'gemini-1.5-flash-001',
        generationConfig: {
            maxOutputTokens: 8192,
            temperature: 0,
            topP: 1,
        },
        tools: [
            {
                retrieval: {
                    vertexAiSearch: {
                        datastore: `projects/${project}/locations/global/collections/default_collection/dataStores/${datastoreId}`,
                    },
                    disableAttribution: false,
                },
            }
        ]
    });
    const result = await generativeModel.generateContent({
        contents: [{
            role: 'user',
            parts: [{
                text: 'what\'s the weather today?'
            }]
        }]
    });
    console.log(JSON.stringify(result, null, 2));
}
)();

SebSchroen · 2024-09-03T11:38:32Z

Even though the error message didn't point in that direction, it essentially was a permission issue for me. Giving the Service Account who is calling the model the role "Discovery Agent Viewer" fixed the problems. You also get the internal error if the location of the project and the datastore don't match (e.g. europe-west1 + global). @jwilson-tower

jwilson-tower · 2024-09-03T17:23:42Z

Thank you @SebSchroen! I added the "Discovery Engine Viewer" to the service account and it fixed the issue.

jwilson-tower added priority: p2 Moderately-important priority. Fix may not be included in next release. type: bug Error or flaw in code with unintended results or allowing sub-optimal usage patterns. labels Jul 12, 2024

product-auto-label bot added the api: aiplatform Issues related to the googleapis/nodejs-vertexai API. label Jul 12, 2024

jwilson-tower changed the title ~~Generate~~ generateContent throws 500 Internal Server Error when attempting to ground with VertexAISearch Jul 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

generateContent throws 500 Internal Server Error when attempting to ground with VertexAISearch #378

generateContent throws 500 Internal Server Error when attempting to ground with VertexAISearch #378

jwilson-tower commented Jul 12, 2024 •

edited

Loading

jwilson-tower commented Jul 12, 2024

keepforever commented Jul 24, 2024 •

edited

Loading

neoromantic commented Jul 29, 2024

happy-qiao commented Aug 29, 2024

SebSchroen commented Sep 3, 2024

jwilson-tower commented Sep 3, 2024

generateContent throws 500 Internal Server Error when attempting to ground with VertexAISearch #378

generateContent throws 500 Internal Server Error when attempting to ground with VertexAISearch #378

Comments

jwilson-tower commented Jul 12, 2024 • edited Loading

jwilson-tower commented Jul 12, 2024

keepforever commented Jul 24, 2024 • edited Loading

neoromantic commented Jul 29, 2024

happy-qiao commented Aug 29, 2024

SebSchroen commented Sep 3, 2024

jwilson-tower commented Sep 3, 2024

jwilson-tower commented Jul 12, 2024 •

edited

Loading

keepforever commented Jul 24, 2024 •

edited

Loading