Mathvista PNG images are written with a `.jpg` suffix, causing API failures #483

evanmiller-anthropic · 2024-09-23T18:28:47Z

Running Mathvista against Anthropic APIs, I encounter the failure

│ BadRequestError: Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message':                                          │
│ 'messages.0.content.1.image.source.base64.data: The image was specified using the image/jpeg media type,                                            │
│ but does not appear to be a valid jpeg image'}}

It appears that Mathvista images are a mix of JPEG and PNG files, but they are all saved locally with extension .jpg. Inspect's MIME inference logic then reports them to the Anthropic API as JPEG files, causing 400 Bad Request failures when PNGs are encountered.

The text was updated successfully, but these errors were encountered:

sudhir-b · 2024-09-23T22:46:23Z

I'd love to have a crack at this as a first-time contributor if that's okay! I had a brief look, and it seems like it might be straightforward to save the images locally with the correct file extensions with a small change to evals/mathvista/mathvista.py:

@@ -1,3 +1,4 @@
+import imghdr
 import re
 from pathlib import Path
 
@@ -114,12 +115,19 @@ def mathvista_solver() -> Solver:
 
 def record_to_sample(record: dict) -> Sample:
     # extract image
-    image = Path(record["image"])
+    image_bytes = record["decoded_image"]["bytes"]
+    image_type = imghdr.what(None, h=image_bytes)
+    original_path = Path(record["image"])
+    file_extension = (
+        f".{image_type}" if image_type is not None else original_path.suffix
+    )
+    image = original_path.with_suffix(file_extension)
+
     if not image.exists():
         print(f"Extracting {image}")
         image.parent.mkdir(exist_ok=True)
         with open(image, "wb") as file:
-            file.write(record["decoded_image"]["bytes"])
+            file.write(image_bytes)
 
     message: list[ChatMessage] = [
         ChatMessageUser(

However, imghdr has been marked for deprecation. Two reasonable alternatives seem to be python-magic and filetype. I'd be more than happy to try to submit a PR for this but would appreciate any and all guidance from core contributors.

jjallaire-aisi · 2024-09-24T08:55:26Z

Hi @sudhir-b, yes, it would be great if you took a crack at this! I would suggest the filetype library as it has no external dependencies. One note: when doing this you should add a requirements.txt file to the folder listing filetype. We will soon be turning evals into a package and may pick this up as a package dependency (in either case the requirements.txt will serve as documentation).

evanmiller-anthropic · 2024-09-24T15:00:10Z

For a dependency-free solution, you could check the first 8 bytes for the static PNG header

https://en.wikipedia.org/wiki/PNG#File_header

sudhir-b · 2024-09-24T22:46:25Z

I had assumed that's what the filetype package did but in fact it only checks the first 4 bytes:
https://github.com/h2non/filetype.py/blob/0c7f219ea20a50b636c4a279af8694b0edf8419c/filetype/types/image.py#L135

I'm happy to do either implementation: using filetype or looking at the raw bytes.

jjallaire-aisi · 2024-09-25T06:56:00Z

Let's just look at the raw bytes.

evanmiller-anthropic · 2024-09-26T20:25:31Z

I'm still seeing this error with the merged changes. It appears that line 163 needs to be modified to point to the written PNG files, rather than the original JPEG.

inspect_ai/evals/mathvista/mathvista.py

Line 163 in 78ae701

files={f"image:{record['image']}": record["image"]},

jjallaire-aisi · 2024-09-26T20:38:45Z

Interestingly that line is exactly unneeded (that's for copying files to a docker container): 3f511de

I am still seeing this w/ Sonnet 3.5:

{'type': 'error', 'error': {'type': 'invalid_request_error',                               
'message': 'messages.0.content.1.image.source.base64: image exceeds 5 MB maximum: 6415740 bytes >                                 
 5242880 bytes'}}

So I think we need another image reduction pass here.

jjallaire-aisi · 2024-09-26T20:41:29Z

@evanmiller-anthropic I would defer to you on what you think the right heuristics are for reducing images in this dataset (i.e. we probably can target going well below 5MB but I'm not sure what the optimal target is)

evanmiller-anthropic · 2024-09-26T20:48:55Z

Hmm, I wonder why it still thinks bad JPEGs are being provided – I will need to investigate more.

@jjallaire-aisi I think same heuristic I added to MMMU? 1024 pixels per side? Anthropic endpoints have a pixel limit of 1.15 megapixels

https://github.com/UKGovernmentBEIS/inspect_ai/pull/482/files

jjallaire-aisi · 2024-09-26T20:58:36Z

Okay I added the reduction here: b253aba

I am still seeing this happen periodically though:

Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error',                                   │
│ 'message': 'messages.0.content.1.image.source.base64.data: The image was specified using the                                      │
│ image/jpeg media type, but does not appear to be a valid jpeg image'}}

sudhir-b mentioned this issue Sep 25, 2024

Handle Mathvista PNG images with .jpg extension #524

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mathvista PNG images are written with a `.jpg` suffix, causing API failures #483

Mathvista PNG images are written with a `.jpg` suffix, causing API failures #483

evanmiller-anthropic commented Sep 23, 2024

sudhir-b commented Sep 23, 2024

jjallaire-aisi commented Sep 24, 2024

evanmiller-anthropic commented Sep 24, 2024

sudhir-b commented Sep 24, 2024 •

edited

Loading

jjallaire-aisi commented Sep 25, 2024

evanmiller-anthropic commented Sep 26, 2024

jjallaire-aisi commented Sep 26, 2024

jjallaire-aisi commented Sep 26, 2024

evanmiller-anthropic commented Sep 26, 2024

jjallaire-aisi commented Sep 26, 2024

Mathvista PNG images are written with a .jpg suffix, causing API failures #483

Mathvista PNG images are written with a .jpg suffix, causing API failures #483

Comments

evanmiller-anthropic commented Sep 23, 2024

sudhir-b commented Sep 23, 2024

jjallaire-aisi commented Sep 24, 2024

evanmiller-anthropic commented Sep 24, 2024

sudhir-b commented Sep 24, 2024 • edited Loading

jjallaire-aisi commented Sep 25, 2024

evanmiller-anthropic commented Sep 26, 2024

jjallaire-aisi commented Sep 26, 2024

jjallaire-aisi commented Sep 26, 2024

evanmiller-anthropic commented Sep 26, 2024

jjallaire-aisi commented Sep 26, 2024

Mathvista PNG images are written with a `.jpg` suffix, causing API failures #483

Mathvista PNG images are written with a `.jpg` suffix, causing API failures #483

sudhir-b commented Sep 24, 2024 •

edited

Loading