Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Onboarding: Successful Anserini Reproduction #2628

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

Sinanmz
Copy link

@Sinanmz Sinanmz commented Nov 9, 2024

Hi,

I’m using a MacBook Pro with an M2 Pro chip, running:

  • Python 3.9.20
  • Maven 3.9.9
  • Java 21.0.5

Everything was successful, but when running mvn clean package, 6 of the tests failed. I was able to proceed by using mvn clean package -DskipTests instead.

@lintool
Copy link
Member

lintool commented Nov 9, 2024

There shouldn't be any test failures... can you share what failed?

@Sinanmz
Copy link
Author

Sinanmz commented Nov 10, 2024

Here are the main issues I encountered:

  1. Output Limit Exceeded: Tests like JsonVectorCollectionDocumentObjectTest and BM25StatTest went over the 8192-byte limit.
  2. ONNX Model Load Failure: BgeBaseEn15EncoderInferenceTest.testMaxLength had a Protobuf parsing error.

Here are the final lines of the output:


[INFO] Results:
[INFO] 
[ERROR] Failures: 
[ERROR]   JsonVectorCollectionDocumentObjectTest The test or suite printed 16688 bytes to stdout and stderr, even though the limit was set to 8192 bytes. Increase the limit with @Limit, ignore it completely with @SuppressSysoutChecks or run with -Dtests.verbose=true
[ERROR]   BM25StatTest The test or suite printed 8722 bytes to stdout and stderr, even though the limit was set to 8192 bytes. Increase the limit with @Limit, ignore it completely with @SuppressSysoutChecks or run with -Dtests.verbose=true
[ERROR]   BigramFeaturesTest The test or suite printed 8836 bytes to stdout and stderr, even though the limit was set to 8192 bytes. Increase the limit with @Limit, ignore it completely with @SuppressSysoutChecks or run with -Dtests.verbose=true
[ERROR]   LmDirTest The test or suite printed 8794 bytes to stdout and stderr, even though the limit was set to 8192 bytes. Increase the limit with @Limit, ignore it completely with @SuppressSysoutChecks or run with -Dtests.verbose=true
[ERROR]   ControllerTest.testSearch:42 expected:<10> but was:<0>
[ERROR]   ExtractTopDfTermsTest The test or suite printed 19156 bytes to stdout and stderr, even though the limit was set to 8192 bytes. Increase the limit with @Limit, ignore it completely with @SuppressSysoutChecks or run with -Dtests.verbose=true
[ERROR] Errors: 
[ERROR]   BgeBaseEn15EncoderInferenceTest.testMaxLength:1108 Runtime ai.onnxruntime.OrtException: Error code - ORT_INVALID_PROTOBUF - message: Load model from /Users/sina/.cache/pyserini/encoders/bge-base-en-v1.5-optimized.onnx failed:Protobuf parsing failed.
[INFO] 
[ERROR] Tests run: 812, Failures: 6, Errors: 1, Skipped: 0
[INFO] 
[INFO] ------------------------------------------------------------------------
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time:  38:08 min
[INFO] Finished at: 2024-11-10T20:43:21+03:30

@lintool
Copy link
Member

lintool commented Nov 24, 2024

I've tried looking into this and can't reproduce these errors... would you mind digging more deeply and try and figure out what's wrong?

@lintool
Copy link
Member

lintool commented Nov 24, 2024

Also, please fix conflicts?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants