Applying xmoverscore to novel dataset #5

Open
billray0259 opened this issue Sep 23, 2022 · 1 comment

@billray0259

Hello, I'm attempting to apply the xmoverscore metric to a novel dataset.

I ran main.py, and it generated the following files, which I organized into results directories.
image

Are the Pearson correlation coefficients reported in these files? I can't find them.
Also, are the sample-level xmoverscore values reported in the HUMAN column of the DA-seglevel.csv file?
image

I calculated sample-level xmoverscores on the novel dataset I'm working with, and the scores fall roughly in the range [-0.2, -0.1].
image

Does that seem like a valid range for the scores, or is it likely there is an error in the way I am calculating the scores?
Thank you.

@andyweizhao
Collaborator

andyweizhao commented Sep 24, 2022

Hello @billray0259

Are the Pearson correlation coefficients reported in these files?

No, they are printed to the console rather than written to these files.
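If the console output is not at hand, the coefficient can also be recomputed from the per-segment scores. The following is a minimal sketch in plain Python, not the repository's own evaluation code: the function name `pearson` and the sample values are hypothetical and only illustrate the calculation.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical values for illustration only, not taken from any dataset.
metric_scores = [-0.18, -0.12, -0.15, -0.10]
human_scores = [-0.9, 0.4, -0.2, 0.8]
r = pearson(metric_scores, human_scores)
print(f"Pearson r = {r:.3f}")
```

Pearson correlation is unaffected by shifting or positively rescaling either input, so a metric whose scores are all negative can still correlate well with human judgments.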

are the sample-level xmoverscore values reported in the HUMAN columns of DA-seglevel.csv file?

No, the HUMAN column contains human judgments of translation quality, z-score normalized, as provided by the WMT workshops.
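For readers unfamiliar with z-score normalization, here is a minimal sketch of the idea. It assumes the raw ratings of a single annotator are standardized by their own mean and population standard deviation; the exact WMT procedure may differ, and the function name `z_normalize` and the ratings are hypothetical.

```python
import statistics


def z_normalize(raw_scores):
    """Standardize scores to zero mean and unit standard deviation."""
    mean = statistics.mean(raw_scores)
    # Population standard deviation (an assumption of this sketch).
    std = statistics.pstdev(raw_scores)
    if std == 0:
        raise ValueError("cannot standardize constant scores")
    return [(s - mean) / std for s in raw_scores]


# Hypothetical raw ratings from one annotator on a 0-100 scale.
raw = [50, 60, 70]
z = z_normalize(raw)
print([round(v, 3) for v in z])
```

Standardized values are centered on zero, which is why the HUMAN column contains both negative and positive numbers.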

does that seem like a valid range for the scores?

The xmoverscore metric produces scores less than or equal to 1. The details:

xmoverscore = 1 - EMD (earth mover's distance). EMD is always non-negative, so the raw score cannot exceed 1, and it becomes negative whenever EMD is greater than 1. I have normalized the metric scores into the interval [0, 1], with 1 as a perfect score. See 2026e18
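To make the range concrete, here is a small sketch of the 1 - EMD relationship. It does not reproduce the normalization in the referenced commit; the helper `score_from_emd` and the EMD values are hypothetical and chosen only to show how negative scores can arise.

```python
def score_from_emd(emd: float) -> float:
    """Return 1 - EMD, the unnormalized score described above."""
    if emd < 0:
        raise ValueError("EMD is a distance and cannot be negative")
    return 1.0 - emd


# Hypothetical EMD values, chosen only to show the possible range.
for emd in (0.0, 0.4, 1.0, 1.15):
    print(f"EMD = {emd:.2f} -> score = {score_from_emd(emd):.2f}")
```

An EMD slightly above 1 gives a slightly negative score, which would be consistent with values around -0.1 to -0.2 from a version of the code without the normalization.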

Hope these can help!
