Feature/eval spans #132

laurejt · 2025-01-09T19:06:30Z

Here is the current code for evaluating span annotations.

The script requires two input files: page-level JSONL files containing the reference span annotations and the system span annotations, respectively.

Usage: python evaluate_poetry_spans ref_jsonl sys_jsonl out.csv

Note that I'm considering adding reporting for the following:

number of exact matches
number of partial matches
number of missed reference spans
number of spurious system spans
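
For illustration, the four counts above could be tallied roughly as follows. This is only a sketch under stated assumptions, not the code in this PR: the `Span` type, the half-open character offsets, the `count_matches` helper, and the rule that "partial" means any overlap that is not an exact match are all hypothetical.

```python
from typing import NamedTuple


class Span(NamedTuple):
    start: int  # inclusive character offset (assumed convention)
    end: int    # exclusive character offset (assumed convention)


def overlaps(a: Span, b: Span) -> bool:
    """True when two half-open spans share at least one character."""
    return a.start < b.end and b.start < a.end


def count_matches(ref_spans: list[Span], sys_spans: list[Span]) -> dict[str, int]:
    """Tally the four proposed counts for one page (hypothetical helper)."""
    sys_set = set(sys_spans)
    counts = {"exact": 0, "partial": 0, "missed": 0, "spurious": 0}
    for ref in ref_spans:
        if ref in sys_set:
            # Identical boundaries in reference and system output.
            counts["exact"] += 1
        elif any(overlaps(ref, s) for s in sys_spans):
            # Some overlap, but boundaries differ.
            counts["partial"] += 1
        else:
            # No system span touches this reference span.
            counts["missed"] += 1
    # A system span is spurious when it overlaps no reference span at all.
    counts["spurious"] = sum(
        1 for s in sys_spans if not any(overlaps(s, r) for r in ref_spans)
    )
    return counts


refs = [Span(0, 10), Span(20, 30), Span(40, 50)]
hyps = [Span(0, 10), Span(25, 35), Span(60, 70)]
print(count_matches(refs, hyps))
# {'exact': 1, 'partial': 1, 'missed': 1, 'spurious': 1}
```

Under a different matching rule (for example, one-to-one alignment of spans), the partial and spurious counts could differ.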

codecov · 2025-01-09T21:11:14Z

Codecov Report

Attention: Patch coverage is 96.58247% with 23 lines in your changes missing coverage. Please review.

Project coverage is 80.81%. Comparing base (431b528) to head (a0bce13).

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #132      +/-   ##
===========================================
+ Coverage    75.12%   80.81%   +5.69%     
===========================================
  Files           21       23       +2     
  Lines         1865     2538     +673     
===========================================
+ Hits          1401     2051     +650     
- Misses         464      487      +23
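
The percentages in this report can be cross-checked against the Hits and Lines rows of the table. The sketch below assumes the patch figure equals added hits over added lines, which happens to reproduce the reported value here; in general, patch coverage is computed over the lines touched by the diff, which need not equal the change in totals. The `coverage_pct` helper is hypothetical.

```python
def coverage_pct(hits: int, lines: int) -> float:
    """Coverage as a percentage of tracked lines."""
    return 100.0 * hits / lines


# Values copied from the Hits and Lines rows of the table above.
base = coverage_pct(1401, 1865)   # develop
head = coverage_pct(2051, 2538)   # this PR
# Assumption: patch coverage approximated as added hits / added lines.
patch = coverage_pct(2051 - 1401, 2538 - 1865)

print(f"base={base:.2f}% head={head:.2f}% change={head - base:+.2f}% patch={patch:.5f}%")
# base=75.12% head=80.81% change=+5.69% patch=96.58247%
```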

laurejt · 2025-01-10T17:11:16Z

Codecov Report

Attention: Patch coverage is 99.45652% with 2 lines in your changes missing coverage. Please review.

Project coverage is 99.64%. Comparing base (cb03ab8) to head (718aa4b).
Report is 2 commits behind head on develop.

Additional details and impacted files

Looks like Codecov is only checking the test file and not the actual code file.

Now also report counts of matches, misses, and spurious.

laurejt added 4 commits December 18, 2024 15:50

Initial eval code & unit tests

8e0e642

Added outstanding unit tests for existing methods

1c460eb

Added eval I/O functionality and unit tests.

fd4bdf5

Added CL support and progress bar

718aa4b

laurejt requested a review from rlskoeser January 9, 2025 19:06

jerielizabeth mentioned this pull request Jan 13, 2025

Wrap up evaluation code #131

Open

2 tasks

laurejt added 3 commits January 21, 2025 12:49

Expanded evaluation reporting

979de2c

Now also report counts of matches, misses, and spurious.

Added poetry-level counts to evaluation reporting

f59ce66

Merge branch 'develop' into feature/eval-spans

a0bce13
