-
-
Notifications
You must be signed in to change notification settings - Fork 7
Open
Labels
bugSomething isn't workingSomething isn't workingenhancementNew feature or requestNew feature or request
Description
Currently, when confidences are saved for a book, it produces a [book_id].SFM.confidences.tsv file containing confidences for verses and tokens, a [book_id].SFM.confidences.chapters.tsv file for chapter confidences, and a confidences.books.tsv file for book confidences. There are some improvements that can be made to this system.
- Having a single file for both verse level and token level sequences can make it difficult for users to visually parse the sheet if all they need is the verse confidences. Separating the sheets out into separate
...confidences.verses.tsvand...confidences.tokens.tsvsheets would help. - The confidences sheets currently contain extraneous data like headers or table of contents info. The default should be to only include verses in the confidence files, with an option to include the other info if desired.
- When confidences are being generated from .txt files rather than .sfm files, there isn't a vref to use for the confidences.tsv as a label to go along with the verse score. So instead, they're currently labeled using zero-based indexing. However, in editors, usually the first row is row 1, not row 0, so the confidence files should use one-based indexing in this case for easy lookup.
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workingenhancementNew feature or requestNew feature or request
Type
Projects
Status
🏗 In progress