You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With a token that extends across physical lines, splitting it at the line break works to create separate tokens, but doesn't recalculate the second line's token's location.
running the service for refreshEditionWordLocations.php?db=yourDBName&ednID=### is teh current work around.
The text was updated successfully, but these errors were encountered:
We need to have a discussion about the Tokenization containment versus
Physical Lines. It became clear that tokens wrap across one or more
physical lines, at which point the code could no longer maintain alignment
with physical lines. We cannot assume that the token "TextDivision"
sequence containers are labeled the same as physical lines and cannot
assume that they align. The Physical line number or range is calculated
from the first and last grapheme of the token and following these thru
their syllablecluster to the physical line. The line number is accurate,
while the line token position is approximate.
With a token that extends across physical lines, splitting it at the line break works to create separate tokens, but doesn't recalculate the second line's token's location.
running the service for refreshEditionWordLocations.php?db=yourDBName&ednID=### is teh current work around.
The text was updated successfully, but these errors were encountered: