Word log-likelihood scores #17

DomhnallBoyle · 2024-02-27T15:16:29Z

Hi, I created this pull request to add word log-likelihood scores to the output

I used this for other projects but hopefully it can be useful for someone else too

Thanks

sourcery-ai

PR Type: Enhancement

PR Summary: The pull request introduces an enhancement to the existing functionality by adding log-likelihood scores to the output of word and phone alignments. This change allows for a more detailed analysis of the alignment results by including the log-likelihood scores alongside the start and end times of phones within words.

Decision: Comment

📝 Type: 'Enhancement' - not supported yet.

Sourcery currently only approves 'Typo fix' PRs.

✅ Issue addressed: this change correctly addresses the issue or implements the desired feature.

No details provided.

✅ Small diff: the diff is small enough to approve with confidence.

No details provided.

General suggestions:

Consider adding error handling or validation for the new log-likelihood score extraction to prevent potential runtime errors due to unexpected data formats. This could improve the robustness of the feature and ensure consistent behavior across different datasets.
Given the addition of log-likelihood scores, it might be beneficial to update any related documentation or examples to demonstrate how to use and interpret these new scores. This could help users take full advantage of the new feature.

Thanks for using Sourcery. We offer it for free for open source projects and would be very grateful if you could help us grow. If you like it, would you consider sharing Sourcery on your favourite social media? ✨

Share Sourcery

Help me be more useful! Please click 👍 or 👎 on each comment to tell me if it was helpful.

sourcery-ai · 2024-02-27T15:17:31Z

p2fa/align.py

@@ -203,14 +202,15 @@ def read_aligned_mlf(mlffile, SR, wave_start):

        # Append this phone to the latest word (sub-)list
        ph = lines[j].split()[2]
+        log_likelihood = float(lines[j].split()[3])


suggestion (llm): Extracting the log_likelihood directly from lines[j].split()[3] without checking if the log_likelihood value exists or if the split operation resulted in enough elements could lead to an IndexError if the data format is ever not as expected. Consider adding a check to ensure that the data format is correct before accessing the index.

Word log-likelihood scores

9ad76cc

sourcery-ai bot reviewed Feb 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Word log-likelihood scores #17

Word log-likelihood scores #17

DomhnallBoyle commented Feb 27, 2024

sourcery-ai bot left a comment

sourcery-ai bot Feb 27, 2024

Word log-likelihood scores #17

Are you sure you want to change the base?

Word log-likelihood scores #17

Conversation

DomhnallBoyle commented Feb 27, 2024

sourcery-ai bot left a comment

Choose a reason for hiding this comment

sourcery-ai bot Feb 27, 2024

Choose a reason for hiding this comment