-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
maxent LMs #12
Comments
... I think the latest version of SRILM supports them, and they're supposed to be a little better than regular Kneser-Ney LMs. |
FYI, on a news 1.5GB corpus, I get: not that good then |
I don't really understand what you are saying here, can you please format I found the reason for the crash with 4-gram pruning you found before- it's Dan On Thu, Jun 30, 2016 at 12:21 PM, vince62s [email protected] wrote:
|
yeah sorry copy paste from Excel. The corpus is "French news shuffle 2014" about 1.5 GB text file, |
what I am trying to say here is that these results are somehow surprising, because when I ran it on the cantab-tedlium text corpus (entropy filtered) maxent gave better results. |
Another issue for anyone who's watching this project:
it would be nice, as an additional baseline for the paper, to try maxent LMs.
Can someone figure out how to do this on, say, Switchboard or tedlium?
The text was updated successfully, but these errors were encountered: