Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Impossible to add a fine-tuned custom model #107

Open
mathieuromain08 opened this issue Nov 26, 2024 · 2 comments
Open

Impossible to add a fine-tuned custom model #107

mathieuromain08 opened this issue Nov 26, 2024 · 2 comments

Comments

@mathieuromain08
Copy link

Hello,

It would be great to be able to choose your own model, there are fine tune models for French, or other languages, for example that are much better than the basic Whisper ones. Perhaps to keep the “out of the box” aspect, three choices of models, fast, precise and custom?

Anyway, thanks for your work, absolutely incredible quality, I've been looking for this for a long time for my radio and interview work.

@kaixxx
Copy link
Owner

kaixxx commented Nov 26, 2024

I totally agree. We plan to implement a mechanism to add extra models. Which one would you recommend for french in particular? It would be nice to have a curated list of good working models.

Currently, you can try to add your custom model manually:

  • It must be in the 'faster-whisper' format: https://github.com/SYSTRAN/faster-whisper Only fine-tuned models based on whisper-large v2 or smaller models are supported right now (at least on Windows; on Mac, newer models might work too).
  • In the installation-folder of noScribe, you will find the sub folder "models". You can replace the contents of either "faster-whisper-small" or "faster-whisper-large-v2" with your custom fine-tuned model. However, make sure that all the file and folder names are exactly the same.

@Lod3
Copy link

Lod3 commented Nov 26, 2024

It would be nice to have a curated list of good working models.

This might be a good excuse to setup a wiki part on this repo to have a matrix of good performing models per language.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants