Suggestions #5
Comments
Hi @ahhyeah, thanks for the feedback! New version is in progress and it will include at least importing and exporting audio. The realtime feature is still an open question, will do my best. I am also working on a new UI. I am trying to decide what information/buttons to put in each card on "main screen", and what to put into new "recording details screen".
I like the idea of just showing a few lines of text as a preview of a recording. That would be very helpful. I think copy and share are nice to have on the main screen. And ability to edit title. I assume the refresh icon runs the transcribe again? Based on current model selected? That could probably be in the detail screen. I like how you have a lot of it now. It's simple, clean. Another thought is to add a tiny description to each model. I'm pretty techie and follow AI news and I don't know the difference between all of the models. I just chose the larger one assuming it's more accurate. Thoughts? If you're looking for more ideas, I will keep sharing. Not sure if your long-term goal is to charge or not, but some features could be an "in-app purchase" for a pro version. So, another feature could be that when you play back audio, the corresponding word would be highlighted, or stand out somehow (color change?) as it's being played back. Sorry, I'm not a coder, so that seems really difficult. Just throwing out ideas! Another: Ability to record when the app is closed. Then, interacting with the Dynamic Island like the Voice Memos app does. Another: Using ChatGPT-type AI to summarize the voice recording and create a title along with using the current location.
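For the playback-highlighting idea above, here is a minimal sketch of the lookup it would need. It assumes the transcription step can supply a start and end time for each word, which is an assumption (the thread does not say whether the app has word-level timings). `WORDS` and `word_index_at` are hypothetical names used only for illustration.

```python
from bisect import bisect_right

# Hypothetical word timings in seconds: (start, end, word).
# Real values would have to come from the transcription step.
WORDS = [
    (0.0, 0.4, "Hello"),
    (0.5, 0.9, "from"),
    (1.0, 1.6, "WhisperBoard"),
]


def word_index_at(words, position):
    """Return the index of the word being spoken at `position`, or None."""
    # Assumes `words` is sorted by start time.
    starts = [start for start, _end, _word in words]
    # Last word whose start time is <= position.
    i = bisect_right(starts, position) - 1
    if i < 0:
        return None  # playback has not reached the first word yet
    _start, end, _word = words[i]
    # Inside the word's time range: highlight it. Otherwise we are in a pause.
    return i if position <= end else None
```

A player would call `word_index_at(WORDS, current_time)` on each playback tick and change the color of that word, clearing the highlight whenever it returns `None`.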
Hey, thanks for the feedback and suggestions! I will definitely think about allowing title edits on main screen. And yes, the refresh icon re-runs the transcription with current model. Adding descriptions for Whisper models is a good idea, and I appreciate the feature suggestions you mentioned. Some might be challenging to implement, but I will consider them for future updates. Feel free to keep sharing ideas, they are super helpful. Thanks again for your support!
You could add a widget for the Lock Screen to easily access the app.
A new version is now available on the App Store which has a lot of improvements including importing and sharing audio files.
@Saik0s Just tried it! Works great, but I miss the option to select the app from the Share sheet. From Voice Memos you now have to save it in Files and then open the file in WhisperBoard.
@mhauken maybe you have to add WhisperBoard to the share sheet. Did you try WhisperBoard with a long audio file? I would like to know how it behaves with long recordings, because I can't test it at the moment.
No. I can't find it in the share sheet (or when you tap edit there). It seems to work perfectly for long audio as well. I tried adding a 40min audio and it seems to work flawlessly. 🙌
Did you test with the large model? How much time did it take?
I tried an 8-minute file by exporting from the Voice Memos app to Files and then from Files to WhisperBoard, and it got stuck on "transcribing". I have the larger language model. I let it sit for 20-30 minutes and there was no progress. I can try again. Update: I opened the app back up and the transcription was there... Weird 🤷🏻‍♂️
There is indeed an issue with properly displaying the current state of transcription. Going to fix it ASAP.
I utilize GPT-4 to refine my transcripts, which significantly enhances them. Ideally, one would receive real-time transcription, correction, and translation.
Hi, I have used Whisper in Python on my Mac previously, and it is really great.
Do you use a specific prompt on ChatGPT?
Working on highlighting the changes made by GPT, but at the moment some are omitted.
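One possible way to surface every change between the raw transcript and the GPT-refined text is a word-level diff. This is only a sketch using Python's standard `difflib`, not the approach described in the comment above. `highlight_changes` and the `[-...-]` / `{+...+}` markers are hypothetical choices made for illustration.

```python
import difflib


def highlight_changes(original, corrected):
    """Mark word-level edits: [-removed-] and {+added+}."""
    a, b = original.split(), corrected.split()
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    out = []
    # get_opcodes() covers both sequences end to end, so no edit is skipped.
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            out.extend(a[i1:i2])
            continue
        if i1 < i2:  # words deleted or replaced in the original
            out.append("[-" + " ".join(a[i1:i2]) + "-]")
        if j1 < j2:  # words inserted or substituted in the corrected text
            out.append("{+" + " ".join(b[j1:j2]) + "+}")
    return " ".join(out)
```

Because the diff works on whitespace-separated words, a punctuation or capitalization fix shows up as a replaced word rather than a single changed character.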
Hi @Werner602, I released a new update, can you try it to see if the problem is gone?
What about an SRT/subtitle output format, with at least rough timestamps at the chunk level?
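As a rough illustration of the SRT idea above, the sketch below turns chunk-level timestamps into SRT text. It assumes each chunk is available as a `(start_seconds, end_seconds, text)` tuple, which is an assumption about the data and not something the thread confirms. `srt_timestamp` and `chunks_to_srt` are hypothetical helper names.

```python
def srt_timestamp(seconds):
    """Format seconds as HH:MM:SS,mmm (SRT puts a comma before milliseconds)."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks):
    """Build SRT text from (start_seconds, end_seconds, text) chunks."""
    blocks = []
    for number, (start, end, text) in enumerate(chunks, start=1):
        # Each cue: sequence number, time range, then the cue text.
        blocks.append(
            f"{number}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)
```

For example, `chunks_to_srt([(0.0, 2.5, "Hello"), (2.5, 5.0, "world")])` yields two numbered cues separated by a blank line, which can be saved as a `.srt` file.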
Great app, I would pay for this! A couple of ideas: 1. Allow for import of existing voice recordings from the native iPhone app. 2. Ability to see the transcribing and ability to copy and paste from it while it's still recording. 3. Allow for saving and importing files from both the native files app and dropbox. Thanks for putting this together!