Skip to content

StoryToolkitAI v0.17.5 - ultra alpha

Compare
Choose a tag to compare
@octimot octimot released this 19 Nov 06:58
· 397 commits to main since this release

Download the standalone release here:

StoryToolkitAI 0.17.5 for Mac M1

StoryToolkitAI 0.17.5 for Mac Intel

StoryToolkitAI 0.17.5 for Windows (only for CUDA GPUs)

Please report if something doesn't work as expected.

Info

The standalone version is way behind the git version of StoryToolkitAI. This is because compiling the app into Windows and Mac executables isn't really a walk in the park. We're working on getting the new standalone version up to date within a few days.

However, if you find the tool useful, please consider supporting the development, and you may enjoy some perks while doing so. If not, no worries, the tool will continue to be open and free for as long as we can support it!

What's new in this release

  • Transcript Groups allows you to group transcript segments and recall them later
    (CMD/CTRL+G to add segment to group, then use SHIFT+G in the transcription window to open up the transcript group window.)
  • Copy transcript lines with timecode using CMD/CTRL+SHIFT+C (then simply paste them wherever you want)
  • New transcript editing shortcuts: CMD/CTRL+A selects all transcript segments, BACKSPACE deletes active segment
  • Improved Resolve API communication and logging
  • When transcribing Resolve timelines, the timeline name, frame rate and start timecode is saved to the transcription.json file
  • Other minor fixes, including transcript scrollbar, more detailed logging etc.

Installation:

If you never installed the tool on your machine, please read this. Otherwise, simply replacing the old app will work.

IMPORTANT

The following steps describe how you should install ffmpeg on your machine to benefit from the full functionality of StoryToolkitAI. In reality, you do not need to have ffmpeg installed unless you're transcribing files that don't use the Linear PCM codec for their audio. As a matter of fact, other formats work without ffmpeg aswel, but we just haven't tested it widely enough to know for sure which ones.

So, before torturing yourself with the following steps (unless you like the terminal or light pain in general), try to create the transcription_WAV render preset in Resolve following this info, or simply use WAV audio files or any other format that uses Linear PCM for audio when transcribing, and if it's good enough for you just ignore the following steps (except the permissions part on MacOS, if you get warnings when you start the app). Also, make sure that you're using mono audio to prevent the algorithm from ignoring some of the channels.

Obviously if you're using the tool for other functions apart from transcriptions, you absolutely do not need ffmpeg on your machine.

If this works for enough people we'll take out the FFMPEG warning at the start of the app and just provide these steps for advanced use cases. What do you think?

Mac OS

You need to install ffmpeg manually from the terminal and the easiest way to do that is via Homebrew:

1. Install Homebrew

Open up a Terminal window, then run:
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

If Homebrew installation fails, first make sure you have xcode installed:
xcode-select --install

Note: if Homebrew installation fails for any reason, a good starting point is this page.

2. Install ffmpeg:

In the same Terminal window:
brew install ffmpeg

3. Open the release file

First, download the StoryToolkitAI zip file that fits with your platform, if you haven't done so already.
Unzip, copy the app file into your Applications folder, then double click and it should start.

4. Give the app all the necessary permissions to run

Mac OS might warn you of malicious software or it might ask you to allow the app to access files on your Mac. If the app won't start, first try to right click on the app and then click Open, and probably Open again on the dialog you'll see. If it still doesn't work, try this: Safely open apps on your Mac, and maybe this: control access to files and folders on Mac

If you get something like ""StoryToolkitAI.X.X.X" is damaged and can't be opened" and you downloaded the release from this page, open Terminal in the folder with the app and try to take the app out of Mac quarantine with this command before starting the app, for eg.:
xattr -d com.apple.quarantine ./StoryToolkitAI.0.17.5.M1.app. More details on the issue here.

Note: The standalone version does not open up the console. If you're interested to see what is happening behind the hood, open a terminal in the folder where you have the tool (typically in your Downloads or Applications folder) and run this command:

./[STORYTOOLKITAI_APP_NAME].app/Contents/MacOS/[STORYTOOLKIT_APP_NAME]  

# (but replace [STORYTOOLKIT_APP_NAME] with the name of your release)

Windows

1. Download and install ffmpeg

The simplest approach is to use a package manager like Choco. Once you follow the installation steps for "Individual Use" on that page, you can install ffmpeg using this command in the command prompt:

choco install ffmpeg

2. Download and extract the release

The release file is a self extracting archive. Choose the folder where you want to have StoryToolkitAI on your computer and extract it there.

3. Open the app

Open the folder where you un-archived the file, and open the StoryToolkitAI.X.X.X.WIN.exe file, where X are the version numbers.

Info for all platforms

Make sure Resolve scripting is enabled

The tool only works with the Studio version of Davinci Resolve.

Davinci Resolve Preferences -> General, "External Scripting using", set to: Local

Downloading models to the cache

The first time you transcribe something with a specific model, it will take a bit longer to start the process because Whisper needs to download the model file (around 1.5GB for the medium model) on your local machine. But, after the model is saved on your machine, transcriptions will take less. The same applies to the advanced search function.