Skip to content

StoryToolkitAI v0.18.1 - still in alpha

Compare
Choose a tag to compare
@octimot octimot released this 19 Apr 09:35
· 356 commits to main since this release
dbb4cb6

What's new in this release

We're slowly pushing updates that include access to ChatGPT from the tool, to have a state-of-the-art AI by your side to look into transcripts, markers, projects etc. and do all sorts of interesting stuff in the future, from transcript summarization, to grouping content and maybe even coding timelines to re-import into your editing suite - more tips and ideas regarding this integration soon via Patreon!

A few highlights:

  • Improved usability of Transcription Window (it still looks like we're in 2005, but more UI updates to come!)
  • Ability to export transcripts as SRT and Text files via menu buttons and CMD/CTRL+SHIFT+S ( #84)
  • Most transcription related features are now available via menu buttons
  • Custom Punctuation Marks via Preferences Window to be used for splitting segments on punctuation (Patron request)
  • Warning when trying to add markers to non-linked timeline in Resolve
  • Additional menu shortcuts
  • Fixed some bad bad bugs which prevented some Resolve API commands

For a detailed list of changes, see CHANGELOG.

Early Update Access for Patreon Frequent Users and Producers

This update is available early for Frequent User and Producer tiers only, see this post on Patreon

Version 0.17.16 is still available publicly for everyone else here.

StoryToolkitAI Patreon

BTW, if you find the tool useful, please consider becoming a patron.
This way, we can speed up development to work on cool features that you might need and keep up with rapid advancements in AI.

Installation:

IMPORTANT NOTE
We recommend backing up the configuration folder your user directory if you update from versions earlier than 0.18.0, since this version contains a major overhaul of the tool file structure.

macOS

Requirements

macOS 12.6 Monterey or newer
Intel or arm64 (M1, M2 etc.)
8 GB RAM

Installation Steps

Download the zip file that fits with your platform.
Unzip, copy the app file into your Applications folder, then double-click and it should start.

Please also read Info For All Platforms below.

Unsafe warnings

If you get something like "StoryToolkitAI is damaged and can't be opened", try to right click and then click open in the dropdown menu. It will warn you that it's unsafe, but that's just because we haven't notarized the tool with Apple.

If you still have problems opening it, try to take it out of quarantine: open up Terminal and execute:
xattr -d com.apple.quarantine ./StoryToolkitAI.0.18.0.1.arm64.app (or x86_64 if Intel).
More details on the issue here.

Console Mode

Note: The standalone version does not open up the console. If you're interested to see what is happening behind the hood, open a terminal in the folder where you have the tool (typically in your Downloads or Applications folder) and run this command:

./[STORYTOOLKITAI_APP_NAME].app/Contents/MacOS/[STORYTOOLKIT_APP_NAME]  

# (but replace [STORYTOOLKIT_APP_NAME] with the name of your release)

Windows

Requirements

Windows 10 or newer
CUDA-enabled graphics card

Installation

Just follow the installer to get the tool on your machine.

Notes for Windows

Do not install this version of the tool in Program Files unless you know what you're doing. Some users have experienced permission issues which will make some features unusable.

In case some functions return errors, a first good step is to uninstall the tool using the uninstaller and then re-install it again.

You don't need to have Python installed on your machine for the standalone version to work.
But, if you have Resolve Studio installed and the tool doesn't start, it's probably because you have multiple Python versions installed on your machine (see #22). This is because Resolve API seems to choose the newest version of Python that is available on the machine, while the tool uses 3.9.13. If you're feeling brave, we recommend installing directly via git and creating an appropriate virtual environment that matches the one that Resolve is using.

Also, if you do need Python for other reasons, make sure you install it from the official Python website and only use other Python installers / versions if you know what you're doing.

If you see an error mentioning python_check, start the tool with the --skip-python-check argument.

Also, see below:

Info for all platforms

Download Permissions

If you get something like PermissionError: [Errno 13] Permission denied: '/Users/[your user]/.cache/torch/hub/trusted_list', it's most likely because the new standalone version uses different ssl certificates (issue #77) , so it's best if you delete the old hub cache, like this:

On macOS

Open terminal, and execute

rm -rf /Users/USERNAME/.cache/hub
rm -rf /Users/USERNAME/.cache/torch
rm -rf /Users/USERNAME/.cache/whisper
  • replace USERNAME with your MacOS user

On Windows

Open CMD, and execute

rmdir /s /q C:\Users\USERNAME\.cache\hub
rmdir /s /q C:\Users\USERNAME\.cache\torch
rmdir /s /q C:\Users\USERNAME\.cache\whisper
  • replace USERNAME with your Windows user

Keep in mind that if you do this, the first time you transcribe or search something it will need to re-download the models, so it will take a bit longer.

Make sure Resolve scripting is enabled

The tool only works with the Studio version of Davinci Resolve.

Davinci Resolve Preferences -> General, "External Scripting using", set to: Local

transcription_WAV

For the optimum transcription quality, try to create the transcription_WAV render preset in Resolve following this info, or simply use WAV audio files or any other format that uses Linear PCM for audio when transcribing - preferably mono.

Downloading models to the cache

The first time you transcribe something with a specific model, it will take a bit longer to start the process because Whisper needs to download the model file (around 1.5GB for the medium model) on your local machine. But, after the model is saved on your machine, transcriptions will take less. The same applies to the advanced search function.