Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add further sox options for audio deidentification #8

Open
peteruhrig opened this issue Jun 23, 2022 · 1 comment
Open

Add further sox options for audio deidentification #8

peteruhrig opened this issue Jun 23, 2022 · 1 comment

Comments

@peteruhrig
Copy link

On the command line, the following commands were successful for stereo audio with 44.1 kHz sampling rate:
First, create audio from the video:

ffmpeg -i covid.mp4 covid.wav

Then, find out exact length, sampling rate and number of channels, in our cases 49.53 seconds, 44100 Hz and 2 channels (stereo):

ffprobe covid.wav

Create a sine wave with the same length, sampling rate and number of channels:

sox -n -r 44100 sine-200.wav synth 49.53 sine 200 remix 1 1

Combine the two into the deidentified audio:

sox -T covid.wav sine-200.wav  covid_distorted_sine-200.wav

A value of 200 makes it still easy to understand, a value of 500 is relatively difficult. Users are invited to experiment with the exact value that works best for their use case.

These steps need to be integrated into the code and made configurable via the command line interface.

@peteruhrig
Copy link
Author

Actually, a value of 300 is still intelligible, so we should use this as our default.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant