shennong-1.0

Latest

Latest

mmmaat released this 25 May 16:21

· 3 commits to master since this release

shennong-1.0

breaking changes

Deletion of processor.rastaplp.RastaPlpProcessor. Rasta filtering is now an
option of the standard PlpProcessor, so as to use both Rasta and VTLN.
Pitch, delta and CMVN are now deactivated by default in pipeline configuration
generation (concerns both speech-features binary and the shennong.pipeline
module).
Features serialization in JSON is no more supported (replaced by CSV)
Code reorganization (renamed shennong.features.* to shennong.*, new
shennong.logger, import processors directly from
shennong.processor, renamed PitchProcessor to KaldiPitchProcessor,
etc...)
When defining utterances for use with a pipeline, the format <audio-file>
is no more supported, it must be superseeded by <utterance-id> <audio-file>.
processor.process_all() now takes a shennong.Utterances instead of a
dict(name, audio).

new models

Vocal Tract Length Normalization (VTLN) implemented using Kaldi:
shennong.processor.vtln.
CREPE pitch extraction: shennong.processor.pitch_crepe.

improvments

new shennong.Utterances class to encapsulate and manage utterances to be
feeded to a pipeline.
shennong.Audio can now read/write more than wav files: flac, mp3, etc...
(anything supported by pydub/ffmpeg).
Optionnally ignore features properties when saving them.
New example code in examples.

Assets 2