Version numbers follow semantic versioning.
-
Deletion of
processor.rastaplp.RastaPlpProcessor
. Rasta filtering is now an option of the standardPlpProcessor
, so as to use both Rasta and VTLN. -
Pitch, delta and CMVN are now deactivated by default in pipeline configuration generation (concerns both
speech-features
binary and theshennong.pipeline
module). -
Features serialization in JSON is no more supported (replaced by CSV)
-
Code reorganization (renamed
shennong.features.*
toshennong.*
, newshennong.logger
, import processors directly fromshennong.processor
, renamedPitchProcessor
toKaldiPitchProcessor
, etc...) -
When defining utterances for use with a pipeline, the format
<audio-file>
is no more supported, it must be superseeded by<utterance-id> <audio-file>
. -
processor.process_all()
now takes ashennong.Utterances
instead of adict(name, audio)
.
-
Vocal Tract Length Normalization (VTLN) implemented using Kaldi:
shennong.processor.vtln
. -
CREPE pitch extraction:
shennong.processor.pitch_crepe
.
-
new
shennong.Utterances
class to encapsulate and manage utterances to be feeded to a pipeline. -
shennong.Audio
can now read/write more than wav files: flac, mp3, etc... (anything supported by pydub/ffmpeg). -
Optionnally ignore features properties when saving them.
-
New example code in
examples
.
-
correctly load utterances from a file in
speech-features
(ignore empty lines) -
fixed a harmless warning when resampling audio with sox
-
now compatible with (and depends on)
pytest>=5.0
-
improved installation instructions for MacOS
-
new releases are automatically deployed on conda
First public release