Tdnn pool #3 (Open)

wants to merge 1,866 commits into master
Conversation

vijayaditya (Owner)

No description provided.

vdp and others added 30 commits April 8, 2016 13:44
rm/s5: Don't use the no-longer-existing '--max-arcs' option when calling gmm-latgen-faster

The binary complains that there is no such option and exits abnormally, which in turn causes local/test_decoders.sh to fail with a scary error message.
12 hours is the estimate I got when I ran the recipe:
```
root@27af4331c701:/opt/kaldi/egs/swbd/s5c/data/train_30kshort
$ awk '{s+=$4-$3}END{print s}' segments 
43022.7
```
(Each line of a segments file is `<utt-id> <rec-id> <start> <end>` with times in seconds, so the sum is the total audio: 43022.7 s / 3600 ≈ 11.95 hours.)
…with online decoding (and to enable a fix to the --snip-edges=false bug).
…o write online feature-extraction code that would respect the snip-edges=false option.
- the functionality stays the same, but it is now more 'correct', as we
  no longer do the triple cast through 'bool'.
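For context, the --snip-edges option changes how many frames a waveform yields: with --snip-edges=true only windows that fit entirely inside the signal produce frames, while with --snip-edges=false frames are centered on multiples of the frame shift and the edges are padded, which is what makes a streaming/online implementation harder. A minimal sketch of the two frame-count conventions as I understand them (the helper is illustrative, not Kaldi's actual API):
```
#include <cstdint>
#include <cstdio>

// Illustrative frame-count conventions (hypothetical helper, not Kaldi's API).
int64_t NumFrames(int64_t num_samples, int64_t window, int64_t shift,
                  bool snip_edges) {
  if (snip_edges) {
    // Count only windows that lie entirely inside the signal.
    return num_samples < window ? 0 : 1 + (num_samples - window) / shift;
  } else {
    // One frame centered on each multiple of 'shift'; edges are padded.
    return (num_samples + shift / 2) / shift;
  }
}

int main() {
  // 1 s of 16 kHz audio, 25 ms window (400 samples), 10 ms shift (160).
  printf("snip-edges=true : %lld\n", (long long)NumFrames(16000, 400, 160, true));   // 98
  printf("snip-edges=false: %lld\n", (long long)NumFrames(16000, 400, 160, false));  // 100
  return 0;
}
```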
moving the src/path.sh into tools/config/common_path.sh
Fixing stuff and renaming to am_nnet.
WIP: fix for utt2dur when applied to speed-perturbed data
…es option; includes rewrite of window extraction code.
…the HUB4 corpus, update to local/make_bn.py so that it is more flexible with regard to differences in source directory format
smbr: Avoid extra epochs if frame shift is not used during training
KarelVesely84 and others added 28 commits May 18, 2016 20:38
…ming more generic,

- the binary can be replaced (so we could eventually append posteriors, features, etc.)
A new CUDA kernel for CuMatrixBase<Real>::FindRowMaxId;
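Kernels like FindRowMaxId are typically organized as one thread block per row: each thread scans a strided slice of the row, then a shared-memory tree reduction keeps the (max value, index) pair. A hedged sketch of that pattern (not the kernel this commit adds):
```
#include <cfloat>
#include <cstdio>
#include <cuda_runtime.h>

// One block per row; tree reduction over (max value, argmax) pairs in
// shared memory. blockDim.x must be a power of two. Sketch only.
__global__ void FindRowMaxIdSketch(const float* mat, int cols, int* max_ids) {
  extern __shared__ char smem[];
  float* svals = reinterpret_cast<float*>(smem);
  int* sidx = reinterpret_cast<int*>(svals + blockDim.x);

  const float* row = mat + (size_t)blockIdx.x * cols;
  float best = -FLT_MAX;
  int best_i = -1;
  for (int j = threadIdx.x; j < cols; j += blockDim.x)
    if (row[j] > best) { best = row[j]; best_i = j; }
  svals[threadIdx.x] = best;
  sidx[threadIdx.x] = best_i;
  __syncthreads();

  for (int s = blockDim.x / 2; s > 0; s >>= 1) {
    if (threadIdx.x < s && svals[threadIdx.x + s] > svals[threadIdx.x]) {
      svals[threadIdx.x] = svals[threadIdx.x + s];
      sidx[threadIdx.x] = sidx[threadIdx.x + s];
    }
    __syncthreads();
  }
  if (threadIdx.x == 0) max_ids[blockIdx.x] = sidx[0];
}

int main() {
  const int rows = 2, cols = 5, threads = 128;
  float h[rows * cols] = {0, 3, 1, 2, -1,
                          9, 3, 1, 2, -1};
  float* d_mat; int* d_ids; int h_ids[rows];
  cudaMalloc(&d_mat, sizeof(h));
  cudaMalloc(&d_ids, rows * sizeof(int));
  cudaMemcpy(d_mat, h, sizeof(h), cudaMemcpyHostToDevice);
  size_t shmem = threads * (sizeof(float) + sizeof(int));
  FindRowMaxIdSketch<<<rows, threads, shmem>>>(d_mat, cols, d_ids);
  cudaMemcpy(h_ids, d_ids, sizeof(h_ids), cudaMemcpyDeviceToHost);
  printf("argmax: row0=%d row1=%d\n", h_ids[0], h_ids[1]);  // 1 and 0
  cudaFree(d_mat); cudaFree(d_ids);
  return 0;
}
```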
base/kaldi_error: the error messages are no longer printed twice
Add barrier for correct timing.

Original performance:
```
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<float>, for dim = 1024, speed was 4.26727 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<float>, for dim = 1024, speed was 5.97203 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<float>, for dim = 1024, speed was 3.0816 gigaflops.
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<double>, for dim = 1024, speed was 3.95059 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<double>, for dim = 1024, speed was 4.36189 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<double>, for dim = 1024, speed was 2.39275 gigaflops.
```

New performance:
```
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<float>, for dim = 1024, speed was 14.0498 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<float>, for dim = 1024, speed was 16.845 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<float>, for dim = 1024, speed was 14.2464 gigaflops.
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<double>, for dim = 1024, speed was 10.4523 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<double>, for dim = 1024, speed was 9.65529 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<double>, for dim = 1024, speed was 8.52148 gigaflops.
```
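The barrier matters because CUDA kernel launches are asynchronous: a host-side timer has to be bracketed by synchronization, before starting (to drain previously queued work, which would otherwise be billed to the kernel being measured) and after launching (so the kernel actually finishes before the clock stops). A minimal illustration of the pattern, not the actual test harness:
```
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

__global__ void Busy(float* x, int n) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) x[i] = x[i] * 2.0f + 1.0f;
}

int main() {
  const int n = 1 << 20;
  float* d;
  cudaMalloc(&d, n * sizeof(float));
  cudaMemset(d, 0, n * sizeof(float));

  cudaDeviceSynchronize();               // barrier: drain pending work
  auto t0 = std::chrono::steady_clock::now();
  Busy<<<(n + 255) / 256, 256>>>(d, n);  // returns immediately...
  cudaDeviceSynchronize();               // ...so wait for completion
  auto t1 = std::chrono::steady_clock::now();

  printf("kernel took %.1f us\n",
         std::chrono::duration<double, std::micro>(t1 - t0).count());
  cudaFree(d);
  return 0;
}
```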
add new results for the Multi-splice version of the Librispeech online recipe, including results on the test set.
2 CUDA kernels for TraceMatMat with/without transpose for all matrix sizes.

New:
```
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float>, for dim = 1024, speed was 10.1076 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float> [transposed], for dim = 1024, speed was 11.8711 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double>, for dim = 1024, speed was 7.10019 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double> [transposed], for dim = 1024, speed was 7.81977 gigaflops.
```

Old:
```
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float>, for dim = 1024, speed was 4.57783 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float> [transposed], for dim = 1024, speed was 7.96795 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double>, for dim = 1024, speed was 3.61182 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double> [transposed], for dim = 1024, speed was 6.39571 gigaflops.
```
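The two cases have very different memory behavior: tr(A B^T) = sum_ij A(i,j)*B(i,j) is an elementwise dot product with fully coalesced reads of both matrices, while tr(A B) = sum_ij A(i,j)*B(j,i) walks B with a column stride, which is why separate kernels are worthwhile. A sketch of the transposed case only (illustrative, float-only since double atomicAdd needs sm_60+; not the actual Kaldi kernels):
```
#include <cstdio>
#include <cuda_runtime.h>

// tr(A * B^T) = sum over all elements of A[i]*B[i] when both are stored
// row-major: a grid-stride elementwise dot product, reduced per block in
// shared memory, with one atomicAdd per block into the result.
__global__ void TraceMatMatTransSketch(const float* a, const float* b,
                                       int n_elems, float* result) {
  __shared__ float partial[256];  // blockDim.x must be 256
  float sum = 0.0f;
  for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n_elems;
       i += gridDim.x * blockDim.x)
    sum += a[i] * b[i];
  partial[threadIdx.x] = sum;
  __syncthreads();
  for (int s = blockDim.x / 2; s > 0; s >>= 1) {
    if (threadIdx.x < s) partial[threadIdx.x] += partial[threadIdx.x + s];
    __syncthreads();
  }
  if (threadIdx.x == 0) atomicAdd(result, partial[0]);
}

int main() {
  const int dim = 1024, n = dim * dim;
  float *a, *b, *r;  // unified memory keeps the demo short
  cudaMallocManaged(&a, n * sizeof(float));
  cudaMallocManaged(&b, n * sizeof(float));
  cudaMallocManaged(&r, sizeof(float));
  for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }
  *r = 0.0f;
  TraceMatMatTransSketch<<<128, 256>>>(a, b, n, r);
  cudaDeviceSynchronize();
  printf("tr(A B^T) = %.0f (expect %d)\n", *r, 2 * n);
  return 0;
}
```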
smbr: Fixed minor bug in generating diagnostics egs
Speed up CuMatrix<Real>::Transpose() and transposed copy from matrix
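A factor-of-three jump like the one logged above is what shared-memory tiling typically buys: a naive transpose makes either its reads or its writes uncoalesced, whereas staging 32x32 tiles in shared memory, padded by one column to avoid bank conflicts, keeps both sides coalesced. A sketch of that classic pattern, not necessarily the kernel this commit added:
```
#include <cuda_runtime.h>

#define TILE 32

// out (cols x rows) = transpose of in (rows x cols), both row-major.
// The tile is loaded with coalesced reads and written back with
// coalesced writes; the +1 column avoids shared-memory bank conflicts
// on the transposed access. Launch with dim3 block(TILE, TILE) and
// dim3 grid((cols + TILE - 1) / TILE, (rows + TILE - 1) / TILE).
__global__ void TransposeTiled(const float* in, float* out,
                               int rows, int cols) {
  __shared__ float tile[TILE][TILE + 1];

  int c = blockIdx.x * TILE + threadIdx.x;  // column in 'in'
  int r = blockIdx.y * TILE + threadIdx.y;  // row in 'in'
  if (r < rows && c < cols)
    tile[threadIdx.y][threadIdx.x] = in[r * cols + c];
  __syncthreads();

  int tc = blockIdx.y * TILE + threadIdx.x;  // column in 'out'
  int tr = blockIdx.x * TILE + threadIdx.y;  // row in 'out'
  if (tr < cols && tc < rows)
    out[tr * rows + tc] = tile[threadIdx.x][threadIdx.y];
}
```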
some cosmetic changes: add comments to RNNLM rescoring utilities to r…
added utils/combine_ali_dirs.sh (fixes kaldi-asr#553).
Add missing dependencies to Makefiles
Add dimension check in online-nnet3 decoding code, so we get more mea…
Fix bug: static link to MKL 11.3.2 failed.

```
$ ./configure --mkl-root=/opt/intel/mkl --static-math=yes
...
Configuring MKL library directory: Found: /opt/intel/mkl/lib/intel64
MKL configured with threading: sequential, libs:  -Wl,--start-group /opt/intel/mkl/lib/intel64/libmkl_intel_lp64.a /opt/intel/mkl/lib/intel64/libmkl_core.a /opt/intel/mkl/lib/intel64/libmkl_sequential.a -Wl,--end-group
MKL include directory configured as: /opt/intel/mkl/include
Configuring MKL threading as sequential
MKL threading libraries configured as   -lpthread -lm
Using Intel MKL as the linear algebra library.

/opt/intel/mkl/lib/intel64/libmkl_core.a(mkl_memory_patched.o): In function `mkl_serv_set_memory_limit':
mkl_memory.c:(.text+0x49c): undefined reference to `dlsym'
mkl_memory.c:(.text+0x4b2): undefined reference to `dlsym'
mkl_memory.c:(.text+0x4c8): undefined reference to `dlsym'
/opt/intel/mkl/lib/intel64/libmkl_core.a(mkl_memory_patched.o): In function `mkl_serv_allocate':
mkl_memory.c:(.text+0x1251): undefined reference to `dlsym'
mkl_memory.c:(.text+0x1267): undefined reference to `dlsym'
...
```

(`dlsym` is provided by libdl, so the static MKL link line also needs `-ldl`.)
vijayaditya pushed a commit that referenced this pull request Nov 30, 2016
added the option trainer.deriv-truncate-margin to train_rnn.py and tr…