Commits · 490559079df00481d988cbb6f3bb20f5e48a9123 · Simon Will / kaldi-commonvoice

May 23, 2016
- Update documentation URLs in src/ · 49055907
  Allen Guo authored 8 years ago
  
  49055907
- Update documentation URLs in egs/ · 2917e42e
  Allen Guo authored 8 years ago
  
  2917e42e
May 20, 2016

Merge pull request #796 from kangshiyin/mkl-static-link-bug · 71ffc7b1
Daniel Povey authored 8 years ago
```
Fix bug: static link to MKL 11.3.2 failed.
```
71ffc7b1
Merge pull request #793 from danpovey/nnet3-decoding-dim-check · 45b98978
Daniel Povey authored 8 years ago
```
Add dimension check in online-nnet3 decoding code, so we get more mea…
```
45b98978
Merge pull request #794 from akreal/master · 4a76336a
Daniel Povey authored 8 years ago
```
Add missing dependencies to Makefiles
```
4a76336a

Fix bug: static link to MKL failed. · 95010297

Shiyin Kang authored 8 years ago

$ ./configure --mkl-root=/opt/intel/mkl --static-math=yes
...
Configuring MKL library directory: Found: /opt/intel/mkl/lib/intel64
MKL configured with threading: sequential, libs:  -Wl,--start-group /opt/intel/mkl/lib/intel64/libmkl_intel_lp64.a /opt/intel/mkl/lib/intel64/libmkl_core.a /opt/intel/mkl/lib/intel64/libmkl_sequential.a -Wl,--end-group
MKL include directory configured as: /opt/intel/mkl/include
Configuring MKL threading as sequential
MKL threading libraries configured as   -lpthread -lm
Using Intel MKL as the linear algebra library.
/opt/intel/mkl/lib/intel64/libmkl_core.a(mkl_memory_patched.o): In function `mkl_serv_set_memory_limit':
mkl_memory.c:(.text+0x49c): undefined reference to `dlsym'
mkl_memory.c:(.text+0x4b2): undefined reference to `dlsym'
mkl_memory.c:(.text+0x4c8): undefined reference to `dlsym'
/opt/intel/mkl/lib/intel64/libmkl_core.a(mkl_memory_patched.o): In function `mkl_serv_allocate':
mkl_memory.c:(.text+0x1251): undefined reference to `dlsym'
mkl_memory.c:(.text+0x1267): undefined reference to `dlsym'
...

95010297

Add missing dependencies to Makefiles · 5b3fccd5
Pavel Denisov authored 8 years ago

5b3fccd5

May 19, 2016

Merge remote-tracking branch 'upstream/chain' · 8cf4d782
Daniel Povey authored 8 years ago

8cf4d782
Add dimension check in online-nnet3 decoding code, so we get more meaningful error messages. · 8dca28c1
Daniel Povey authored 8 years ago

8dca28c1
Merge pull request #725 from xiaohui-zhang/1509 · ae5bff6f
Daniel Povey authored 8 years ago
```
added utils/combine_ali_dirs.sh (fixes #553).
```
ae5bff6f
Merge pull request #758 from danpovey/minor-change · 653c78db
Daniel Povey authored 8 years ago
```
some cosmetic changes: add comments to RNNLM rescoring utilities to r…
```
653c78db
Merge pull request #790 from kangshiyin/cumatrix-copy-trans · 60a106eb
Daniel Povey authored 8 years ago
```
Speed up CuMatrix<Real>::Transpose() and transposed copy from matrix
```
60a106eb
Merge pull request #792 from vimalmanohar/smbr_bug_fix · 52557c82
Daniel Povey authored 8 years ago
```
smbr: Fixed minor bug in generating diagnostics egs
```
52557c82
added utils/combine_ali_dirs.sh (fixes #553). · 772ee4f8
xiaohui-zhang authored 8 years ago

772ee4f8
No problem on local building. Retry travis CI build. · 1fdfed4c
Shiyin Kang authored 8 years ago

1fdfed4c
Merge pull request #791 from kangshiyin/trace-mat-mat · fb3c66c0
Daniel Povey authored 8 years ago
```
2 CUDA kernels for TraceMatMat with/without transpose for all matrix size.
```
fb3c66c0

2 CUDA kernels for TraceMatMat with/without transpose for all matrix size. · 70df8813

Shiyin Kang authored 8 years ago

New:
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float>, for dim = 1024, speed was 10.1076 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float> [transposed], for dim = 1024, speed was 11.8711 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double>, for dim = 1024, speed was 7.10019 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double> [transposed], for dim = 1024, speed was 7.81977 gigaflops.

Old:
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float>, for dim = 1024, speed was 4.57783 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float> [transposed], for dim = 1024, speed was 7.96795 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double>, for dim = 1024, speed was 3.61182 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double> [transposed], for dim = 1024, speed was 6.39571 gigaflops.

70df8813

Remove _copy_from_mat_trans<16>, not used any more. · 9af66530
Shiyin Kang authored 8 years ago

9af66530

May 18, 2016

Merge pull request #786 from freewym/librispeech_nnet2 · 0d4f1b24
Daniel Povey authored 8 years ago
```
add new results for Multi-splice version of online recipe of Librispeech, including those on test set.
```
0d4f1b24

A new copy transpose kernel with same performance as plain copy. · 13792af4

Shiyin Kang authored 8 years ago

LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<float>, for dim = 1024, speed was 14.0498 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<float>, for dim = 1024, speed was 16.845 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<float>, for dim = 1024, speed was 14.2464 gigaflops.
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<double>, for dim = 1024, speed was 10.4523 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<double>, for dim = 1024, speed was 9.65529 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<double>, for dim = 1024, speed was 8.52148 gigaflops.

13792af4

Add code for cumatrix copy transpose benchmark · c765ba6e

Shiyin Kang authored 8 years ago

Add barrier for correct timing.

Original performance:
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<float>, for dim = 1024, speed was 4.26727 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<float>, for dim = 1024, speed was 5.97203 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<float>, for dim = 1024, speed was 3.0816 gigaflops.
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<double>, for dim = 1024, speed was 3.95059 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<double>, for dim = 1024, speed was 4.36189 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<double>, for dim = 1024, speed was 2.39275 gigaflops.

c765ba6e

fix name of decoding script in some old nnet2 recipes (thanks: miguelcm86@gmail.com) · b45a70b9
Daniel Povey authored 8 years ago

b45a70b9
Merge pull request #755 from vesis84/error_logs · ea7f04b2
Daniel Povey authored 8 years ago
```
base/kaldi_error : the error messages are no longer printed 2x
```
ea7f04b2
Merge pull request #780 from kangshiyin/faster-FindRowMaxId · 26ef88fe
Daniel Povey authored 8 years ago
```
A new CUDA kernel for CuMatrixBase<Real>::FindRowMaxId;
```
26ef88fe
nnet1: updating scripts, the mechanism of appending i-vectors is becoming more generic, · 60965311
vesis84 authored 8 years ago
```
- the binary can be replaced (so we could eventually append posteriors, features, etc.)
```
60965311
Merge pull request #789 from glorey/master · 9ae455aa
Jan "yenda" Trmal authored 8 years ago
```
align-equal-compiled.cc: correct the usage description
```
9ae455aa
correct the usage description · 0731b8f6
wan guanglu authored 8 years ago

0731b8f6
add barrier for correct timing. · 24b886a2
Shiyin Kang authored 8 years ago

24b886a2
a few more comments · a829139c
kangshiyin authored 8 years ago

a829139c
Keep cu-matrix-speed-test.cc unchanged. I thought dimN was typo. It is not. · 7c88a9e1
kangshiyin authored 8 years ago

7c88a9e1

A new CUDA kernel for CuMatrixBase<Real>::FindRowMaxId; · 074e0053

sykang@sepc83 authored 8 years ago

Old:
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<float>, for dim = 1024, speed was 3.99218 gigaflops.
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<double>, for dim = 1024, speed was 3.46283 gigaflops.

New:
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<float>, for dim = 1024, speed was 66.2965 gigaflops.
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<double>, for dim = 1024, speed was 58.442 gigaflops.

074e0053

May 17, 2016
- add new results for Multi-splice version of online recipe (5/16/2016). · f7b34367
  freewym authored 8 years ago
  
  f7b34367
- Merge pull request #787 from tal1974/patch-1 · 489a1f5b
  Daniel Povey authored 8 years ago
  
  Update nnet1-to-raw-nnet.cc
  489a1f5b
- Merge pull request #788 from vimalmanohar/patch-4 · 1b1a108f
  Daniel Povey authored 8 years ago
  
  Update perturb_data_dir_volume.sh
  1b1a108f
- Update perturb_data_dir_volume.sh · 476f41d7
  Vimal Manohar authored 8 years ago
  
  Add seed for random number generator in utils/data/perturb_data_dir.sh
  476f41d7
- error_logs: abort or throw exception even with custom log-handler · 1449a9d0
  vesis84 authored 8 years ago
  
  1449a9d0
- Update nnet1-to-raw-nnet.cc · 95688aa7
  tal1974 authored 8 years ago
  
  Now GetParams requires vector allocated already. Changes in nnet\nnet-various.h at Apr 21, 2016
  95688aa7
- error_logs: adding 'k' to the enum. · 216f9ec4
  vesis84 authored 8 years ago
  
  216f9ec4
- add tdnn xent and tdnn+chain recipe for librispeech (#781) · 71083b66
  Yiming Wang authored 8 years ago
  
  71083b66
May 16, 2016
- improve speed of split_data.sh; includes change to filter_scps.pl (thanks to... · 76909608
  Daniel Povey authored 8 years ago
  
  improve speed of split_data.sh; includes change to filter_scps.pl (thanks to remi francis for noticing the issue)
  76909608