Commits · d8b196951c1cf3437b3fa6cd76edbbc0542b3db9 · Simon Will / kaldi-commonvoice

May 25, 2016
- cpplit.py : relocaing script to misc/maintenance, · 569da136
  vesis84 authored 8 years ago
  
  - adding script to fix whitespace in C++ code,
  569da136
May 24, 2016

Minor improvements to documentation of data-preparation, to clarify about mono... · 61e7bfc9
Daniel Povey authored 8 years ago
```
Minor improvements to documentation of data-preparation, to clarify about mono wav files and utterance/speaker order agreement.
```
61e7bfc9

Change src/configure to also exclude gcc version 4.8.1 and to refuse to... · e69198c3

Daniel Povey authored 8 years ago

Change src/configure to also exclude gcc version 4.8.1 and to refuse to continue if 'bad' versionf of gcc are in use.
Also add warning about OS X El Capitan, about matrix-library bug.

e69198c3

May 23, 2016
- Update documentation URLs in src/ · 49055907
  Allen Guo authored 8 years ago
  
  49055907
May 20, 2016

Fix bug: static link to MKL failed. · 95010297

Shiyin Kang authored 8 years ago

$ ./configure --mkl-root=/opt/intel/mkl --static-math=yes
...
Configuring MKL library directory: Found: /opt/intel/mkl/lib/intel64
MKL configured with threading: sequential, libs:  -Wl,--start-group /opt/intel/mkl/lib/intel64/libmkl_intel_lp64.a /opt/intel/mkl/lib/intel64/libmkl_core.a /opt/intel/mkl/lib/intel64/libmkl_sequential.a -Wl,--end-group
MKL include directory configured as: /opt/intel/mkl/include
Configuring MKL threading as sequential
MKL threading libraries configured as   -lpthread -lm
Using Intel MKL as the linear algebra library.
/opt/intel/mkl/lib/intel64/libmkl_core.a(mkl_memory_patched.o): In function `mkl_serv_set_memory_limit':
mkl_memory.c:(.text+0x49c): undefined reference to `dlsym'
mkl_memory.c:(.text+0x4b2): undefined reference to `dlsym'
mkl_memory.c:(.text+0x4c8): undefined reference to `dlsym'
/opt/intel/mkl/lib/intel64/libmkl_core.a(mkl_memory_patched.o): In function `mkl_serv_allocate':
mkl_memory.c:(.text+0x1251): undefined reference to `dlsym'
mkl_memory.c:(.text+0x1267): undefined reference to `dlsym'
...

95010297

Add missing dependencies to Makefiles · 5b3fccd5
Pavel Denisov authored 8 years ago

5b3fccd5

May 19, 2016

Add dimension check in online-nnet3 decoding code, so we get more meaningful error messages. · 8dca28c1
Daniel Povey authored 8 years ago

8dca28c1
No problem on local building. Retry travis CI build. · 1fdfed4c
Shiyin Kang authored 8 years ago

1fdfed4c

2 CUDA kernels for TraceMatMat with/without transpose for all matrix size. · 70df8813

Shiyin Kang authored 8 years ago

New:
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float>, for dim = 1024, speed was 10.1076 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float> [transposed], for dim = 1024, speed was 11.8711 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double>, for dim = 1024, speed was 7.10019 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double> [transposed], for dim = 1024, speed was 7.81977 gigaflops.

Old:
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float>, for dim = 1024, speed was 4.57783 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<float> [transposed], for dim = 1024, speed was 7.96795 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double>, for dim = 1024, speed was 3.61182 gigaflops.
LOG (TestCuMatrixTraceMatMat():cu-matrix-speed-test.cc:458) For CuMatrix::TraceMatMat<double> [transposed], for dim = 1024, speed was 6.39571 gigaflops.

70df8813

Remove _copy_from_mat_trans<16>, not used any more. · 9af66530
Shiyin Kang authored 8 years ago

9af66530

May 18, 2016

A new copy transpose kernel with same performance as plain copy. · 13792af4

Shiyin Kang authored 8 years ago

LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<float>, for dim = 1024, speed was 14.0498 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<float>, for dim = 1024, speed was 16.845 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<float>, for dim = 1024, speed was 14.2464 gigaflops.
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<double>, for dim = 1024, speed was 10.4523 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<double>, for dim = 1024, speed was 9.65529 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<double>, for dim = 1024, speed was 8.52148 gigaflops.

13792af4

Add code for cumatrix copy transpose benchmark · c765ba6e

Shiyin Kang authored 8 years ago

Add barrier for correct timing.

Original performance:
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<float>, for dim = 1024, speed was 4.26727 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<float>, for dim = 1024, speed was 5.97203 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<float>, for dim = 1024, speed was 3.0816 gigaflops.
LOG (TestCuMatrixTransposeCross():cu-matrix-speed-test.cc:91) For CuMatrix::TransposeCross<double>, for dim = 1024, speed was 3.95059 gigaflops.
LOG (TestCuMatrixTransposeS():cu-matrix-speed-test.cc:72) For CuMatrix::TransposeS<double>, for dim = 1024, speed was 4.36189 gigaflops.
LOG (TestCuMatrixTransposeNS():cu-matrix-speed-test.cc:56) For CuMatrix::TransposeNS<double>, for dim = 1024, speed was 2.39275 gigaflops.

c765ba6e

correct the usage description · 0731b8f6
wan guanglu authored 8 years ago

0731b8f6
add barrier for correct timing. · 24b886a2
Shiyin Kang authored 8 years ago

24b886a2
a few more comments · a829139c
kangshiyin authored 8 years ago

a829139c
Keep cu-matrix-speed-test.cc unchanged. I thought dimN was typo. It is not. · 7c88a9e1
kangshiyin authored 8 years ago

7c88a9e1

A new CUDA kernel for CuMatrixBase<Real>::FindRowMaxId; · 074e0053

sykang@sepc83 authored 8 years ago

Old:
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<float>, for dim = 1024, speed was 3.99218 gigaflops.
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<double>, for dim = 1024, speed was 3.46283 gigaflops.

New:
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<float>, for dim = 1024, speed was 66.2965 gigaflops.
LOG (TestCuFindRowMaxId():cu-matrix-speed-test.cc:264) For CuMatrix::FindRowMaxId<double>, for dim = 1024, speed was 58.442 gigaflops.

074e0053

May 17, 2016
- error_logs: abort or throw exception even with custom log-handler · 1449a9d0
  vesis84 authored 8 years ago
  
  1449a9d0
- Update nnet1-to-raw-nnet.cc · 95688aa7
  tal1974 authored 8 years ago
  
  Now GetParams requires vector allocated already. Changes in nnet\nnet-various.h at Apr 21, 2016
  95688aa7
- error_logs: adding 'k' to the enum. · 216f9ec4
  vesis84 authored 8 years ago
  
  216f9ec4
May 16, 2016
- Fixes in comments in chain-denominator.h (thanks: hhadian) · 53e5e7f1
  Daniel Povey authored 8 years ago
  
  53e5e7f1
May 15, 2016
- Clean up unneeded flags · 2f6809c5
  Allen Guo authored 8 years ago
  
  2f6809c5
May 13, 2016
- Further fix to GMM->UBM code to resolve the issue that the last commit addressed (RE zero weights) · e1d56526
  Daniel Povey authored 8 years ago
  
  e1d56526
- minor fix to am-diag-gmm.cc to prevent a crash when clustering to a UBM when... · f1c43ac4
  Daniel Povey authored 8 years ago
  
  minor fix to am-diag-gmm.cc to prevent a crash when clustering to a UBM when there are too few Gaussians and some zero counts.
  f1c43ac4
May 12, 2016
- nnet1: integrating comments from Dan. · bffbe481
  vesis84 authored 8 years ago
  
  bffbe481
May 11, 2016

fix small bug regarding online-ivector-features (for online-nnet2/nnet3 setup)... · 50b76350

Daniel Povey authored 8 years ago

fix small bug regarding online-ivector-features (for online-nnet2/nnet3 setup) regarding how silence-weighting is applied in iVector estimation. (thanks: xiang li)

50b76350

May 10, 2016

base/kaldi_error : integrating comments from Dan. · a4fff0d5
vesis84 authored 8 years ago

a4fff0d5

base/kaldi_error : refactoring the logging code · 24bef8dc

vesis84 authored 8 years ago

- some TODO's are to be decided:
  - Can we remove the: 'IsKaldiError()'? (It's very 'dirty' function. And it's used only in the table-I/O to suppress printing 'what' messages from KALDI_ERR. IMHO, it may not be a good idea to suppress this.)
  - With Kirill's log-handler, the log is sent and then there's no abort() for errors/asserts (seems like a bad idea, but it is the way it worked previously).

24bef8dc

May 09, 2016

some cosmetic changes: add comments to RNNLM rescoring utilities to refer to... · c9c5d595

Dan Povey authored 8 years ago

some cosmetic changes: add comments to RNNLM rescoring utilities to refer to each other; improve messages printed out by 'configure'

c9c5d595

May 07, 2016

some cosmetic changes: add comments to RNNLM rescoring utilities to refer to... · 3c6b63c4

Dan Povey authored 8 years ago

some cosmetic changes: add comments to RNNLM rescoring utilities to refer to each other; improve messages printed out by 'configure'

3c6b63c4

May 05, 2016
- Use OnlineFeatureInterface instead of hardcoded OnlineNnet2FeaturePipeline · 47b7a6b5
  Nickolay Shmyrev authored 8 years ago
  
  in nnet2 and nnet3 decoders to allow more flexible feature pipelines.
  47b7a6b5
- Added --speex-* options to configure. · 66bb0643
  Giulio Paci authored 8 years ago
  
  66bb0643
- Fixed a check in configure. · d4b76616
  Giulio Paci authored 8 years ago
  
  d4b76616
May 04, 2016

base/kaldi_error : the error messages are no longer printed 2x · 1bcdf6aa

vesis84 authored 8 years ago

- e.what() contains stackttrace or is empty string
- we should also consider changing:
  'std::cerr << e.what();' -> 'fprintf(stderr, e.what().c_str());'
- fprintf is thread-safe and it is better not to mix 'std::cerr' and
  'stderr', and 'stderr' is already used for logging...

1bcdf6aa

nnet1: adding back the 'cerr' prints of exceptions, · 80d6d43e
vesis84 authored 8 years ago

80d6d43e
nnet3: Undo the damage done by 2310a19c and fix the original problem properly · 048c01d0
Vassil Panayotov authored 8 years ago
```
Also add an additional unit test for the topological sorting function
```
048c01d0

another fix to ComposeCompactLatticeDeterministic(), and a fix (plus cleanup)... · faa9c446

Daniel Povey authored 8 years ago

another fix to ComposeCompactLatticeDeterministic(), and a fix (plus cleanup) to word-align-lattice.cc, that affects the --test=true option of lattice-align-words for some topologies including chain models.

faa9c446

May 03, 2016

Update to ComposeCompactLatticeDeterministic to fix a bug regarding the olabel... · 0e284f3b

Daniel Povey authored 8 years ago

Update to ComposeCompactLatticeDeterministic to fix a bug regarding the olabel (which shouldn't matter in our normal use-cases), and make final-probs application more efficient.

0e284f3b

cudamatrix: adding comment to CuArray::{Min,Max}() · 2d5404ce
vesis84 authored 8 years ago

2d5404ce

cudamatrix: adding new '.cc' file for CuArray, · b07db175

vesis84 authored 8 years ago

- reordering cudamatrix/cu-array-inl.h, so the methods appear in the same
  order as in cudamatrix/cu-array.h
- adding asserts for 0-dim of CuArray

b07db175