Commits · b0b71220115b96b7d1d0f3f0f3dd08bc578a6afe · Simon Will / kaldi-commonvoice

Sep 20, 2016
- fixing the incorrect check for "which" command (#1043) · b0b71220
  Jan "yenda" Trmal authored 8 years ago
  
- Fix some bugs in egs/hkust/s5/local/hkust_prepare_dict.sh (#1044) · 2bd99775
  ling0322 authored 8 years ago
  
- Merge pull request #1039 from aevernon/master · 85d8131e
  Daniel Povey authored 8 years ago
  
  Check that the `which` command exists
Sep 19, 2016

Check that the `which` command exists · 45edf34e

Albert Vernon authored 8 years ago

check_dependencies.sh depends on `which`, but some distributions, such as those intended for use with Docker, do not include it. Check to see if it is installed.

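The same idea can be illustrated outside the shell: a dependency check that does not rely on an external `which` binary. The sketch below is Python, not code from check_dependencies.sh; the helper name `missing_tools` and the tool names passed to it are hypothetical.

```python
import shutil


def missing_tools(names):
    """Return the subset of `names` that cannot be found on PATH."""
    # shutil.which searches PATH itself, so no external `which` is needed.
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    # Tool names here are examples only, not the list the script checks.
    absent = missing_tools(["sox", "awk", "no-such-tool-example"])
    if absent:
        print("Missing tools: " + ", ".join(absent))
```

An empty return value means every listed tool was found.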

Sep 17, 2016
- fix bad link in previous commit · 85a3dd5f
  Daniel Povey authored 8 years ago
  
- Moving scripts in swbd/s5c/local/chain/ around for greater discoverability. · b3e5a5ad
  Daniel Povey authored 8 years ago
  
- Merge pull request #1034 from kaldi-asr/revert-899-tdnn_increasing_hidden_dims · cd6321c2
  Daniel Povey authored 8 years ago
  
  Revert "introduce a new splice configuration for tdnn+xent on swbd as default…"
- Revert "introduce a new splice configuration for tdnn+xent on swbd as default…" · 731cd0bf
  Daniel Povey authored 8 years ago
  
- Merge pull request #899 from freewym/tdnn_increasing_hidden_dims · fb9f6695
  Daniel Povey authored 8 years ago
  
  introduce a new splice configuration for tdnn+xent on swbd as default…
Sep 16, 2016
- Merge pull request #1029 from sih4sing5hong5/fix_lmrescore_arpa_usage · 9334df53
  Daniel Povey authored 8 years ago
  
  fixed the usage of lmrescore_const_arpa.sh
- fixed the usage of lmrescore_const_arpa.sh · 7379cddb
  薛丞宏 authored 8 years ago
  
Sep 14, 2016

- Merge pull request #1023 from kangshiyin/diff-log-softmax · 58b1de9c
  Daniel Povey authored 8 years ago
  
  Speed up LogSoftmaxComponent::Backprop

single-kernel impl for diff log softmax · bc79ed49

Shiyin Kang authored 8 years ago

bench result:
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 16, speed was 0.0152883 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 16, speed was 0.00217375 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 32, speed was 0.0577221 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 32, speed was 0.00867094 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 64, speed was 0.267811 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 64, speed was 0.035306 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 128, speed was 0.878541 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 128, speed was 0.134737 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 256, speed was 2.8799 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 256, speed was 0.491975 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 512, speed was 6.20522 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 512, speed was 1.34159 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 1024, speed was 10.4197 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 1024, speed was 2.4438 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 2048, speed was 10.5138 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 2048, speed was 2.97796 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 4096, speed was 10.3679 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 4096, speed was 3.25972 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 16, speed was 0.0139596 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 16, speed was 0.00193458 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 32, speed was 0.0573372 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 32, speed was 0.0073193 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 64, speed was 0.197072 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 64, speed was 0.0282332 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 128, speed was 0.751801 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 128, speed was 0.111315 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 256, speed was 2.43203 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 256, speed was 0.394491 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 512, speed was 4.53031 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 512, speed was 0.930698 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 1024, speed was 5.43358 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 1024, speed was 1.52317 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 2048, speed was 5.47013 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 2048, speed was 1.84648 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 4096, speed was 5.23873 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<double>, for dim = 4096, speed was 1.87967 gigaflops.
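The New/Old pairs above are easier to compare as ratios. A minimal sketch, assuming the log format shown in this commit message; the function name `speedups` is hypothetical, and the four sample lines are copied from the output above.

```python
import re

# Matches lines such as:
# "New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 16, speed was 0.0152883 gigaflops."
LINE = re.compile(
    r"^(New|Old): For (\S+), for dim = (\d+), speed was ([0-9.eE+-]+) gigaflops\.$"
)


def speedups(log_text):
    """Map (kernel, dim) -> New/Old gigaflops ratio for every complete pair."""
    speeds = {}
    for line in log_text.splitlines():
        m = LINE.match(line.strip())
        if m:
            tag, kernel, dim, gflops = m.groups()
            speeds[(kernel, int(dim), tag)] = float(gflops)
    ratios = {}
    for (kernel, dim, tag), new_speed in speeds.items():
        if tag == "New" and (kernel, dim, "Old") in speeds:
            ratios[(kernel, dim)] = new_speed / speeds[(kernel, dim, "Old")]
    return ratios


SAMPLE = """\
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 16, speed was 0.0152883 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 16, speed was 0.00217375 gigaflops.
New: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 4096, speed was 10.3679 gigaflops.
Old: For CuMatrix::DiffLogSoftmaxPerRow<float>, for dim = 4096, speed was 3.25972 gigaflops.
"""

for (kernel, dim), ratio in sorted(speedups(SAMPLE).items()):
    print(f"{kernel} dim={dim}: {ratio:.2f}x")
```

On these figures the new kernel is about 7x faster at dim = 16 and about 3.2x faster at dim = 4096 for float.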

Conflicts:
src/cudamatrix/cu-kernels-ansi.h
src/cudamatrix/cu-kernels.h

naming of diff log softmax

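For context on the operation being benchmarked: the backward pass of a row-wise log-softmax has a simple closed form. With y = log_softmax(x) and upstream derivative e = dL/dy, the input derivative is dL/dx_i = e_i - exp(y_i) * sum_j e_j. The sketch below is a plain-Python reference for that formula on a single row, not the CUDA kernel from this commit; the function names are hypothetical.

```python
import math


def log_softmax(x):
    """Numerically stable log-softmax of one row."""
    m = max(x)
    lse = m + math.log(sum(math.exp(v - m) for v in x))
    return [v - lse for v in x]


def diff_log_softmax_row(out_value, out_deriv):
    """Backprop through log-softmax for one row.

    out_value: the forward output y = log_softmax(x)
    out_deriv: dL/dy
    returns:   dL/dx, with dL/dx_i = out_deriv_i - exp(y_i) * sum(out_deriv)
    """
    total = sum(out_deriv)
    return [d - math.exp(y) * total for y, d in zip(out_value, out_deriv)]


if __name__ == "__main__":
    x = [0.5, -1.0, 2.0]
    y = log_softmax(x)
    print(diff_log_softmax_row(y, [1.0, 0.0, 0.0]))
```

Only one row-wide sum is needed before the per-element update, which is the kind of structure a single-kernel implementation can exploit.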

- add speed test and unit test for DiffLogSoftmax · b885535e
  Shiyin Kang authored 8 years ago
- mv diff log softmax code to CuMatrix · 7a525668
  Shiyin Kang authored 8 years ago
- Merge pull request #1025 from galv/atomic-add · 13e5cc8f
  Daniel Povey authored 8 years ago
  
  Replace implementation of atomic addition.

Replace implementation of atomic addition. · 6f20b397

Daniel Galvez authored 8 years ago

Old version was based on atomicExch(), while this version uses CUDA's
built-in atomicAdd(), added in SM 2.0. When tested in isolation (test
code not provided in this commit), on a K10 (Kepler), the built-in
atomicAdd() is two times faster than the old version of atomic_add()
here, and on a 950M (Maxwell), 3 times faster.

The speedup to forward-backward, however, is marginal for an
nnet3-chain-train call on the TEDLIUM version 1 dataset:

Times reported on a K10. Note speedup in BetaDashGeneralFrame(),
which is the only code calling the atomic add function.

New code:

[cudevice profile]
AddRows	0.468516s
AddVecVec	0.553152s
MulRowsVec	0.614542s
CuMatrix::SetZero	0.649105s
CopyRows	0.748831s
TraceMatMat	0.777907s
AddVecToRows	0.780592s
CuMatrix::Resize	0.850884s
AddMat	1.23867s
CuMatrixBase::CopyFromMat(from other CuMatrixBase)	2.04559s
AddDiagMatMat	2.18652s
AddMatVec	3.67839s
AlphaGeneralFrame	6.42574s
BetaDashGeneralFrame	8.69981s
AddMatMat	29.9714s
Total GPU time:	63.8273s (may involve some double-counting)
-----

Old code:

[cudevice profile]
AddRows	0.469031s
AddVecVec	0.553298s
MulRowsVec	0.615624s
CuMatrix::SetZero	0.658105s
CopyRows	0.750856s
AddVecToRows	0.782937s
TraceMatMat	0.786361s
CuMatrix::Resize	0.91639s
AddMat	1.23964s
CuMatrixBase::CopyFromMat(from other CuMatrixBase)	2.05253s
AddDiagMatMat	2.18863s
AddMatVec	3.68707s
AlphaGeneralFrame	6.42885s
BetaDashGeneralFrame	9.03617s
AddMatMat	29.9942s
Total GPU time:	64.3928s (may involve some double-counting)
-----

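To put a number on "marginal", the relative time saved can be computed from the two profiles above. A minimal sketch using only figures reported in this commit message; the helper name `relative_reduction` is hypothetical.

```python
def relative_reduction(old_seconds, new_seconds):
    """Fraction of the old time saved by the new code."""
    return (old_seconds - new_seconds) / old_seconds


# Figures copied from the K10 profiles above.
beta_dash = relative_reduction(9.03617, 8.69981)
total = relative_reduction(64.3928, 63.8273)

print(f"BetaDashGeneralFrame: {beta_dash:.1%} less time")
print(f"Total GPU time:       {total:.1%} less time")
```

This works out to roughly 3.7% less time in BetaDashGeneralFrame and under 1% less total GPU time, consistent with the commit's description.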

Sep 13, 2016
- Small change to lattice-determinize-pruned to call fst::Connect() after determinization. · 2e8e8494
  Daniel Povey authored 8 years ago
  
Sep 11, 2016
- Merge pull request #1016 from david-ryan-snyder/sid-fix-2016-08-22 · f8c69763
  Daniel Povey authored 8 years ago
  
  Updates to SRE08 example
Sep 08, 2016
- Modify validate_dict_dir.pl to check for <eps> in lexicon. · eb49517c
  Daniel Povey authored 8 years ago
  
- Merge pull request #1021 from vesis84/New-Parametric-ReLU · 98bf4e72
  Daniel Povey authored 8 years ago
  
  nnet1: adding <ParametricRelu> component,
- nnet1: adding <ParametricRelu> component, · 0cf63a88
  vesis84 authored 8 years ago
  
- Merge pull request #1019 from vijayaditya/report_improvements · 1d418828
  Daniel Povey authored 8 years ago
  
  nnet3/report : Added plotting capability for parameter differences.
Sep 07, 2016
- nnet3/report : Added plotting capability for parameter differences. · e984dab8
  Vijayaditya Peddinti authored 8 years ago
  
Sep 06, 2016
- Merge pull request #1018 from vesis84/nnet1_blstm_update · 06e3f8d0
  Daniel Povey authored 8 years ago
  
  nnet1: minor cosmetic change,
- nnet1: minor cosmetic change, · b789820c
  vesis84 authored 8 years ago
  
Sep 05, 2016
- Merge pull request #771 from guoguo12/multi-recipe · e9852e61
  Daniel Povey authored 8 years ago
  
  WIP: Multi-database English LVCSR recipe
- Proofread recipe · c2829830
  Korbinian Riedhammer authored 8 years ago
  
- Use Tedlium release 2 scripts/data · 712a7d63
  Allen Guo authored 8 years ago
  
- Start recipe · 8a0ddc1b
  Allen Guo authored 8 years ago
  
- Merge pull request #1017 from danpovey/remove_reverse · 7cf8616c
  Daniel Povey authored 8 years ago
  
  Removing little-used feature: time-reversed, and fwd-bkwd, decoding.
- Removing little-used feature: time-reversed, and forward-backward, decoding. · 5dfa20aa
  Daniel Povey authored 8 years ago
  
Sep 02, 2016
- sid-fix: updating egs/sre08/v1/README · b41ac1c2
  David Snyder authored 8 years ago
  
- sid-fix: fixing memory requirement in run.sh · 7264a1fd
  David Snyder authored 8 years ago
  
- sid-fix: Adding i-vector length normalization to test i-vectors in egs/sre08/v1/run.sh. · c0e65938
  David Snyder authored 8 years ago
  
Sep 01, 2016

Daniel Povey authored 8 years ago

Various unrelated fixes: add --iter options to TIMIT sclite scoring; improve how syncfiles are removed in queue.pl; minor cosmetic and efficiency improvements in nnet3 code.

f0fab215

Aug 31, 2016

- Merge pull request #1005 from psmit/extract-wav-perturb · ed674ed5
  Daniel Povey authored 8 years ago
  
  Extract wav - perturb_data_dir_speed.sh implementation

Make wav-copy accept both xspecifiers and xfilenames · 278fcbe8

Peter Smit authored 8 years ago

In scripts such as perturb-speed and perturb-volume, scp lines are
transformed into piped commands with the appropriate sox command. The
case where the scp file has file offsets was not handled. This commit
both generalizes the wav-copy command to also work on xfilenames and
fixes the two perturb scripts to use this command in case of file
offsets.

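The rewriting described above can be sketched as three cases: an entry that is already a piped command, an entry with a file offset, and a plain file. This is an illustrative Python sketch, not the code of the perturb scripts; the function name `perturb_speed_entry`, the exact sox arguments, the `wav-copy <in> - |` form, and the sample paths are all assumptions.

```python
import re


def perturb_speed_entry(line, factor):
    """Rewrite one wav.scp line so the audio is speed-perturbed via sox."""
    utt_id, rest = line.split(None, 1)
    rest = rest.strip()
    sox_from_stdin = f"sox -t wav - -t wav - speed {factor} |"
    if rest.endswith("|"):
        # Already a piped command: append sox to the pipeline.
        return f"{utt_id} {rest} {sox_from_stdin}"
    if re.search(r":\d+$", rest):
        # File with a byte offset (e.g. archive.ark:1234): sox cannot open
        # this directly, so extract the wav with wav-copy first.
        return f"{utt_id} wav-copy {rest} - | {sox_from_stdin}"
    # Plain file: sox reads it directly.
    return f"{utt_id} sox -t wav {rest} -t wav - speed {factor} |"


if __name__ == "__main__":
    for entry in ["utt1 /data/a.wav", "utt2 wav.ark:1234", "utt3 cat /data/c.wav |"]:
        print(perturb_speed_entry(entry, 1.1))
```

The offset case is the one this commit adds; the other two show how the existing cases might look for comparison.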

Aug 30, 2016
- Merge pull request #1013 from kangshiyin/log-softmax · b2c8497b
  Daniel Povey authored 8 years ago
  
  Speed up log softmax
- comment about aliasing in AddMatMatDivMat. · 81e20c4c
  Shiyin Kang authored 8 years ago
  