Skip to content
Snippets Groups Projects
  1. Jul 16, 2016
  2. Jul 15, 2016
  3. Jul 14, 2016
  4. Jul 13, 2016
  5. Jul 10, 2016
  6. Jul 08, 2016
    • Daniel Povey's avatar
      Merge pull request #888 from kangshiyin/softmax · 3f6ae23f
      Daniel Povey authored
      Speed up softmax
      3f6ae23f
    • Daniel Povey's avatar
      Merge pull request #887 from scinart/master · cb9d279b
      Daniel Povey authored
      linux_x86_64_mkl.mk now show respect to --static-fst=yes
      cb9d279b
    • Shiyin Kang's avatar
      re-impl softmax: less __syncthreads() / arithmetic op / global mem access · 42352b63
      Shiyin Kang authored
      New: For CuMatrix::Softmax<float>, for dim = 16, speed was 0.0153621 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 16, speed was 0.0138999 gigaflops.
      New: For CuMatrix::Softmax<float>, for dim = 32, speed was 0.0614275 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 32, speed was 0.0507328 gigaflops.
      New: For CuMatrix::Softmax<float>, for dim = 64, speed was 0.235765 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 64, speed was 0.203548 gigaflops.
      New: For CuMatrix::Softmax<float>, for dim = 128, speed was 0.729239 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 128, speed was 0.725481 gigaflops.
      New: For CuMatrix::Softmax<float>, for dim = 256, speed was 2.30126 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 256, speed was 1.71863 gigaflops.
      New: For CuMatrix::Softmax<float>, for dim = 512, speed was 5.0565 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 512, speed was 3.69659 gigaflops.
      New: For CuMatrix::Softmax<float>, for dim = 1024, speed was 10.2482 gigaflops.
      Old: For CuMatrix::Softmax<float>, for dim = 1024, speed was 6.38335 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 16, speed was 0.0143354 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 16, speed was 0.013143 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 32, speed was 0.0590478 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 32, speed was 0.0495458 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 64, speed was 0.228611 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 64, speed was 0.193465 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 128, speed was 0.668961 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 128, speed was 0.676449 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 256, speed was 2.1013 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 256, speed was 1.51862 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 512, speed was 4.13055 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 512, speed was 3.1547 gigaflops.
      New: For CuMatrix::Softmax<double>, for dim = 1024, speed was 6.43429 gigaflops.
      Old: For CuMatrix::Softmax<double>, for dim = 1024, speed was 5.02974 gigaflops.
      
      minor changes
      42352b63
    • scinart's avatar
      60757995
  7. Jul 07, 2016
  8. Jul 06, 2016
  9. Jul 04, 2016
  10. Jul 01, 2016
  11. Jun 29, 2016
  12. Jun 28, 2016
  13. Jun 26, 2016
  14. Jun 25, 2016
    • Daniel Povey's avatar
      Merge pull request #800 from jfainberg/yesno_path · 3a984f99
      Daniel Povey authored
      yesno recipe data prep: paths to absolute + misc. 
      3a984f99
    • jfainberg's avatar
      Minor change to original to allow abs path · b6424584
      jfainberg authored
      b6424584
    • Shiyin Kang's avatar
      _diff_softmax kernel: 4 reads and 1 write. · 6b8eefbb
      Shiyin Kang authored
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 16, speed was 0.0165568 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 16, speed was 0.00355242 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 32, speed was 0.0678791 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 32, speed was 0.0145515 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 64, speed was 0.24739 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 64, speed was 0.0583246 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 128, speed was 0.898427 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 128, speed was 0.225076 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 256, speed was 2.89009 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 256, speed was 0.834096 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 512, speed was 6.72164 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 512, speed was 1.92722 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 1024, speed was 10.4916 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<float>, for dim = 1024, speed was 2.78281 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 16, speed was 0.0148584 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 16, speed was 0.00260567 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 32, speed was 0.0586865 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 32, speed was 0.0121077 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 64, speed was 0.22893 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 64, speed was 0.0527767 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 128, speed was 0.763462 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 128, speed was 0.175736 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 256, speed was 2.40457 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 256, speed was 0.58351 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 512, speed was 4.55165 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 512, speed was 1.42464 gigaflops.
      New: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 1024, speed was 4.36421 gigaflops.
      Old: For CuMatrix::DiffSoftmaxPerRow<double>, for dim = 1024, speed was 1.94971 gigaflops.
      6b8eefbb
    • Shiyin Kang's avatar
      add speed test and unit test · 619889a1
      Shiyin Kang authored
      619889a1
    • Shiyin Kang's avatar
      mv diffsoftmax to cumatrix · 69ccd5ce
      Shiyin Kang authored
      69ccd5ce
  15. Jun 23, 2016
Loading