- May 24, 2018
-
- May 22, 2018
-
-
- May 21, 2018
-
-
Myle Ott authored
-
- May 09, 2018
- May 01, 2018
- Apr 02, 2018
-
-
Myle Ott authored
Changes:
  - 7d19e36: Add `--sampling` flag to generate.py to sample instead of doing beam search
  - c777340: Add `scripts/average_checkpoints.py` to average multiple checkpoints into a combined model
  - 3ea882c: Add `--max-update` option to train.py to stop training after a given number of updates
  - small bugfixes for distributed training, LSTM, inverse square root LR scheduler
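To illustrate what averaging multiple checkpoints into a combined model can mean, here is a minimal sketch that takes the element-wise mean of each parameter across checkpoints. It is not the code in `scripts/average_checkpoints.py`: the function name `average_checkpoints`, the parameter names, and the use of plain lists of floats in place of tensors are all assumptions made for illustration.

```python
from collections import OrderedDict


def average_checkpoints(state_dicts):
    """Return the element-wise mean of each parameter across checkpoints.

    Hypothetical helper: each checkpoint is modelled as a dict mapping a
    parameter name to a flat list of floats (a stand-in for real tensors).
    """
    if not state_dicts:
        raise ValueError("need at least one checkpoint")

    keys = list(state_dicts[0].keys())
    for sd in state_dicts[1:]:
        # All checkpoints must describe the same set of parameters.
        if list(sd.keys()) != keys:
            raise KeyError("checkpoints have different parameter names")

    n = len(state_dicts)
    averaged = OrderedDict()
    for key in keys:
        # zip(*...) groups the i-th value of this parameter from every checkpoint.
        columns = zip(*(sd[key] for sd in state_dicts))
        averaged[key] = [sum(col) / n for col in columns]
    return averaged


# Two toy checkpoints with made-up parameter names.
ckpt_a = {"encoder.weight": [1.0, 2.0], "decoder.bias": [0.0]}
ckpt_b = {"encoder.weight": [3.0, 4.0], "decoder.bias": [1.0]}
combined = average_checkpoints([ckpt_a, ckpt_b])
print(combined)
```

With real checkpoints the same loop would run over tensors loaded from disk rather than Python lists.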
-
- Mar 28, 2018
-
-
Sergey Edunov authored
Update training commands
-
Runqi Yang authored
specify a single GPU setup for IWSLT14
-
Runqi Yang authored
Update training commands in data/README to match the latest version of this project according to #132. Continuation of 3c072958: add the omitted "\".
-
Runqi Yang authored
Update training commands in data/README to match the latest version of this project according to #132.
  - Motivation: the commands in the previous data/README were obsolete and caused the error "unrecognized arguments: --label-smoothing 0.1 --force-anneal 50".
  - What's changed: added the arguments "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" to the training commands of all 3 datasets.
  - Result: the new commands run without error on all 3 datasets.
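One plausible reason for an "unrecognized arguments" error like the one quoted above is that some options are only registered once a particular component has been selected. The toy parser below sketches that pattern with `argparse`. It is an assumption about the mechanism, not the project's actual parser: the program name `toy_train`, the helper names, the default values, and the `default` scheduler choice are hypothetical; only the flag names come from the commit message.

```python
import argparse


def build_parser(argv):
    """Toy two-pass parser: component-specific flags are added only when
    the matching component is selected (hypothetical, for illustration)."""
    parser = argparse.ArgumentParser(prog="toy_train")
    parser.add_argument(
        "--criterion",
        default="cross_entropy",
        choices=["cross_entropy", "label_smoothed_cross_entropy"],
    )
    parser.add_argument("--lr-scheduler", default="default", choices=["default", "fixed"])

    # First pass: read only the component choices and ignore everything else.
    known, _ = parser.parse_known_args(argv)

    # Second pass setup: register the flags owned by the chosen components.
    if known.criterion == "label_smoothed_cross_entropy":
        parser.add_argument("--label-smoothing", type=float, default=0.0)
    if known.lr_scheduler == "fixed":
        parser.add_argument("--force-anneal", type=int, default=None)
    return parser


def parse(argv):
    return build_parser(argv).parse_args(argv)


# With the components selected explicitly, the extra flags are accepted.
args = parse([
    "--criterion", "label_smoothed_cross_entropy",
    "--lr-scheduler", "fixed",
    "--label-smoothing", "0.1",
    "--force-anneal", "50",
])
print(args.label_smoothing, args.force_anneal)
```

In this sketch, calling `parse(["--label-smoothing", "0.1", "--force-anneal", "50"])` without the two selector flags makes `argparse` exit with an "unrecognized arguments" error, because neither option was ever registered.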
-
- Mar 27, 2018
-
-
杨润琦 authored
-
- Mar 26, 2018
-
-
Runqi Yang authored
Change "awailable" to "available".
-
- Mar 25, 2018
-
-
Runqi Yang authored
Change "awailable" to "available".
-
- Mar 07, 2018
-
-
Myle Ott authored
-
- Mar 05, 2018
-
-
Sergey Edunov authored
Oss merge internal
-
Sergey Edunov authored
  * Allow more flexible pre-processing and generation
  * Addressing CR comments
  * small fix
-
Myle Ott authored
-
Myle Ott authored
-
- Mar 02, 2018
-
-
James Reed authored
Remove custom ConvTBC code
-
- Mar 01, 2018
- Feb 27, 2018
-
-
Sergey Edunov authored
Oss merge internal changes
-
Sergey Edunov authored
  * Making our code compatible with the latest pytorch
  * revert
  * torch.nn.utils.clip_grad_norm now returns tensor
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Dario Pavllo authored
  * Add prefix
  * Fixes
  * Keep original scores with prefix
  * Improve prefix code
  * Replace 'repeat' with 'expand'
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
This PR includes breaking API changes to modularize fairseq-py and adds support for distributed training across multiple nodes.
Changes:
  - c7033ef: add support for distributed training! See updated README for usage.
  - e016299: modularize fairseq-py, adding support for register_model, register_criterion, register_optimizer, etc.
  - 154e440: update LSTM implementation to use PackedSequence objects in the encoder, better following best practices and improving perf
  - 90c2973 and 1da6265: improve unit test coverage
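The registration hooks named above suggest a decorator-based registry. The following is a minimal sketch of that general pattern, assuming a name-to-class dictionary. Only the name `register_model` comes from the commit message; its signature here, `MODEL_REGISTRY`, `build_model`, and `ToyLSTM` are hypothetical and may not match the project's API.

```python
# Hypothetical registry mapping a model name to the class that implements it.
MODEL_REGISTRY = {}


def register_model(name):
    """Decorator that records a class under `name` (assumed signature)."""

    def decorator(cls):
        if name in MODEL_REGISTRY:
            raise ValueError(f"cannot register duplicate model: {name}")
        MODEL_REGISTRY[name] = cls
        return cls  # return the class unchanged so it can still be used directly

    return decorator


@register_model("toy_lstm")
class ToyLSTM:
    """Placeholder model used only to demonstrate registration."""

    def __init__(self, hidden_size):
        self.hidden_size = hidden_size


def build_model(name, **kwargs):
    """Look up a registered class by name and instantiate it."""
    return MODEL_REGISTRY[name](**kwargs)


model = build_model("toy_lstm", hidden_size=8)
print(type(model).__name__, model.hidden_size)
```

The appeal of this design is that new models, criterions, or optimizers can be added in their own modules and selected by name, without editing a central list.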
-
- Feb 12, 2018
-
- Feb 09, 2018
-
-
Sergey Edunov authored
-
- Jan 31, 2018
-
-
Sergey Edunov authored
Prepare scripts for WMT14 (#88)
-
Sergey Edunov authored
-
Sergey Edunov authored
-