Commits · v0.4.0 · Simon Will / fairseq

May 24, 2018
- Merge internal changes (#163) · ec0031df
  Myle Ott authored May 24, 2018
  
  v0.4.0
  
  ec0031df
May 22, 2018
- Update dataset code for use by https://github.com/pytorch/translate/pull/62 (#161) · 29153e27
  theweiho authored May 22, 2018
  
  29153e27
May 21, 2018
- Fix old model checkpoints after #151 (fixes #156) (#157) · 3ae97589
  Myle Ott authored May 21, 2018
  
  3ae97589
May 09, 2018
- Flake8 · 4973d05a
  Myle Ott authored May 09, 2018
  
  4973d05a
- Add pretrained embedding support (#151) · e40363d7
  Sai authored May 09, 2018
  
  e40363d7
- use implicit padding when possible (#152) · 48c4c6d3
  ngimel authored May 09, 2018
  
  48c4c6d3
May 01, 2018
- Update README.md · 66ee3df9
  Myle Ott authored May 01, 2018
  
  66ee3df9
- Disallow --batch-size in interactive.py · 56099c74
  Myle Ott authored May 01, 2018
  
  56099c74
- make interactive mode print out alignment nicely · 6532e32b
  alexeib authored Apr 11, 2018
  
  6532e32b
Apr 02, 2018

Merge internal changes (#136) · d3795d6c

Myle Ott authored Apr 02, 2018

Changes:
- 7d19e36: Add `--sampling` flag to generate.py to sample instead of doing beam search
- c777340: Add `scripts/average_checkpoints.py` to average multiple checkpoints into a combined model
- 3ea882c: Add `--max-update` option to train.py to stop training after a given number of updates
- small bugfixes for distributed training, LSTM, inverse square root LR scheduler

d3795d6c

Mar 28, 2018

Merge pull request #134 from hitvoice/master · 48836525
Sergey Edunov authored Mar 28, 2018
```
Update training commands
```
48836525
Update training command for IWSLT14 · 0a141e3f
Runqi Yang authored Mar 29, 2018
```
specify a single GPU setup for IWSLT14
```
0a141e3f

Update training commands · 435ed351

Runqi Yang authored Mar 28, 2018

Update training commands in data/README to match the latest version of this project according to #132.

Continue from 3c072958: add omitted "\".

435ed351

Update training commands · 3c072958

Runqi Yang authored Mar 28, 2018

Update training commands in data/README to match the latest version of this project according to #132.

- Motivation: in the previous data/README, the commands are obsolete and will cause the error "unrecognized arguments: --label-smoothing 0.1 --force-anneal 50". 
- What's changed: add arguments "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" to the training commands of all 3 datasets.
- Result: the new commands run without error on all 3 datasets.

3c072958

Mar 27, 2018
- Merge remote-tracking branch 'upstream/master' · 4972056e
  杨润琦 authored Mar 28, 2018
  
  4972056e
Mar 26, 2018
- fix typo in data/README (#131) · 6268f20e
  Runqi Yang authored Mar 26, 2018
```
Change "awailable" to "available".
```
  6268f20e
Mar 25, 2018
- fix typo in data/README · 261d1822
  Runqi Yang authored Mar 25, 2018
```
Change "awailable" to "available".
```
  261d1822
Mar 07, 2018
- Enforce upper-bound on maximum generation length (#121) · 49aeab2d
  Myle Ott authored Mar 07, 2018
  
  49aeab2d
Mar 05, 2018
- Merge pull request #116 from facebookresearch/oss-merge-internal · cbaf59d4
  Sergey Edunov authored Mar 05, 2018
```
Oss merge internal
```
  cbaf59d4
- Allow more flexible pre-processing and generation (#227) · b03b53b4
  Sergey Edunov authored Mar 05, 2018
```
* Allow more flexible pre-processing and generation

* Addressing CR comments

* small fix
```
  b03b53b4
- Filter padding properly in LabelSmoothedCrossEntropyCriterion (#229) · e73fddf4
  Myle Ott authored Mar 04, 2018
  
  e73fddf4
- Small fixes · 5f29d123
  Myle Ott authored Mar 02, 2018
  
  5f29d123
Mar 02, 2018
- Use ATen built-in conv_tbc method (#66) · 56f9ec3c
  James Reed authored Mar 01, 2018
```
Remove custom ConvTBC code
```
  56f9ec3c
Mar 01, 2018
- More updates for PyTorch (#114) · 6e4d370a
  Myle Ott authored Mar 01, 2018
  
  6e4d370a
- More fixes for recent PyTorch (incl. topk issue) (#113) · 3bde773d
  Myle Ott authored Mar 01, 2018
  
  3bde773d
Feb 27, 2018
- Merge pull request #107 from facebookresearch/oss-merge-internal · 21b8fb5c
  Sergey Edunov authored Feb 27, 2018
```
Oss merge internal changes
```
  21b8fb5c
- Making our code compatible with the latest pytorch (#223) · 2f976aae
  Sergey Edunov authored Feb 27, 2018
```
* Making our code compatible with the latest pytorch

* revert

* torch.nn.utils.clip_grad_norm now returns tensor
```
  2f976aae
- Refactor incremental generation to be more explicit and less magical (#222) · 9438019f
  Myle Ott authored Feb 24, 2018
  
  9438019f
- Fix LabelSmoothedCrossEntropy test · e7094b14
  Myle Ott authored Feb 23, 2018
  
  e7094b14
- pytorch update: no need to rewrap variable in backward() · 78a6ef02
  Myle Ott authored Feb 23, 2018
  
  78a6ef02
- Add support to prefixes (#221) · 866b27d5
  Dario Pavllo authored Feb 23, 2018
```
* Add prefix

* Fixes

* Keep original scores with prefix

* Improve prefix code

* Replace 'repeat' with 'expand'
```
  866b27d5
- More unit test fixes · 0d90e35f
  Myle Ott authored Feb 15, 2018
  
  0d90e35f
- Fix tests and flake8 · 29c82741
  Myle Ott authored Feb 15, 2018
  
  29c82741
- Add OOM counter back to logging output · b9f2d427
  Myle Ott authored Feb 14, 2018
  
  b9f2d427
- fairseq-py goes distributed (#106) · 66415206
  Myle Ott authored Feb 27, 2018
```
This PR includes breaking API changes to modularize fairseq-py and adds support for distributed training across multiple nodes.

Changes:
- c7033ef: add support for distributed training! See updated README for usage.
- e016299: modularize fairseq-py, adding support for register_model, register_criterion, register_optimizer, etc.
- 154e440: update LSTM implementation to use PackedSequence objects in the encoder, better following best practices and improving perf
- 90c2973 and 1da6265: improve unit test coverage
```
  66415206
Feb 12, 2018
- Allow larger maxlen (fixes #100) (#101) · 7e86e30c
  Myle Ott authored Feb 12, 2018
  
  v0.3.0
  
  7e86e30c
Feb 09, 2018
- Adjust weight decay by the current learning rate to make it work correctly during annealing · 9a951216
  Sergey Edunov authored Feb 08, 2018
  
  9a951216
Jan 31, 2018
- Merge pull request #91 from facebookresearch/prepare_wmt · e4c935aa
  Sergey Edunov authored Jan 31, 2018
```
Prepare scripts for WMT14 (#88)
```
  e4c935aa
- spelling · 52b6119a
  Sergey Edunov authored Jan 31, 2018
  
  52b6119a
- Update README with new models · 2c18c273
  Sergey Edunov authored Jan 31, 2018
  
  2c18c273