Load an XLM model into the transformer encoder / decoder for MT training (#629)
Summary:
Pull Request resolved: https://github.com/pytorch/fairseq/pull/629

Also adds GeLU as an alternative to ReLU for the activation function in transformer layers.

Reviewed By: lematt1991

Differential Revision: D14689851

fbshipit-source-id: 7ec81fa34bc7bd0e1e43b337847ae932dcbf8b15
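As a rough illustration of the activation swap (a minimal sketch, not the code from this diff), selecting GeLU in place of ReLU for a transformer feed-forward block could look like the following; the helper names `gelu` and `get_activation_fn` are hypothetical:

```python
import math
import torch
import torch.nn.functional as F

def gelu(x: torch.Tensor) -> torch.Tensor:
    # Exact GeLU: x * Phi(x), where Phi is the standard normal CDF,
    # computed here via the error function.
    return x * 0.5 * (1.0 + torch.erf(x / math.sqrt(2.0)))

def get_activation_fn(name: str):
    # Pick the activation used inside the transformer FFN block,
    # so GeLU drops in wherever ReLU was previously called.
    if name == "relu":
        return F.relu
    elif name == "gelu":
        return gelu
    raise RuntimeError(f"unsupported activation: {name}")
```

GeLU can also be computed with a tanh-based approximation; either form is a drop-in replacement for ReLU, so the shape of the feed-forward block does not change. For the XLM side of the change, fairseq's usual pattern is a model architecture plus a command-line flag pointing at the pretrained checkpoint, but the precise flag names are defined in the diff itself.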