- May 04, 2020
  - Antonio Ruiz authored
  - Antonio Ruiz authored
- Apr 03, 2020
  - cariosr authored
- Apr 02, 2020
  - cariosr authored
    I set two terminal states: the EOS token (all zeros) and exceeding the maximum length (all -1s), and added a punishment to the last state's reward.
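The two special states and the end-of-episode punishment described above could be sketched as follows; the state size and penalty value are illustrative assumptions, not taken from the repository:

```python
import numpy as np

STATE_DIM = 8        # illustrative state size
LAST_STATE_PENALTY = -1.0   # illustrative punishment on the final reward

def terminal_state(reason, dim=STATE_DIM):
    """Return the special terminal state: zeros for EOS, -1s for overflow."""
    if reason == "eos":
        return np.zeros(dim)      # EOS token reached -> all-zero state
    if reason == "max_length":
        return -np.ones(dim)      # exceeded the max length -> all -1 state
    raise ValueError(reason)

def shaped_rewards(rewards):
    """Add the punishment to the reward of the episode's last state."""
    rewards = list(rewards)
    rewards[-1] += LAST_STATE_PENALTY
    return rewards
```

With this scheme the agent can distinguish a normal EOS termination from a forced cut-off at the length limit, while the penalty discourages ending an episode badly.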
  - cariosr authored (https://github.com/cariosr/States-Joeynmt)
    Included the mini_reverse_model and the changes provided by Antonio and Rene.
  - rbucchia authored
  - rbucchia authored
  - Antonio Ruiz authored
  - cariosr authored
    I modified the reward function to use different sacrebleu parameters; a preliminary test suggests the learning improves.
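The commit refers to sentence-level BLEU from the sacrebleu package (with varied smoothing parameters) as the reward signal. As a self-contained stand-in, here is a simplified smoothed BLEU-like score used to shape per-step rewards from partial hypotheses; the floor smoothing and bigram order are illustrative assumptions:

```python
from collections import Counter

def ngram_precision(hyp, ref, n):
    """Clipped n-gram precision with floor smoothing (a stand-in for
    sacrebleu's smooth_method='floor')."""
    hyp_ngrams = Counter(tuple(hyp[i:i + n]) for i in range(len(hyp) - n + 1))
    ref_ngrams = Counter(tuple(ref[i:i + n]) for i in range(len(ref) - n + 1))
    matches = sum(min(c, ref_ngrams[g]) for g, c in hyp_ngrams.items())
    total = max(sum(hyp_ngrams.values()), 1)
    return max(matches, 0.1) / total   # floor avoids zero precisions

def bleu_like(hyp, ref, max_n=2):
    """Geometric mean of smoothed n-gram precisions (no brevity penalty)."""
    score = 1.0
    for n in range(1, max_n + 1):
        score *= ngram_precision(hyp, ref, n)
    return score ** (1.0 / max_n)

def step_rewards(hyp, ref):
    """Per-token reward: the improvement of the partial-hypothesis score."""
    rewards, prev = [], 0.0
    for t in range(1, len(hyp) + 1):
        score = bleu_like(hyp[:t], ref)
        rewards.append(score - prev)
        prev = score
    return rewards
```

Because the rewards telescope, their sum equals the score of the full hypothesis, so the episode return stays aligned with the sentence-level metric.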
- Apr 01, 2020
  - cariosr authored
    Added the option to use attention as the state. It is already learning something, but very slowly.
  - rbucchia authored
  - cariosr authored
  - Carlos Rios authored
  - cariosr authored
- Mar 31, 2020
  - rios authored
  - cariosr authored
  - Carlos Rios authored
  - Carlos Rios authored
  - cariosr authored
- Mar 30, 2020
- Mar 29, 2020
- Mar 26, 2020
- Mar 25, 2020
- Mar 24, 2020
  - cariosr authored
- Mar 23, 2020
  - cariosr authored
    Included the complete Q-learning sequence, but the Q-net is still not working (scores are not improving). I wrote a TODO list in the README.
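A minimal sketch of the Q-learning update this sequence performs, with a linear Q-function standing in for the Q-net; vocabulary size, state size, and hyperparameters are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

VOCAB, STATE_DIM, GAMMA, LR = 5, 4, 0.95, 0.01

# Linear Q-function stand-in: Q(s) = s @ W, one value per target-token action.
W = rng.normal(scale=0.1, size=(STATE_DIM, VOCAB))

def q_update(s, a, r, s_next, done):
    """One Q-learning step: move Q(s, a) toward r + gamma * max_a' Q(s', a')."""
    global W
    target = r if done else r + GAMMA * (s_next @ W).max()
    td_error = target - (s @ W)[a]
    W[:, a] += LR * td_error * s    # gradient step for the linear Q-function
    return td_error
```

Repeating the update on the same transition should shrink the temporal-difference error; if scores do not improve in the real Q-net, this is the loop to instrument first.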
- Mar 22, 2020
- Mar 20, 2020
  - cariosr authored
    Coded a QManager class to initialize the model, data, and DQN parameters, all in DQN_loop.py; added the dqn_train option; and in the model, added the option to extract the attention vectors, to check whether they fit as states.
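A class bundling model, data, and DQN parameters could be sketched as below. The real QManager lives in DQN_loop.py; every field name and default here is an assumption for illustration, not the repository's actual interface:

```python
from dataclasses import dataclass

@dataclass
class QManager:
    """Illustrative manager bundling model, data, and DQN hyperparameters."""
    model_path: str                  # where the pretrained NMT model lives
    data_path: str                   # training/validation data location
    state_type: str = "attention"    # e.g. use attention vectors as states
    gamma: float = 0.95              # discount factor
    lr: float = 1e-3                 # learning rate
    epsilon: float = 0.1             # exploration rate
    batch_size: int = 32
    dqn_train: bool = True           # mirrors the dqn_train option

    def config(self):
        """Return all settings as a plain dict, e.g. for logging."""
        return {f: getattr(self, f) for f in self.__dataclass_fields__}
```

Keeping every knob in one dataclass makes the DQN loop reproducible: the whole run is described by one `config()` dict.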
- Mar 17, 2020
  - cariosr authored
- Mar 16, 2020