Commit 36be00a3 authored by cariosr

Set two states: the EOS token (zeros) and exceeding the max length (-1s). Also add a penalty on the last state to the reward.
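
The diff itself is not shown here, so the following is only a minimal sketch of what the message describes, not the code from this commit. It assumes states are fixed-size vectors, that the EOS state is all zeros and the max-length state is all -1s, and that the penalty is subtracted from the reward when the last state is the max-length marker. The names `eos_state`, `overflow_state`, `close_trajectory`, `reward`, and the `penalty` parameter are hypothetical.

```python
from typing import List

State = List[float]


def eos_state(dim: int) -> State:
    # Assumed encoding: the EOS token is a vector of zeros.
    return [0.0] * dim


def overflow_state(dim: int) -> State:
    # Assumed encoding: exceeding the max length is a vector of -1s.
    return [-1.0] * dim


def close_trajectory(states: List[State], max_len: int, dim: int) -> List[State]:
    """Return a new trajectory that ends in one of the two marker states."""
    if len(states) > max_len:
        # Too long: truncate and mark the end with the max-length state.
        return states[:max_len] + [overflow_state(dim)]
    # Within the limit: mark the end with the EOS state.
    return states + [eos_state(dim)]


def reward(base_reward: float, last_state: State, penalty: float = 1.0) -> float:
    """Subtract a penalty when the last state is the max-length marker (assumption)."""
    if all(x == -1.0 for x in last_state):
        return base_reward - penalty
    return base_reward
```

Under these assumptions, a trajectory that stays within `max_len` ends in zeros and keeps its base reward, while one that runs past `max_len` ends in -1s and has `penalty` subtracted.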
parent cea0559c