This merges the last changes from Carlos' and the true action implementation for the Bellman Equation.