Skip to content
Commit 88da9946 authored by Julia Kreutzer's avatar Julia Kreutzer
Browse files

remove keys from attention forward function since projection has to get pre-computed

parent df8916ea
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment