Updated 7 months ago

layer_norm_expressivity_role • Science 54%

Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)