Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In the context of attention, there is a very interesting recent paper that warns against conflating attention and token importance - "Is Attention Interpretable?" [1]. This is an accepted paper in ACL-2019:

[1] https://www.aclweb.org/anthology/P19-1282




See also Attention is Not Explanation [0].

[0] https://www.aclweb.org/anthology/N19-1357


There's a rebuttal to this as well: Attention is not not Explanation. https://arxiv.org/abs/1908.04626


Thanks!




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: