Positional Encoding How Does It Work

How large language models encode theory-of-mind: a study on sparse parameter patterns

This paper investigates the emergence of Theory-of-Mind (ToM) capabilities in large language models (LLMs) from a mechanistic perspective, focusing on the role of extremely sparse parameter patterns.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

How large language models encode theory-of-mind: a study on sparse parameter patterns

Trending now