This paper investigates the emergence of Theory-of-Mind (ToM) capabilities in large language models (LLMs) from a mechanistic perspective, focusing on the role of extremely sparse parameter patterns.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results