Reading List: Recent Advances in Machine Theory of Mind

This illustration is generated using DALL·E 3

Overview

Citation

This is a curated list of related literature and resources for machine theory of mind (ToM) research. Last Update: Nov 4th, 2023.

Note

In the next update, we will include additional links for code and data.

If you find our work useful, please give us credit by citing:

@inproceedings{ma2023towards,
  title={Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models},
  author={Ma, Ziqiao and Sansom, Jacob and Peng, Run and Chai, Joyce},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
  year={2023}
}

Contributors

Main Contributors: Martin Ziqiao Ma
Active Contributors: Jacob Sansom, Run Peng, Pony Zhang

How To Contribute

Welcome to contribute to our paper list or be a collaborator!

To add missing papers: Please create an issue or pull request, so the team can make the update.
To become a contributor: Please drop an email to Martin.

1. ToM Community Resources
2. Machine ToM Surveys and Position Papers
3. Cognitive Underpinnings of ToM
- 3.1 Definition and Importance of ToM in Human Cognition (Selected)
- 3.2 Taxonomies of ToM and Mental States
4. Computational Inquiry to ToM in Foundation Models
4. ToM Benchmarks and Platforms
5. Computational Modeling of ToM
6. ToM Application

1. ToM Community Resources

1.1 Workshops

(ToM 2024) 2nd Workshop on Theory-of-Mind @ ICLR 2024. [Web]
(ToM 2023) 1st Workshop on Theory-of-Mind @ ICML 2023. [Web]

1.2 Talks and Tutorials

To be updated

1.3 Tools

(ToM 2023) The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents. [Web]
(Preprint 2023) SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents. [Paper] [Web]

2. Machine ToM Surveys and Position Papers

(EMNLP Findings 2023) Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models. [Paper]
(Preprint 2023) A Review on Machine Theory of Mind. [Paper]
(EMNLP Findings 2022) Language Models as Agent Models. [Paper]
(RO-MAN 2022) Understanding Intention for Machine Theory of Mind: A Position Paper. [Paper]
(Psychological Medicine 2020) Knowing Me, Knowing you: Theory of Mind in AI. [Paper]
(Neuropsychologia 2020) Theory of Mind and Decision Science: Towards a Typology of Tasks and Computational Models. [Paper]
(AI 2018) Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems. [Paper]
(Preprint 2017) It Takes Two to Tango: Towards Theory of AI's Mind. [Paper]
(AI 2016) Integrating Social Power Into The Decision-making of Cognitive Agents. [Paper]

3. Cognitive Underpinnings of ToM

3.1 Definition and Importance of ToM in Human Cognition (Selected)

(Premack et al., 1978) Does the Chimpanzee Have a Theory of Mind? [Paper]
(Dennett, 1988) Précis of The Intentional Stance. [Paper]
(Gopnik et al., 1992) Why the Child's Theory of Mind Really Is a Theory. [Paper]
(Baron-Cohen, 1992) Mindblindness: An Essay on Autism and Theory of Mind. [Book]
(Blakemore et al,. 2001) From the Perception of Action to the Understanding of Intention. [Paper]
(Ho et al,. 2022) Planning With Theory Of Mind. [Paper]

3.2 Taxonomies of ToM and Mental States

(ToM 2023) EPITOME: Experimental Protocol Inventory for Theory Of Mind Evaluation. [Paper]
(Stack et al., 2022) Framework for a Multi-dimensional Test of Theory of Mind for Humans and AI Systems. [Paper]
(Osterhaus et al., 2022) Looking for the Lighthouse: A Systematic Review of Advanced Theory-of-mind Tests beyond Preschool. [Paper]
(Beaudoin et al., 2020) Systematic Review and Inventory of Theory of Mind Measures for Young Children. [Paper]

4. Computational Inquiry to ToM in Foundation Models

4.1 Probing Intrinsic Mental States

(EACL 2023) Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models. [Paper]
(EMNLP Findings 2021) Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding. [Paper]
(ACL 2021) Implicit Representations of Meaning in Neural Language Models. [Paper]

4.1 Evidence for Understanding Extrinsic Mental States

(Preprint 2023) Unveiling Theory of Mind in Large Language Models: A Parallel to Single Neurons in the Human Brain. [Paper]
(Preprint 2023) Sparks of Artificial General Intelligence: Early experiments with GPT-4. [Paper]
(Preprint 2023) Theory of Mind Might Have Spontaneously Emerged in Large Language Models. [Paper]
(EMNLP Findings 2021) Effectiveness of Pre-training for Few-shot Intent Classification. [Paper]

4.2 Counter-Evidence for Understanding Extrinsic Mental States

(EMNLP 2023) FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions. [Paper]
(Preprint 2023) Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. [Paper]
(Preprint 2023) Limitation of Theory of Mind In Large Language Model: Anthropomorphize Religous Figure. [Paper]
(Preprint 2023) Does ChatGPT have Theory of Mind? [Paper]
(Preprint 2023) Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks. [Paper]
(AI Review 2023) Mind the Gap: Challenges of Deep Learning Approaches to Theory of Mind. [Paper]
(Preprint 2022) Do Large Language Models Know what Humans Know? [Paper]
(Preprint 2022) Large Language Models Are Not Zero-shot Communicators. [Paper]
(EMNLP 2022) Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs. [Paper]

4. ToM Benchmarks and Platforms

A taxonomized review of existing benchmarks for machine ToM and their settings under ATOMS. We further break beliefs into first-order beliefs (1st) and second-order beliefs or beyond (2nd+); and break intentions into Action intentions and Communicative intentions. Tasks are divided into Inference, Question Answering, Natural Language Generation, MultiAgent Collaboration, and MultiAgent Competition. Input modalities consist of Text (Human, AI, or Template) and Nonlinguistic ones. The latter further breaks into Cartoon, Natural Images, Chess, 2D Grid World, and 3D Simulation. The Situatedness is divided into None, Passive Perceiver, and Active Interactor. Symmetricity refers to whether the tested agent is co-situated and engaged in mutual interactions with other ToM agents.

Benchmarks and Task Formulations	Tested Agent			Situatedness				ATOMS Mental States									Sym.
	Task	Input Modality		Physical		Social		Belief		Intention		Des.	Emo.	Know.	Per.	NLC
	Task	Text	Nonling.	Per.	Int.	Per.	Int.	1st	2nd+	Act.	Com.	Des.	Emo.	Know.	Per.	NLC
(Preprint 2021) Epistemic Reasoning	Infer	T	-	-	-	-	-	✔️	✔️	-	-	-	-	-	-	-	-
(EMNLP 2018) ToMi	QA	T	-	✔️	-	-	-	✔️	✔️	-	-	-	-	-	-	-	-
(EMNLP Findings 2023) Hi-ToM	QA	T	-	✔️	-	-	-	✔️	✔️	-	-	-	-	-	-	-	-
(EMNLP Findings 2023) MindGames	Infer	T	-	✔️	-	-	-	✔️	✔️	-	-	-	-	-	✔️	-	-
(ToM 2023) Selective Encoding	QA	T	-	✔️	-	-	-	-	-	✔️	-	✔️	-	-	-	-	-
(Preprint 2023) Adv-CSFB	QA	H	-	✔️	-	-	-	✔️	-	-	-	-	-	-	-	-	-
(EMNLP 2010) ConvEntail	Infer	H	-	-	-	✔️	-	✔️	-	-	✔️	✔️	-	-	-	-	-
(EMNLP 2019) SocialIQA	QA	H	-	-	-	✔️	-	-	-	✔️	-	-	✔️	-	-	-	-
(LREC 2022) BeSt	-	H	-	-	-	✔️	-	✔️	-	-	-	-	✔️	-	-	✔️	-
(ToM 2023) Loophole	NLG	H	-	-	-	✔️	-	-	-	-	-	-	-	-	-	✔️	-
(ACL Findings 2023) FauxPas-EAI	QA	H,AI	-	-	-	✔️	-	✔️	-	-	-	-	-	-	-	✔️	-
(Preprint 2023) COKE	NLG	AI	-	-	-	✔️	✔️	-	-	✔️	-	-	✔️	-	-	-	-
(Preprint 2022) ToM-in-AMC	Infer	H	-	✔️	-	✔️	-	-	-	✔️	✔️	-	-	-	-	-	-
(ACL 2023) G4C	NLG	H,AI	-	✔️	-	✔️	✔️	-	-	✔️	✔️	-	-	-	✔️	-	-
(Preprint 2016) VisualBeliefs	Infer	-	Cartoon	✔️	-	-	-	✔️	-	-	-	-	-	-	-	✔️	-
(AAAI 2016) Triangle COPA	QA	H	Cartoon	✔️	-	✔️	-	-	-	✔️	-	-	✔️	-	-	-	-
(NAACL 2022) MSED	Infer	H	Images	✔️	-	-	-	-	-	-	-	✔️	✔️	-	-	-	-
(NeurIPS 2021) BIB	Infer	-	2D Grid	✔️	-	-	-	-	-	✔️	-	✔️	-	-	-	-	-
(ICML 2021) AGENT	Infer	-	3D Sim.	✔️	-	-	-	-	-	✔️	-	✔️	-	-	✔️	-	-
(ToM 2023) RBC	Compete	-	Chess	✔️	-	-	-	-	-	-	-	-	-	✔️	-	-	-
(ICML 2018) MToM	Infer	-	2D Grid	✔️	-	-	-	✔️	-	✔️	-	-	-	-	-	-	-
(ICML 2022) SymmToM	Collab	-	2D Grid	✔️	✔️	✔️	✔️	-	-	-	-	-	-	✔️	-	-	✔️
(EMNLP 2023) Search & Rescue	Collab	AI	2D Grid	✔️	✔️	✔️	✔️	✔️	✔️	-	-	-	-	✔️	✔️	-	✔️
(EMNLP 2021) MindCraft	Infer	H	3D Sim.	✔️	✔️	✔️	✔️	-	-	✔️	-	-	-	✔️	✔️	-	✔️
(IJCAI 2023) CPA	Infer	H	3D Sim.	✔️	✔️	✔️	✔️	-	-	✔️	✔️	-	-	✔️	✔️	-	✔️
(EMNLP 2023) FANToM	QA	T	-	-	-	✔️	-	✔️	✔️	-	-	-	-	✔️	-	-	-

5. Computational Modeling of ToM

5.1 Learning Latent Representation for ToM

(IJCAI 2023) Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue. [Paper]
(EMNLP 2021) MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks. [Paper]
(RO-MAN 2021) Deep Interpretable Models of Theory of Mind. [Paper]
(EMNLP 2020) RMM: A Recursive Mental Model for Dialog Navigation. [Paper]
(ICML 2018) Machine Theory of Mind. [Paper]

5.2 Learning (Neural-)Symbolic Representation for ToM

(ACL 2023) Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker. [Paper]
(ToM 2023) The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs. [Paper]

5.3 Prompting and In-Context Learning for ToM in LLMs

(EMNLP 2023) Theory of Mind for Multi-Agent Collaboration via Large Language Models. [Paper]
(Preprint 2023) How FaR Are Large Language Models From Agents with Theory-of-Mind? [Paper]
(Preprint 2023) Violation of Expectation via Metacognitive Prompting Reduces Theory of Mind Prediction Error in Large Language Models. [Paper]
(Preprint 2023) CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society. [Paper]
(Preprint 2023) Boosting Theory-of-Mind Performance in Large Language Models via Prompting. [Paper]
(Preprint 2023) Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4. [Paper]

5.4 Bayesian and (Inverse) Reinforcement Learning Based ToM Modeling

(ToM 2023) Theory of Mind as Intrinsic Motivation for Multi-Agent Reinforcement Learning. [Paper]
(ToM 2023) Iterative Machine Teaching for Black-box Markov Learners. [Paper]
(ToM 2023) Between Prudence and Paranoia: Theory of Mind Gone Right, and Wrong. [Paper]
(ToM 2023) Emergent Deception and Skepticism via Theory of Mind. [Paper]
(ToM 2023) How To Make Social Decisions in a Heterogeneous Society? [Paper]
(ICML 2022) Symmetric Machine Theory of Mind. [Paper]
(ICML 2021) Few-shot Language Coordination by Modeling Theory of Mind. [Paper]
(CogSci 2020) Improving Multi-Agent Cooperation using Theory of Mind. [Paper]
(Preprint 2019) Modeling Theory of Mind in Multi-Agent Games Using Adaptive Feedback Control. [Paper]
(EmeComm 2019) Emergence of Theory of Mind Collaboration in Multiagent Systems. [Paper]
(Current Opinion in Behavioral Sciences 2019) Theory of Mind as Inverse Reinforcement Learning. [Paper]

5.5 Other ToM Modeling

(ToM 2023) Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective. [Paper]
(ToM 2023) Inferring the Future by Imagining the Past. [Paper]
(ToM 2023) Inferring the Goals of Communicating Agents from Actions and Instructions. [Paper]
(RSS 2015) Grounding English Commands to Reward Functions. [Paper]
(CogSci 2011) Bayesian Theory of Mind: Modeling Joint Belief-Desire Attribution. [Paper]

6. ToM Application

6.1 Pragmatics and Instruction Generation/Following

(ToM 2023) Towards a Better Rational Speech Act Framework for Context-aware Modeling of Metaphor Understanding. [Paper]
(ACL Findings 2023) Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models. [Paper]
(ACL 2022) Learning to Mediate Disparities Towards Pragmatic Communication. [Paper]
(ICML 2021) Few-shot Language Coordination by Modeling Theory of Mind. [Paper]
(Science 2012) Predicting Pragmatic Reasoning in Language Games. [Paper]

6.2 Dialogue Processing and Generation

(ToM 2023) MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Neural Dialogue Generation. [Paper]
(ACL Findings 2023) Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind. [Paper]
(SIGDIAL 2022) Towards Socially Intelligent Agents with Mental State Transition and Human Utility. [Paper]
(EMNLP 2020) RMM: A Recursive Mental Model for Dialog Navigation. [Paper]

6.3 Language Acquisition

(ICLR 2023) Computational Language Acquisition with Theory of Mind. [Paper]
(Preprint 2023) Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students via Theory of Mind. [Paper]

6.4 Human-AI Interactions

(ToM 2023) Preference Proxies: Evaluating Large Language Models in Capturing Human Preferences in Human-AI Tasks. [Paper]
(CHI 2021) Towards Mutual Theory of Mind in Human-AI Interaction: How Language Reflects What Students Perceive About a Virtual Teaching Assistant. [Paper]

6.5 Explainable AI

(iScience 2021) CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models. [Paper]

6.6 Healthcare

(ToM 2023) Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning. [Paper]

6.7 Privacy

(Preprint 2023) Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory. [Paper]

skywalker023 / awesome-theory-of-mind Goto Github PK

awesome-theory-of-mind's Introduction

Reading List: Recent Advances in Machine Theory of Mind

Overview

Citation

Contributors

How To Contribute

Table of Contents

1. ToM Community Resources

1.1 Workshops

1.2 Talks and Tutorials

1.3 Tools

2. Machine ToM Surveys and Position Papers

3. Cognitive Underpinnings of ToM

3.1 Definition and Importance of ToM in Human Cognition (Selected)

3.2 Taxonomies of ToM and Mental States

4. Computational Inquiry to ToM in Foundation Models

4.1 Probing Intrinsic Mental States

4.1 Evidence for Understanding Extrinsic Mental States

4.2 Counter-Evidence for Understanding Extrinsic Mental States

4. ToM Benchmarks and Platforms

5. Computational Modeling of ToM

5.1 Learning Latent Representation for ToM

5.2 Learning (Neural-)Symbolic Representation for ToM

5.3 Prompting and In-Context Learning for ToM in LLMs

5.4 Bayesian and (Inverse) Reinforcement Learning Based ToM Modeling

5.5 Other ToM Modeling

6. ToM Application

6.1 Pragmatics and Instruction Generation/Following

6.2 Dialogue Processing and Generation

6.3 Language Acquisition

6.4 Human-AI Interactions

6.5 Explainable AI

6.6 Healthcare

6.7 Privacy

awesome-theory-of-mind's People

Contributors

Watchers

Recommend Projects

Recommend Topics

Recommend Org