Reading List: Recent Advances in Machine Theory of Mind

Illustration (generated using DALL·E 3)

Overview

Citation

This is a curated list of related literature and resources for machine theory of mind (ToM) research. Last Update: Nov 4th, 2023.

Note

In the next update, we will include additional links for code and data.

If you find our work useful, please give us credit by citing:

@inproceedings{ma2023towards,
  title={Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models},
  author={Ma, Ziqiao and Sansom, Jacob and Peng, Run and Chai, Joyce},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
  year={2023}
}

Contributors

How To Contribute

Contributions to the paper list are welcome, and so are new collaborators!

  • To add missing papers: Please create an issue or pull request, so the team can make the update.
  • To become a contributor: Please drop an email to Martin.

Table of Contents

1. ToM Community Resources

1.1 Workshops

  • (ToM 2024) 2nd Workshop on Theory-of-Mind @ ICLR 2024. [Web]
  • (ToM 2023) 1st Workshop on Theory-of-Mind @ ICML 2023. [Web]

1.2 Talks and Tutorials

  • To be updated

1.3 Tools

  • (ToM 2023) The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents. [Web]
  • (Preprint 2023) SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents. [Paper] [Web]

2. Machine ToM Surveys and Position Papers

  • (EMNLP Findings 2023) Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models. [Paper]
  • (Preprint 2023) A Review on Machine Theory of Mind. [Paper]
  • (EMNLP Findings 2022) Language Models as Agent Models. [Paper]
  • (RO-MAN 2022) Understanding Intention for Machine Theory of Mind: A Position Paper. [Paper]
  • (Psychological Medicine 2020) Knowing Me, Knowing you: Theory of Mind in AI. [Paper]
  • (Neuropsychologia 2020) Theory of Mind and Decision Science: Towards a Typology of Tasks and Computational Models. [Paper]
  • (AI 2018) Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems. [Paper]
  • (Preprint 2017) It Takes Two to Tango: Towards Theory of AI's Mind. [Paper]
  • (AI 2016) Integrating Social Power Into The Decision-making of Cognitive Agents. [Paper]

3. Cognitive Underpinnings of ToM

3.1 Definition and Importance of ToM in Human Cognition (Selected)

  • (Premack et al., 1978) Does the Chimpanzee Have a Theory of Mind? [Paper]
  • (Dennett, 1988) Précis of The Intentional Stance. [Paper]
  • (Gopnik et al., 1992) Why the Child's Theory of Mind Really Is a Theory. [Paper]
  • (Baron-Cohen, 1995) Mindblindness: An Essay on Autism and Theory of Mind. [Book]
  • (Blakemore et al., 2001) From the Perception of Action to the Understanding of Intention. [Paper]
  • (Ho et al., 2022) Planning With Theory Of Mind. [Paper]

3.2 Taxonomies of ToM and Mental States

  • (ToM 2023) EPITOME: Experimental Protocol Inventory for Theory Of Mind Evaluation. [Paper]
  • (Stack et al., 2022) Framework for a Multi-dimensional Test of Theory of Mind for Humans and AI Systems. [Paper]
  • (Osterhaus et al., 2022) Looking for the Lighthouse: A Systematic Review of Advanced Theory-of-mind Tests beyond Preschool. [Paper]
  • (Beaudoin et al., 2020) Systematic Review and Inventory of Theory of Mind Measures for Young Children. [Paper]

4. Computational Inquiry to ToM in Foundation Models

4.1 Probing Intrinsic Mental States

  • (EACL 2023) Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models. [Paper]
  • (EMNLP Findings 2021) Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding. [Paper]
  • (ACL 2021) Implicit Representations of Meaning in Neural Language Models. [Paper]

4.2 Evidence for Understanding Extrinsic Mental States

  • (Preprint 2023) Unveiling Theory of Mind in Large Language Models: A Parallel to Single Neurons in the Human Brain. [Paper]
  • (Preprint 2023) Sparks of Artificial General Intelligence: Early experiments with GPT-4. [Paper]
  • (Preprint 2023) Theory of Mind Might Have Spontaneously Emerged in Large Language Models. [Paper]
  • (EMNLP Findings 2021) Effectiveness of Pre-training for Few-shot Intent Classification. [Paper]

4.3 Counter-Evidence for Understanding Extrinsic Mental States

  • (EMNLP 2023) FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions. [Paper]
  • (Preprint 2023) Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. [Paper]
  • (Preprint 2023) Limitation of Theory of Mind In Large Language Model: Anthropomorphize Religous Figure. [Paper]
  • (Preprint 2023) Does ChatGPT have Theory of Mind? [Paper]
  • (Preprint 2023) Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks. [Paper]
  • (AI Review 2023) Mind the Gap: Challenges of Deep Learning Approaches to Theory of Mind. [Paper]
  • (Preprint 2022) Do Large Language Models Know what Humans Know? [Paper]
  • (Preprint 2022) Large Language Models Are Not Zero-shot Communicators. [Paper]
  • (EMNLP 2022) Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs. [Paper]

4. ToM Benchmarks and Platforms

The table below categorizes existing machine ToM benchmarks by their settings and by the mental states they cover under ATOMS. Beliefs are split into first-order beliefs (1st) and second-order beliefs or beyond (2nd+); intentions are split into Action intentions (Act.) and Communicative intentions (Com.). Tasks are divided into Inference, Question Answering, Natural Language Generation, Multi-Agent Collaboration, and Multi-Agent Competition. Input modalities are either Text (written by a Human, generated by an AI, or filled from a Template) or Nonlinguistic; the latter is further divided into Cartoon, Natural Images, Chess, 2D Grid World, and 3D Simulation. Situatedness is divided into None, Passive Perceiver (Per.), and Active Interactor (Int.). Symmetricity (Sym.) indicates whether the tested agent is co-situated and engaged in mutual interactions with other ToM agents.

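Column groups: Tested Agent (Task; Input Modality: Text, Nonling.) · Situatedness (Physical: Per., Int.; Social: Per., Int.) · ATOMS Mental States (Belief: 1st, 2nd+; Intention: Act., Com.; Des.; Emo.; Know.; Per.; NLC) · Sym.

| Benchmarks and Task Formulations | Task | Text | Nonling. | Physical Per. | Physical Int. | Social Per. | Social Int. | Belief 1st | Belief 2nd+ | Intention Act. | Intention Com. | Des. | Emo. | Know. | Per. | NLC | Sym. |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (Preprint 2021) Epistemic Reasoning | Infer | T | - | - | - | - | - | ✔️ | ✔️ | - | - | - | - | - | - | - | - |
| (EMNLP 2018) ToMi | QA | T | - | ✔️ | - | - | - | ✔️ | ✔️ | - | - | - | - | - | - | - | - |
| (EMNLP Findings 2023) Hi-ToM | QA | T | - | ✔️ | - | - | - | ✔️ | ✔️ | - | - | - | - | - | - | - | - |
| (EMNLP Findings 2023) MindGames | Infer | T | - | ✔️ | - | - | - | ✔️ | ✔️ | - | - | - | - | - | ✔️ | - | - |
| (ToM 2023) Selective Encoding | QA | T | - | ✔️ | - | - | - | - | - | ✔️ | - | ✔️ | - | - | - | - | - |
| (Preprint 2023) Adv-CSFB | QA | H | - | ✔️ | - | - | - | ✔️ | - | - | - | - | - | - | - | - | - |
| (EMNLP 2010) ConvEntail | Infer | H | - | - | - | ✔️ | - | ✔️ | - | - | ✔️ | ✔️ | - | - | - | - | - |
| (EMNLP 2019) SocialIQA | QA | H | - | - | - | ✔️ | - | - | - | ✔️ | - | - | ✔️ | - | - | - | - |
| (LREC 2022) BeSt | - | H | - | - | - | ✔️ | - | ✔️ | - | - | - | - | ✔️ | - | - | ✔️ | - |
| (ToM 2023) Loophole | NLG | H | - | - | - | ✔️ | - | - | - | - | - | - | - | - | - | ✔️ | - |
| (ACL Findings 2023) FauxPas-EAI | QA | H,AI | - | - | - | ✔️ | - | ✔️ | - | - | - | - | - | - | - | ✔️ | - |
| (Preprint 2023) COKE | NLG | AI | - | - | - | ✔️ | ✔️ | - | - | ✔️ | - | - | ✔️ | - | - | - | - |
| (Preprint 2022) ToM-in-AMC | Infer | H | - | ✔️ | - | ✔️ | - | - | - | ✔️ | ✔️ | - | - | - | - | - | - |
| (ACL 2023) G4C | NLG | H,AI | - | ✔️ | - | ✔️ | ✔️ | - | - | ✔️ | ✔️ | - | - | - | ✔️ | - | - |
| (Preprint 2016) VisualBeliefs | Infer | - | Cartoon | ✔️ | - | - | - | ✔️ | - | - | - | - | - | - | - | ✔️ | - |
| (AAAI 2016) Triangle COPA | QA | H | Cartoon | ✔️ | - | ✔️ | - | - | - | ✔️ | - | - | ✔️ | - | - | - | - |
| (NAACL 2022) MSED | Infer | H | Images | ✔️ | - | - | - | - | - | - | - | ✔️ | ✔️ | - | - | - | - |
| (NeurIPS 2021) BIB | Infer | - | 2D Grid | ✔️ | - | - | - | - | - | ✔️ | - | ✔️ | - | - | - | - | - |
| (ICML 2021) AGENT | Infer | - | 3D Sim. | ✔️ | - | - | - | - | - | ✔️ | - | ✔️ | - | - | ✔️ | - | - |
| (ToM 2023) RBC | Compete | - | Chess | ✔️ | - | - | - | - | - | - | - | - | - | ✔️ | - | - | - |
| (ICML 2018) MToM | Infer | - | 2D Grid | ✔️ | - | - | - | ✔️ | - | ✔️ | - | - | - | - | - | - | - |
| (ICML 2022) SymmToM | Collab | - | 2D Grid | ✔️ | ✔️ | ✔️ | ✔️ | - | - | - | - | - | - | ✔️ | - | - | ✔️ |
| (EMNLP 2023) Search & Rescue | Collab | AI | 2D Grid | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | - | - | - | - | ✔️ | ✔️ | - | ✔️ |
| (EMNLP 2021) MindCraft | Infer | H | 3D Sim. | ✔️ | ✔️ | ✔️ | ✔️ | - | - | ✔️ | - | - | - | ✔️ | ✔️ | - | ✔️ |
| (IJCAI 2023) CPA | Infer | H | 3D Sim. | ✔️ | ✔️ | ✔️ | ✔️ | - | - | ✔️ | ✔️ | - | - | ✔️ | ✔️ | - | ✔️ |
| (EMNLP 2023) FANToM | QA | T | - | - | - | ✔️ | - | ✔️ | ✔️ | - | - | - | - | ✔️ | - | - | - |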
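
For readers who want to query this taxonomy programmatically, a minimal sketch follows. It assumes a hypothetical record layout: the `Benchmark` class, the field names, and string tags such as `"belief_2nd+"` or `"soc_per"` are illustrative and do not come from the repository. Only four rows are transcribed from the table above, as a demonstration.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Benchmark:
    """One table row. Field names and tags are hypothetical, not from the repo."""
    name: str
    venue: str
    task: str                      # Infer, QA, NLG, Collab, Compete, or "-"
    text: str                      # "T", "H", "AI", "H,AI", or "-"
    nonling: str                   # e.g. "Cartoon", "2D Grid", "3D Sim.", or "-"
    situatedness: FrozenSet[str]   # subset of phys_per, phys_int, soc_per, soc_int
    mental_states: FrozenSet[str]  # ATOMS columns that are ticked
    symmetric: bool = False


# Four rows copied from the table above (ticked columns only).
ROWS: List[Benchmark] = [
    Benchmark("ToMi", "EMNLP 2018", "QA", "T", "-",
              frozenset({"phys_per"}),
              frozenset({"belief_1st", "belief_2nd+"})),
    Benchmark("SocialIQA", "EMNLP 2019", "QA", "H", "-",
              frozenset({"soc_per"}),
              frozenset({"intention_act", "emo"})),
    Benchmark("MindCraft", "EMNLP 2021", "Infer", "H", "3D Sim.",
              frozenset({"phys_per", "phys_int", "soc_per", "soc_int"}),
              frozenset({"intention_act", "know", "per"}),
              symmetric=True),
    Benchmark("FANToM", "EMNLP 2023", "QA", "T", "-",
              frozenset({"soc_per"}),
              frozenset({"belief_1st", "belief_2nd+", "know"})),
]


def covering(rows: List[Benchmark], *states: str) -> List[str]:
    """Return names of benchmarks whose mental-state set includes all `states`."""
    wanted = set(states)
    return [r.name for r in rows if wanted <= r.mental_states]


if __name__ == "__main__":
    print(covering(ROWS, "belief_2nd+"))
    print([r.name for r in ROWS if r.symmetric])
```

Storing the ticked columns as sets keeps filters like "covers second-order belief and knowledge" to a single subset check; a wide boolean table would work equally well if the rows were loaded into a dataframe instead.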

5. Computational Modeling of ToM

5.1 Learning Latent Representation for ToM

  • (IJCAI 2023) Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue. [Paper]
  • (EMNLP 2021) MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks. [Paper]
  • (RO-MAN 2021) Deep Interpretable Models of Theory of Mind. [Paper]
  • (EMNLP 2020) RMM: A Recursive Mental Model for Dialog Navigation. [Paper]
  • (ICML 2018) Machine Theory of Mind. [Paper]

5.2 Learning (Neural-)Symbolic Representation for ToM

  • (ACL 2023) Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker. [Paper]
  • (ToM 2023) The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs. [Paper]

5.3 Prompting and In-Context Learning for ToM in LLMs

  • (EMNLP 2023) Theory of Mind for Multi-Agent Collaboration via Large Language Models. [Paper]
  • (Preprint 2023) How FaR Are Large Language Models From Agents with Theory-of-Mind? [Paper]
  • (Preprint 2023) Violation of Expectation via Metacognitive Prompting Reduces Theory of Mind Prediction Error in Large Language Models. [Paper]
  • (Preprint 2023) CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society. [Paper]
  • (Preprint 2023) Boosting Theory-of-Mind Performance in Large Language Models via Prompting. [Paper]
  • (Preprint 2023) Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4. [Paper]

5.4 Bayesian and (Inverse) Reinforcement Learning Based ToM Modeling

  • (ToM 2023) Theory of Mind as Intrinsic Motivation for Multi-Agent Reinforcement Learning. [Paper]
  • (ToM 2023) Iterative Machine Teaching for Black-box Markov Learners. [Paper]
  • (ToM 2023) Between Prudence and Paranoia: Theory of Mind Gone Right, and Wrong. [Paper]
  • (ToM 2023) Emergent Deception and Skepticism via Theory of Mind. [Paper]
  • (ToM 2023) How To Make Social Decisions in a Heterogeneous Society? [Paper]
  • (ICML 2022) Symmetric Machine Theory of Mind. [Paper]
  • (ICML 2021) Few-shot Language Coordination by Modeling Theory of Mind. [Paper]
  • (CogSci 2020) Improving Multi-Agent Cooperation using Theory of Mind. [Paper]
  • (Preprint 2019) Modeling Theory of Mind in Multi-Agent Games Using Adaptive Feedback Control. [Paper]
  • (EmeComm 2019) Emergence of Theory of Mind Collaboration in Multiagent Systems. [Paper]
  • (Current Opinion in Behavioral Sciences 2019) Theory of Mind as Inverse Reinforcement Learning. [Paper]

5.5 Other ToM Modeling

  • (ToM 2023) Language Models are Bounded Pragmatic Speakers: Understanding RLHF from a Bayesian Cognitive Modeling Perspective. [Paper]
  • (ToM 2023) Inferring the Future by Imagining the Past. [Paper]
  • (ToM 2023) Inferring the Goals of Communicating Agents from Actions and Instructions. [Paper]
  • (RSS 2015) Grounding English Commands to Reward Functions. [Paper]
  • (CogSci 2011) Bayesian Theory of Mind: Modeling Joint Belief-Desire Attribution. [Paper]

6. ToM Application

6.1 Pragmatics and Instruction Generation/Following

  • (ToM 2023) Towards a Better Rational Speech Act Framework for Context-aware Modeling of Metaphor Understanding. [Paper]
  • (ACL Findings 2023) Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models. [Paper]
  • (ACL 2022) Learning to Mediate Disparities Towards Pragmatic Communication. [Paper]
  • (ICML 2021) Few-shot Language Coordination by Modeling Theory of Mind. [Paper]
  • (Science 2012) Predicting Pragmatic Reasoning in Language Games. [Paper]

6.2 Dialogue Processing and Generation

  • (ToM 2023) MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Neural Dialogue Generation. [Paper]
  • (ACL Findings 2023) Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind. [Paper]
  • (SIGDIAL 2022) Towards Socially Intelligent Agents with Mental State Transition and Human Utility. [Paper]
  • (EMNLP 2020) RMM: A Recursive Mental Model for Dialog Navigation. [Paper]

6.3 Language Acquisition

  • (ICLR 2023) Computational Language Acquisition with Theory of Mind. [Paper]
  • (Preprint 2023) Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students via Theory of Mind. [Paper]

6.4 Human-AI Interactions

  • (ToM 2023) Preference Proxies: Evaluating Large Language Models in Capturing Human Preferences in Human-AI Tasks. [Paper]
  • (CHI 2021) Towards Mutual Theory of Mind in Human-AI Interaction: How Language Reflects What Students Perceive About a Virtual Teaching Assistant. [Paper]

6.5 Explainable AI

  • (iScience 2021) CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models. [Paper]

6.6 Healthcare

  • (ToM 2023) Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning. [Paper]

6.7 Privacy

  • (Preprint 2023) Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory. [Paper]

Contributors

mars-tin, skywalker023, roihn
