WikiWhy is a new benchmark for evaluating LLMs' ability to explain cause-effect relationships. It is a QA dataset containing 9000+ "why" question-answer-rationale triplets.
First of all, congratulations on your paper being accepted, and thank you for open-sourcing your code.
I would like to ask about the example in Figure 8 of the appendix: why does the annotator need to choose one of the four questions to explain, and where do these four options come from?