BAbI: A Test of Commonsense Ability

The BAbI benchmark presents a difficult set of tasks designed to evaluate the abilities of AI systems in interpreting commonsense knowledge. It includes a wide range of cases that require logic about everyday notions. By evaluating how well AI models can address these problems, researchers aim to better understand the character of commonsense reaso

read more