The researchers started with GSM8K's standardized set of more than 8,000 grade-school-level mathematics word problems ... And when an AI cannot perform simple math because the words are essentially ...