LLM math: test suite has ~90 tests that fail one of the fixes addre...