Sitemap

Cantor’s Paradise

Medium’s #1 Math Publication

GPT-4 is Amazing but Still Struggles at High School Math Competitions

Its greatest strength is its biggest flaw

7 min readMar 24, 2023

--

Press enter or click to view image in full size
Text GPT-4 overlaying image of GPT-4 performing mathematical calculations
Image created by author with screenshots from chat.openai.com

GPT-4 is the latest version of the GPT architecture developed by OpenAI, released on 14 March 2023.

We are told on the landing page that GPT-4 “surpasses ChatGPT in its advanced reasoning capabilities”, and presented with some impressive results for the USA Bar Exam and Biology Olympiad.

Press enter or click to view image in full size
GPT-4 scored in the top 10% on the US Bar Exam and top 1% on the USA Biology Olympiad
Screenshot (by author) from openai.com

The GPT-4 Technical Report contains many other simulated exams used to test the reasoning and problem solving ability of GPT-4. When it comes to Mathematics, GPT-4 ranked in the top 11% of scores on the SAT Math Test (a significant improvement from GPT-3.5).

Press enter or click to view image in full size

But if you’ve ever sat the SAT, or ever seen the SAT, you’ll know the SAT Math questions are quite predictable and formulaic. Most of them just require the application of a pre-defined set of skills — solving equations, trigonometry laws, index laws, etc.

--

--

Russell Lim
Russell Lim

Written by Russell Lim

I teach high school mathematics in Melbourne, Australia. I like thinking about interesting problems and learning new things.

Responses (4)