
Can ChatGPT reason mathematically?

AI researchers at Apple just released a new paper that breaks down the limitations of Large Language Models (LLMs) like ChatGPT when they reason mathematically.
The paper: arxiv.org/pdf/2410.05229
AI researchers analyze the mathematical reasoning ability of LLMs using a benchmark of math word problems called GSM8K, short for Grade School Math 8K. The Apple researchers modified this benchmark in three ways: first by changing names and numbers, second by making problems longer with more clauses at the same reasoning level, and third by adding irrelevant clauses that should be ignored. Each change hurt performance to some degree, but the irrelevant clauses were the most challenging, even for top models like OpenAI's o1-mini and o1-preview.
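To make those perturbations concrete, here is a minimal Python sketch of the general idea: turn a word problem into a template, resample the name and numbers, and optionally splice in a clause that has no bearing on the answer. The template, the name list, the distractor sentence, and the `make_variant` helper are all hypothetical illustrations and are not taken from the paper's dataset or code.

```python
import random

# Hypothetical template for illustration only; not from the paper's dataset.
TEMPLATE = (
    "{name} picks {a} apples on Monday and {b} apples on Tuesday. "
    "{distractor}How many apples does {name} have in total?"
)

# Assumed name pool and irrelevant clause, chosen for this sketch.
NAMES = ["Sophie", "Liam", "Aisha", "Mateo"]
DISTRACTOR = "Five of the apples are a bit smaller than average. "


def make_variant(rng, with_distractor=False):
    """Return (question, answer) with a freshly sampled name and numbers."""
    name = rng.choice(NAMES)
    a = rng.randint(2, 50)
    b = rng.randint(2, 50)
    question = TEMPLATE.format(
        name=name,
        a=a,
        b=b,
        distractor=DISTRACTOR if with_distractor else "",
    )
    # The irrelevant clause never changes the ground-truth answer.
    return question, a + b


if __name__ == "__main__":
    plain, answer = make_variant(random.Random(0))
    noisy, same_answer = make_variant(random.Random(0), with_distractor=True)
    print(plain, "->", answer)
    print(noisy, "->", same_answer)
```

A solver that truly reasons about the problem should give the same answer to both printed variants, since the extra sentence changes nothing about the total.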

The video of grade 8 students:    • How Old Is The Shepherd?  
Twitter thread from Mehrdad Farajtabar, one of the authors: x.com/MFarajtabar/status/1844456880971858028

BECOME A MEMBER:
►Join: youtube.com/channel/UC9rTsvTxJnx1DNrDA3Rqa6A/join

MATH BOOKS I LOVE (affiliate link):
www.amazon.com/shop/treforbazett

COURSE PLAYLISTS:
►DISCRETE MATH:    • Discrete Math (Full Course: Sets, Log...  
►LINEAR ALGEBRA:    • Linear Algebra (Full Course)  
►CALCULUS I:    • Calculus I (Limits, Derivative, Integ...  
►CALCULUS II:    • Calculus II (Integration Methods, Ser...  
►MULTIVARIABLE CALCULUS (Calc III):    • Calculus III: Multivariable Calculus ...  
►VECTOR CALCULUS (Calc IV):    • Calculus IV: Vector Calculus (Line In...  
►DIFFERENTIAL EQUATIONS:    • Ordinary Differential Equations (ODEs)  
►LAPLACE TRANSFORM:    • Laplace Transforms and Solving ODEs  
►GAME THEORY:    • Game Theory  

OTHER PLAYLISTS:
►Learning Math Series:
   • 5 Tips To Make Math Practice Problems...  
►Cool Math Series:
   • Cool Math Series  

SOCIALS:
►X/Twitter: X.com/treforbazett
►TikTok: tiktok.com/@drtrefor
►Instagram (photography based): instagram.com/treforphotography
