Abstract: Recently, researchers in the field of math word problem (MWP) solving have reported performance metrics for various large language models (LLMs) on benchmark datasets, with some models ...
GSM8K-V is a purely visual multi-image mathematical reasoning benchmark that systematically maps each GSM8K math word problem into its visual counterpart to enable a clean, within-item comparison ...
Pre-training Large Language Models (LLMs) on high-quality, meticulously curated datasets is widely recognized as critical for enhancing their performance and generalization capabilities. This study ...
You’d be surprised how many young people can’t read this. One of its conclusions tells the sad tale. “Between 2020 and 2025, the number of students whose math skills fall below high school level has ...
Missing Condition: Natalia sold 48 clips in April and half as many clips in May. How many clips did Natalia sell altogether in April and June? We don't know anything about June, so it's impossible to ...
President Donald Trump was expected to sign a “Ways and Means Bill,” focused on “protecting taxpayer rights” by requiring the IRS to “show its math when changing returns,” according to a news release ...
Abstract: Vision-based sign language recognition aims at helping deaf people to communicate with others. However, most existing sign language datasets are limited to a small number of words. Due ...