Caymans Post

A world within. A state apart.
Tuesday, May 30, 2023

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Researchers found students to have fared better at accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers from Brigham Young University (BYU), US, and 186 other universities wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

ChatGPT scored higher than the student average on 11.3 per cent of the questions, doing particularly well on accounting information systems (AIS) and auditing, but it performed worse on tax, financial, and managerial assessments. The researchers think this could be because ChatGPT struggled with the mathematical processes required for the latter type.

The AI bot, which uses machine learning to generate natural language text, was further found to do better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, ChatGPT sometimes provided authoritative written descriptions for incorrect answers, or answered the same question in different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot was also seen to make nonsensical mathematical errors, such as adding two numbers in a subtraction problem or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).