AI Beats Humans in Stanford Reading Comprehension Test
The robots are coming, and they can read.
Its natural-language processing AI scored higher than humans
Artificial intelligence programs built by Alibaba () and Microsoft ( ) have beaten humans on a Stanford University reading comprehension test.
“This is the first time that a machine has outperformed humans on such a test,” Alibaba said in a statement Monday.
The test was devised by artificial intelligence experts at Stanford to measure computers’ growing reading abilities. Alibaba’s software was the first to beat the human score.
Alibaba has developed an artificial intelligence model that scored better than humans in a Stanford University reading and comprehension test.
Alibaba Group Holding Ltd. put its deep neural network model through its paces last week, asking the AI to provide exact answers to more than 100,000 questions comprising a quiz that’s considered one of the world’s most authoritative machine-reading gauges. The model developed by Alibaba’s Institute of Data Science of Technologies scored 82.44, edging past the 82.304 that rival humans achieved.
Alibaba said it’s the first time a machine has out-done a real person in such a contest. Microsoft achieved a similar feat, scoring 82.650 on the same test, but those results were finalized a day after Alibaba’s, the company said.
The Chinese e-commerce titan has joined the likes of Tencent Holdings Ltd. and Baidu Inc. in a race to develop AI that can enrich social media feeds, target ads and services or even aid in autonomous driving. Beijing has endorsed the technology in a national-level plan that calls for the country to become the industry leader 2030.
So-called natural language processing mimics human comprehension of words and sentences. Based on more than 500 Wikipedia articles, Stanford’s set of questions are designed to tease out whether machine-learning models can process large amounts of information before supplying precise answers to queries.
“That means objective questions such as ‘what causes rain’ can now be answered with high accuracy by machines,” Luo Si, chief scientist for natural language processing at the Alibaba institute, said in a statement. “The technology underneath can be gradually applied to numerous applications such as customer service, museum tutorials and online responses to medical inquiries from patients, decreasing the need for human input in an unprecedented way.”