OpisPerformance of AI models on various benchmarks from 1998 to 2024.png |
English: Figure 2. Performance of AI models on various benchmarks from 1998 to 2024, including computer vision (MNIST, ImageNet), speech recognition (Switchboard), natural language understanding (SQuAD 1.1, MMLU, GLUE), general language model evaluation (MMLU, Big-Bench, and GPQA), and mathematical reasoning (MATH). Many models surpass human-level performance (black solid line) by 2024, demonstrating significant advancements in AI capabilities across different domains over the past two decades. Data are from (94) for MNSIT, Switchboard, ImageNet, SQuAD 1.1, 2 and GLUE. Data for MMLU, Big Bench, GPQA are from the relevant papers (95, 96, 97). |
Autor |
CHAIR
Prof. Yoshua Bengio, Université de Montréal / Mila - Quebec AI Institute
EXPERT ADVISORY PANEL
Prof. Bronwyn Fox, The Commonwealth
Scientific and Industrial Research Organisation
(CSIRO) (Australia)
André Carlos Ponce de Leon Ferreira de
Carvalho, Institute of Mathematics and
Computer Sciences, University of São Paulo
(Brazil)
Dr. Mona Nemer, Chief Science Advisor of
Canada (Canada)
Raquel Pezoa Rivera, Federico Santa María
Technical University (Chile)
Dr. Yi Zeng, Institute of Automation, Chinese
Academy of Sciences (China)
Juha Heikkilä, DG Connect (European Union)
Guillaume Avrin, General Directorate of
Enterprises (France)
Prof. Antonio Krüger, German Research
Center for Artificial Intelligence (Germany)
Prof. Balaraman Ravindran, Indian Institute of
Technology, Madras (India)
Prof. Hammam Riza, KORIKA (Indonesia)
Dr. Ciarán Seoighe, Science Foundation
Ireland (Ireland)
Dr. Ziv Katzir, Israel Innovation Authority
(Israel)
Dr. Andrea Monti, University of Chieti-Pescara
(Italy)
Dr. Hiroaki Kitano, Sony Group (Japan)
[Interim] Mary Kerema, Ministry of Information
Communications Technology and Digital
Economy (Kenya)
Dr. José Ramón López Portillo, Q Element
(Mexico)
Prof. Haroon Sheikh, Netherlands’ Scientific
Council for Government Policy (Netherlands)
Dr. Gill Jolly, Ministry of Business, Innovation
and Employment (New Zealand)
Dr. Olubunmi Ajala, Innovation and Digital
Economy (Nigeria)
Dominic Ligot, CirroLytix (Philippines)
Prof. Kyoung Mu Lee, Department of Electrical
and Computer Engineering, Seoul National
University (Republic of Korea)
Ahmet Halit Hatip, Turkish Ministry of Industry
and Technology (Republic of Turkey)
Crystal Rugege, National Center for AI and
Innovation Policy (Rwanda)
Dr. Fahad Albalawi, Saudi Authority for Data
and Artificial Intelligence (Kingdom of Saudi
Arabia)
Denise Wong, Data Innovation and Protection
Group, Infocomm Media Development
Authority (IMDA) (Singapore)
Dr. Nuria Oliver, ELLIS Alicante (Spain)
Dr. Christian Busch, Federal Department of
Economic Affairs, Education and Research
(Switzerland)
Oleksii Molchanovskyi, Expert Committee on
the Development of Artificial intelligence in
Ukraine (Ukraine)
Marwan Alserkal, Ministry of Cabinet Affairs,
Prime Minister’s Office (United Arab Emirates)
Saif M. Khan, U.S. Department of Commerce
(United States)
Dame Angela McLean, Government Chief
Scientific Adviser (United Kingdom)
Amandeep Gill, UN Tech Envoy (United
Nations) |