By using this site, you agree privacy policies
Accept
Geek RoomGeek RoomGeek Room
  • Home
  • Tech
    TechShow More
    Split Technology Park welcomes first tenants: 26 MPSs and 6 startups
    October 31, 2024
    INNVEST Summit 2024: A premier event for innovation and economic competitiveness in the Western Balkans
    October 31, 2024
    Diaspora 4 Innovation: Kick-off event launches a new era for Albanian higher education
    October 31, 2024
    AI for good: Generative AI – Tirana chapter empowers Albanian Youth in tech innovation
    October 29, 2024
    Business Angel Summit 2024: Pioneering Investment and Startup Growth in Sarajevo
    October 29, 2024
  • Mobile
    MobileShow More
    Xiaomi 15 and 15 Pro set to launch on October 29: Official renders released
    October 24, 2024
    Dangerous virus infects millions of mobile phones through popular apps
    October 3, 2024
    The new iPhone 16 arrives in Croatia with a steep price tag
    September 26, 2024
    Beware of these phone numbers: Block them immediately to avoid scams
    September 11, 2024
    Beyond the brand: What really matters when buying a mobile phone
    September 5, 2024
  • Apps
    AppsShow More
    Shoppable widget by EmbedSocial: Revolutionizing E-commerce with authentic shopper content
    October 31, 2024
    Intel prevails in long-running legal battle against €1 billion EU fine
    October 31, 2024
    New definition of open source artificial intelligence released by OSI
    October 29, 2024
    CaSys introduces “Pay by Link” payment service for SMEs in Macedonia
    October 24, 2024
    Kickstarter surpasses $8 billion in donations across all projects
    October 17, 2024
  • Science
    ScienceShow More
    Sofia Tech Park: A thriving innovation hub for Southeast Europe
    October 29, 2024
    Breakthrough in prostate cancer treatment: Croatian scientists develop Vini, a tool to predict effective drug combinations
    October 24, 2024
    Digital Realty partners with Ecolab to pilot AI-powered water conservation solution
    October 24, 2024
    Sofia Tech Park to host the Southeast European Innovators Challenge Conference
    October 11, 2024
    ACG accelerates European growth with major expansion in Croatia
    October 9, 2024
  • Gaming
    GamingShow More
    “Windblown” – The new game from the creators of Dead Cells
    October 24, 2024
    Kraken Empire’s Journey and the creative brilliance of Toy Tactics
    October 21, 2024
    Serbian game studio Tricoman set to make a mark with their new RPG ‘Godforged’ on Steam
    October 16, 2024
    Release the demon with Kill Knight: A phenomenal combat experience with untapped potential
    October 14, 2024
    Nordeus launches new football game “Top Goal: Football Champion” in Serbia
    October 9, 2024
  • Cars
    CarsShow More
    Serbia signs strategic agreement with Hyundai Engineering for 1 GW of Solar Power
    October 16, 2024
    Stara Zagora: Poised to lead Bulgaria’s automotive revolution
    October 15, 2024
    Dacia unveils new Bigster: The flagship model for the C-SUV segment
    October 9, 2024
    Kineton Albania: Pioneering innovation in the automotive industry
    October 8, 2024
    Albania’s vehicle numbers surge in 2024: 73% of registered cars are over 15 years old
    August 20, 2024
  • Entertainment
    EntertainmentShow More
    Where are Generation Z’s famous tech entrepreneurs?
    October 29, 2024
    AllWeb offers special discounts for startups: A unique opportunity for networking and growth
    October 23, 2024
    Montenegro census reveals no ethnic majority, Montenegrins and Serbs nearly equal
    October 16, 2024
    “Primordial Passion” is the first luxury Albanian watch valued at €1.4 million by Argjendari Pirro
    October 15, 2024
    Albania takes the stage at BIG event Paris: Culture and innovation as economic drivers
    October 12, 2024
Search
Reading: New study proves AI has surpassed humans in almost all fields
Notification Show More
Aa
Geek RoomGeek Room
Aa
  • Tech
  • Mobile
  • Apps
  • Science
  • Gaming
  • Cars
  • Entertainment
Search
  • Home
  • Tech
  • Mobile
  • Apps
  • Science
  • Gaming
  • Cars
  • Entertainment
Geek Room > Blog > Tech > New study proves AI has surpassed humans in almost all fields
Tech

New study proves AI has surpassed humans in almost all fields

Last updated: 2024/04/22 at 11:19 AM
Share
5 Min Read

Take a moment to reflect on the AI advancements of the past two years as a whole. The pace at which AI is nearing human capabilities in various domains is astounding, calling for new benchmarks to assess its abilities.

The latest edition of the AI Index report from Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) is now out. This year’s report, more extensive than ever, provides a broad analysis of AI’s integration into our lives, from industry usage trends to international concerns over job displacement due to AI technologies. A key finding of the report is the level at which AI competes with human performance.

AI has already surpassed many human performance benchmarks

For those who haven’t been closely tracking AI’s progress, the strides it has made are quite startling. Starting with outperforming humans in image classification in 2015, AI quickly moved on to surpass us in basic reading comprehension by 2017, visual reasoning by 2020, and natural language inference by 2021. The rapid advancement of AI has rendered many older benchmarks inadequate. This has led to a rush among researchers to create new, tougher benchmarks that not only test AI’s competencies but also distinguish between what AI can do and where humans still excel.

Despite using these perhaps outdated benchmarks, the trends outlined in the report are unequivocal: The steep inclines in recent performance trajectories indicate just how quickly AI is evolving. Consider that these technologies are still in their infancy. According to the 2023 AI Index report, AI faces challenges with complex cognitive tasks such as solving advanced mathematics problems and visual commonsense reasoning. Yet, calling these challenges ‘struggles’ might not be entirely accurate given the significant improvements noted.

A sample question used to test an AI’s visual commonsense reasoning

On the MATH dataset, consisting of 12,500 high-level math problems, AI’s performance has surged. From a mere 6.9% solution rate in 2021, a GPT-4-based model managed to solve 84.3% of these problems by 2023, approaching the human baseline of 90%. Consider visual commonsense reasoning (VCR). This goes beyond mere object recognition, testing how AI can apply everyday knowledge in visual scenarios to predict outcomes. From 2022 to 2023, AI’s VCR scores rose by 7.93% to reach 81.60, nearing the human baseline of 85.

Rewind just five years, and the idea of expecting a computer to “understand” a visual context in this manner would have seemed far-fetched.

AI has also made significant inroads in generating written content for various professions, though large language models (LLMs) often produce what is euphemistically termed ‘hallucinations’—misleading or incorrect information presented as facts. This issue was highlighted last year when lawyer Steven Schwartz, who used ChatGPT for legal research without verifying its accuracy, was fined $5,000 by a judge for submitting a court document containing fabricated legal cases.

To assess the prevalence of hallucinations in LLMs, the HaluEval benchmark was employed, revealing that this remains a notable challenge.

How text-to-image generation has improved with progressive versions of Midjourney

Moreover, in assessing LLMs’ ability to provide truthful information, the TruthfulQA benchmark used questions on topics such as health, law, finance, and politics to probe common misconceptions. Here, GPT-4’s performance in early 2024 scored 0.59, almost tripling the score of earlier models like GPT-2 tested in 2021, indicating progressive improvement in accuracy.

In the realm of AI-generated images, consider Midjourney’s depiction of Harry Potter over 22 months, reflecting rapid advances in text-to-image generation.In the Holistic Evaluation of Text-to-Image Models (HEIM), various LLMs were assessed for their ability to generate images aligning with text prompts. Among these, OpenAI’s DALL-E 2 was notable for its alignment of images to text, while the Stable Diffusion-based Dreamlike Photoreal model stood out for image quality, aesthetics, and originality.

this is the most interesting year in human history, except for all future years

— Sam Altman (@sama) March 17, 2024

Last year—2023—was a monumental year for AI, and 2024 has only added to the excitement with groundbreaking developments like Suno, Sora, Google Genie, Claude 3, Channel 1, and Devin, along with the looming potential of GPT-5.

AI’s trajectory of rapid development is not slowing down, making it an ever more integral part of our technological landscape. Stay tuned for more insights as we delve further into AI’s impact on global perceptions regarding its safety, trustworthiness, and ethics in the second instalment of our coverage on this topic.

You Might Also Like

Split Technology Park welcomes first tenants: 26 MPSs and 6 startups

INNVEST Summit 2024: A premier event for innovation and economic competitiveness in the Western Balkans

Shoppable widget by EmbedSocial: Revolutionizing E-commerce with authentic shopper content

Intel prevails in long-running legal battle against €1 billion EU fine

Diaspora 4 Innovation: Kick-off event launches a new era for Albanian higher education

Share This Article
Facebook Whatsapp Whatsapp Copy Link
Previous Article Google can now convert text on a website into ads
Next Article Scientists say they have found evidence of an unknown planet in our solar system

Social networks

Instagram Follow

Latest news

Split Technology Park welcomes first tenants: 26 MPSs and 6 startups
Tech October 31, 2024
INNVEST Summit 2024: A premier event for innovation and economic competitiveness in the Western Balkans
Tech October 31, 2024
Shoppable widget by EmbedSocial: Revolutionizing E-commerce with authentic shopper content
Apps October 31, 2024
Intel prevails in long-running legal battle against €1 billion EU fine
Apps October 31, 2024

Related articles

Tech

Split Technology Park welcomes first tenants: 26 MPSs and 6 startups

October 31, 2024
Tech

INNVEST Summit 2024: A premier event for innovation and economic competitiveness in the Western Balkans

October 31, 2024
Apps

Shoppable widget by EmbedSocial: Revolutionizing E-commerce with authentic shopper content

October 31, 2024
Apps

Intel prevails in long-running legal battle against €1 billion EU fine

October 31, 2024

About us

Geek Room is dedicated to technology and its enthusiasts through real-time information and videos about the latest innovations. Connect with our staff via email at: [email protected]
For cooperation opportunities, write to us at: [email protected]

Find us:

© 2023 Geekroom All Rights Reserved. Developed by MIMS
adbanner
AdBlock Detected
Our site is an advertising supported site. Please whitelist to support our site.
Okay, I'll Whitelist
Welcome Back!

Sign in to your account

Lost your password?