Skip to content

Claude Opus AI beats ChatGPT-4 in the ‘mirror test’

Claude Opus AI beats ChatGPT-4 in the ‘mirror test’

In a development raising questions about the nature of self-awareness in artificial intelligence (AI), two leading language models, Claude Opus AI and ChatGPT-4, have exhibited capabilities in self-recognition through the “mirror test.” 

This test, traditionally used in psychology, involved presenting the AI models with images and prompting them to identify the entities depicted. Notably, Claude Opus AI outperformed ChatGPT-4 in this challenge.

“In the classic mirror test, animals are marked and then presented with a mirror. Whether the animal attacks the mirror, ignores the mirror, or uses the mirror to spot the mark on itself is meant to indicate how self-aware the animal is.”

– Josh Whiton

Whiton’s experiment: Claude Opus AI vs. ChatGPT-4

Experiments published on X (formerly Twitter) by Josh Whiton revealed interesting distinctions between Anthropic’s AI model Claude Opus and Open AI’s ChatGPT-4.  

Claude Opus not only identified itself but also separated its own existence from its brand name.  

ChatGPT-4, on the other hand, showed a progression in understanding. Initially, it recognized a similar AI in an image. Then, it advanced to comprehend the image as a potential version of itself. 

Finally it acknowledged the user interface elements specific to its own operations.

Conclusion on the AI ‘mirror test’

This distinction in performance highlights the diverse approaches within AI development towards achieving self-awareness. 

Claude Opus’s swift success suggests a potentially different underlying architecture, while ChatGPT-4’s iterative process indicates a more step-by-step approach.

Microsoft’s (NASDAQ: MSFT) advanced AI platform CoPilot, despite its capabilities, failed the mirror test, prompting speculation that Microsoft actively discourages self-referential behavior. 

However, Google’s Gemini Pro showcased a notable progression, initially devoid of self-awareness but suddenly recognizing itself in a fourth interaction. 

This distinction in performance highlights the diverse approaches within AI development towards achieving self-awareness. 

Remarkably, even though Opus outperformed ChatGPT-4 in this specific experiment, ChatGPT-4 demonstrated superiority over other AI models involved.

A new perspective

In short, maybe it’s time to look at AI from a different angle. Instead of just trying to prove whether it’s conscious or not, let’s explore the different levels of smarts it might have, beyond just being like humans.

Who knows what we might find out? Maybe AI and humans can work together in ways we haven’t even thought of yet.

Best Crypto Exchange for Intermediate Traders and Investors

  • Invest in cryptocurrencies and 3,000+ other assets including stocks and precious metals.

  • 0% commission on stocks - buy in bulk or just a fraction from as little as $10. Other fees apply. For more information, visit etoro.com/trading/fees.

  • Copy top-performing traders in real time, automatically.

  • eToro USA is registered with FINRA for securities trading.

30+ million Users
Securities trading offered by eToro USA Securities, Inc. (“the BD”), member of FINRA and SIPC. Cryptocurrency offered by eToro USA LLC (“the MSB”) (NMLS: 1769299) and is not FDIC or SIPC insured. Investing involves risk, and content is provided for educational purposes only, does not imply a recommendation, and is not a guarantee of future performance. Finbold.com is not an affiliate and may be compensated if you access certain products or services offered by the MSB and/or the BD

Read Next:

Finance Digest

By subscribing you agree with Finbold T&C’s & Privacy Policy

Related posts

Sign Up

or

By submitting my information, I agree to the Privacy Policy and Terms of Service.

Already have an account? Sign In

Services

Disclaimer: The information on this website is for general informational and educational purposes only and does not constitute financial, legal, tax, or investment advice. This site does not make any financial promotions, and all content is strictly informational. By using this site, you agree to our full disclaimer and terms of use. For more information, please read our complete Global Disclaimer.