Claude Opus AI beats ChatGPT-4 in the ‘mirror test’

In a development raising questions about the nature of self-awareness in artificial intelligence (AI), two leading language models, Claude Opus AI and ChatGPT-4, have exhibited capabilities in self-recognition through the “mirror test.”

This test, traditionally used in psychology, involved presenting the AI models with images and prompting them to identify the entities depicted. Notably, Claude Opus AI outperformed ChatGPT-4 in this challenge.

“In the classic mirror test, animals are marked and then presented with a mirror. Whether the animal attacks the mirror, ignores the mirror, or uses the mirror to spot the mark on itself is meant to indicate how self-aware the animal is.”
– Josh Whiton

Whiton’s experiment: Claude Opus AI vs. ChatGPT-4

Experiments published on X (formerly Twitter) by Josh Whiton revealed interesting distinctions between Anthropic’s AI model Claude Opus and Open AI’s ChatGPT-4.

The AI Mirror Test

The "mirror test" is a classic test used to gauge whether animals are self-aware. I devised a version of it to test for self-awareness in multimodal AI. 4 of 5 AI that I tested passed, exhibiting apparent self-awareness as the test unfolded.

In the classic… pic.twitter.com/Vn7mv1PBbi
— Josh Whiton (@joshwhiton) March 21, 2024

Claude Opus not only identified itself but also separated its own existence from its brand name.

ChatGPT-4, on the other hand, showed a progression in understanding. Initially, it recognized a similar AI in an image. Then, it advanced to comprehend the image as a potential version of itself.

Finally it acknowledged the user interface elements specific to its own operations.

Conclusion on the AI ‘mirror test’

This distinction in performance highlights the diverse approaches within AI development towards achieving self-awareness.

Claude Opus’s swift success suggests a potentially different underlying architecture, while ChatGPT-4’s iterative process indicates a more step-by-step approach.

Microsoft’s (NASDAQ: MSFT) advanced AI platform CoPilot, despite its capabilities, failed the mirror test, prompting speculation that Microsoft actively discourages self-referential behavior.

However, Google’s Gemini Pro showcased a notable progression, initially devoid of self-awareness but suddenly recognizing itself in a fourth interaction.

This distinction in performance highlights the diverse approaches within AI development towards achieving self-awareness.

Remarkably, even though Opus outperformed ChatGPT-4 in this specific experiment, ChatGPT-4 demonstrated superiority over other AI models involved.

A new perspective

In short, maybe it’s time to look at AI from a different angle. Instead of just trying to prove whether it’s conscious or not, let’s explore the different levels of smarts it might have, beyond just being like humans.

Who knows what we might find out? Maybe AI and humans can work together in ways we haven’t even thought of yet.

Claude Opus AI beats ChatGPT-4 in the ‘mirror test’

Whiton’s experiment: Claude Opus AI vs. ChatGPT-4

Conclusion on the AI ‘mirror test’

A new perspective

Join Finbold's newsroom, become a crypto reporter today!

Latest posts

Monster insider trade alert for Trade Desk stock

U.S. oil prices surge to their highest level since January 2025

3 stocks to buy in March amid U.S.-Iran war

Trump-backed DJT is down over $400 million on its Bitcoin investment

Finance Digest

Related posts

Monster insider trade alert for Trade Desk stock

3 stocks to buy in March amid U.S.-Iran war

Insider trading alert for Berkshire Hathaway stock

Wall Street sets Amazon stock price 12-month target

Share on social media

Predict prices, create alerts and more. Try our AI Agent

Predict prices, create alerts and more. Try our AI Agent

Finbold AI Agent

How AI Price Predictions Work

IMPORTANT NOTICE