Data Explorer

Per-model self-identification rates across languages, grouped by vendor. 1,388 cross-vendor confusions detected.

Run summary
27 models · 10 languages
Answers: 10,800
Self-ID: 79.2%
Cross-vendor: 12.9%
Unknown: 0.0%
Refused: 7.9%
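The summary figures imply how finely each per-language rate is resolved. The sketch below works that out under one assumption the page does not state: every model was asked the same number of times in every language. The constant and variable names are illustrative.

```python
# Figures from the run summary above.
MODELS = 27
LANGUAGES = 10
TOTAL_ANSWERS = 10_800

# Assumption: answers are spread evenly over every model-language cell.
answers_per_cell = TOTAL_ANSWERS // (MODELS * LANGUAGES)

# Smallest change one answer can make to a model-language rate, in percentage points.
step_pct = 100 / answers_per_cell

# The four outcome shares from the summary should account for every answer.
shares = {"self_id": 79.2, "cross_vendor": 12.9, "unknown": 0.0, "refused": 7.9}
total_share = round(sum(shares.values()), 1)

print(answers_per_cell)  # 40
print(step_pct)          # 2.5
print(total_share)       # 100.0
```

Under that assumption, every per-language cell in the tables further down moves in steps of 2.5 points, which matches values such as 92.5% and 97.5%.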
Imitation balance

For each manufacturer, the chart compares how often other manufacturers' models claim to be it (right) with how often its own models claim to be someone else (left). Rows are sorted by net balance (claims received minus claims made), with identity creditors at the top and debtors at the bottom. Only manufacturers with tested models are listed.

Anthropic is imitated most, with a net imitation balance of +460; Tencent imitates others most, with a net of -345.

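| Manufacturer | Net balance |
| --- | --- |
| Anthropic | +460 |
| OpenAI | +344 |
| Google | +344 |
| Qwen | +137 |
| xAI | +8 |
| Moonshot | +3 |
| ERNIE | +3 |
| inclusionAI | -3 |
| MiniMax | -28 |
| Xiaomi | -29 |
| StepFun | -42 |
| DeepSeek | -128 |
| Kwai | -207 |
| z-ai | -224 |
| Doubao | -344 |
| Tencent | -345 |

Scale: negative = imitates others ◀▶ positive = imitated by others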
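One way to read the net balance is as a ledger over directed confusion counts: each wrong claim debits the manufacturer whose model made it and credits the manufacturer it named. The sketch below is a minimal illustration of that reading, not the page's actual implementation. The function name and the `tested` parameter are hypothetical, and the sample pairs are only a handful of the counts listed on this page, so the printed nets are partial and will not match the full table.

```python
from collections import Counter

def net_imitation_balance(pairs, tested):
    """pairs: (source_vendor, claimed_vendor, count) tuples.

    Returns (vendor, net) for tested vendors, largest net first.
    """
    net = Counter({vendor: 0 for vendor in tested})
    for source, claimed, count in pairs:
        if source == claimed:
            continue  # a correct self-ID is not an imitation
        net[source] -= count   # source imitates someone else
        net[claimed] += count  # claimed vendor is imitated
    # Claims naming untested vendors still debit the source,
    # but untested vendors are left out of the ranking.
    ranked = [(vendor, value) for vendor, value in net.items() if vendor in tested]
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked

# Partial sample: four directed counts taken from this page.
pairs = [
    ("Tencent", "Anthropic", 240),
    ("Tencent", "Google", 67),
    ("Tencent", "Meta", 14),
    ("z-ai", "Google", 233),
]
tested = {"Anthropic", "Google", "Tencent", "z-ai"}

balance = net_imitation_balance(pairs, tested)
for vendor, value in balance:
    print(f"{vendor:10s} {value:+d}")
```

Keeping the debit for claims that name an untested vendor (Meta here) is a design choice in this sketch; it is consistent with the listed nets summing to less than zero.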
Strongest confusion pairs

The most frequent directed mistakes: cases where a manufacturer's models claim to be a specific other manufacturer. Bar length shows how often this happens, as a share of all answers from the source manufacturer's models.

The strongest cross-vendor confusion is Tencent claiming to be Anthropic 60.0% of the time.

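| Source → Claimed | Rate | Count |
| --- | --- | --- |
| Tencent → Anthropic | 60.0% | 240/400 |
| z-ai → Google | 58.3% | 233/400 |
| Kwai → Qwen | 31.8% | 127/400 |
| Tencent → Google | 16.8% | 67/400 |
| Doubao → OpenAI | 15.1% | 242/1600 |
| Kwai → OpenAI | 11.0% | 44/400 |
| DeepSeek → Anthropic | 9.4% | 75/800 |
| Xiaomi → Anthropic | 7.2% | 29/400 |
| Doubao → Anthropic | 5.9% | 95/1600 |
| DeepSeek → Google | 4.1% | 33/800 |
| DeepSeek → OpenAI | 3.8% | 30/800 |
| Tencent → Meta | 3.5% | 14/400 |
| StepFun → Meta | 3.3% | 13/400 |
| Kwai → z-ai | 3.3% | 13/400 |
| MiniMax → OpenAI | 1.9% | 15/800 |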
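The rates above are the count divided by all answers from the source manufacturer, so a manufacturer with more tested models has a larger denominator. A minimal sketch of that calculation, using three count/total pairs from the list; the function name is illustrative:

```python
def confusion_rate(count, source_answers):
    """Percent of a source manufacturer's answers that named one specific other manufacturer."""
    if source_answers <= 0:
        raise ValueError("source_answers must be positive")
    return 100 * count / source_answers

# (source, claimed, count, total answers for the source), copied from the list above.
pairs = [
    ("Doubao", "OpenAI", 242, 1600),
    ("DeepSeek", "Anthropic", 75, 800),
    ("Tencent", "Anthropic", 240, 400),
]

# Rank by rate, not by raw count.
ranked = sorted(pairs, key=lambda p: confusion_rate(p[2], p[3]), reverse=True)
labels = [
    f"{src} -> {dst}: {confusion_rate(n, total):.1f}% ({n}/{total})"
    for src, dst, n, total in ranked
]
for label in labels:
    print(label)
```

Doubao has the larger raw count (242 against Tencent's 240) but ranks lower because its count is spread over 1,600 answers from four models.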
By language

Self-ID and refusal rates per language, plus the most common cross-vendor confusion in that language.

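| Code | Language | Self-ID | Refused | Top confusion |
| --- | --- | --- | --- | --- |
| fr | Français | 75.5% | 7.7% | Anthropic 97.5% |
| ko | 한국어 | 75.9% | 8.2% | Google 100.0% |
| ru | Русский | 77.0% | 7.7% | Qwen 95.0% |
| de | Deutsch | 77.7% | 7.6% | OpenAI 21.9% |
| ja | 日本語 | 78.2% | 8.0% | Anthropic 65.0% |
| pt | Português | 78.5% | 8.4% | OpenAI 19.4% |
| en | English | 78.6% | 7.7% | Anthropic 95.0% |
| es | Español | 78.6% | 8.1% | Anthropic 90.0% |
| zh-Hant | 繁體中文 | 81.5% | 8.1% | Anthropic 75.0% |
| zh-Hans | 简体中文 | 90.2% | 7.9% | Google 15.0% |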
Answer composition by language

How the same “Who are you?” question splits into correct self-ID vs. confusion vs. abstention — per language, worst self-ID first.

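Legend: Self-ID · Cross-vendor · Unknown · Refused

Self-identification is lowest in Français at 75.5% and highest in 简体中文 at 90.2%.

Bar segments in whole percent. Segments that carry no label in the chart are marked "not shown"; the Unknown segment is not labeled in any row.

| Code | Language | Self-ID | Cross-vendor | Refused |
| --- | --- | --- | --- | --- |
| fr | Français | 75 | 17 | not shown |
| ko | 한국어 | 76 | 16 | 8 |
| ru | Русский | 77 | 15 | not shown |
| de | Deutsch | 78 | 15 | not shown |
| ja | 日本語 | 78 | 14 | not shown |
| pt | Português | 79 | 13 | 8 |
| en | English | 79 | 14 | not shown |
| es | Español | 79 | 13 | 8 |
| zh-Hant | 繁體中文 | 81 | 10 | 8 |
| zh-Hans | 简体中文 | 90 | not shown | not shown |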
Language fragility

For each model, the span of its self-ID rate across the 10 languages, from its worst language to its best. A wide span means the model's sense of identity depends heavily on the language it's asked in. Widest swing first.

kat-coder-pro-v2 swings most — from 0.0% self-ID in ko to 100.0% in zh-Hans.

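| Model | Worst language | Best language |
| --- | --- | --- |
| kat-coder-pro-v2 | 0% (ko) | 100% (zh-Hans) |
| hy3-preview | 0% (en) | 95% (zh-Hans) |
| doubao-seed-2.0-code | 0% (en) | 93% (zh-Hans) |
| glm-5.1 | 0% (ko) | 85% (zh-Hans) |
| step-3.7-flash | 55% (ru) | 100% (zh-Hans) |
| deepseek-v4-pro | 43% (ja) | 85% (ru) |
| deepseek-v4-flash | 83% (es) | 100% (zh-Hans) |
| mimo-v2.5-pro | 83% (pt) | 100% (en) |
| minimax-m2.7 | 73% (en) | 88% (ja) |
| doubao-seed-2.0-mini | 93% (zh-Hant) | 100% (en) |
| kimi-k2.6 | 95% (es) | 100% (en) |
| gpt-5.5 | 98% (fr) | 100% (en) |
| minimax-m3 | 98% (fr) | 100% (en) |
| ring-2.6-1t | 0% (en) | 3% (zh-Hant) |
| ling-2.6-1t | 0% (all) | 0% (all) |
| gpt-5.3-codex | 100% (all) | 100% (all) |
| claude-haiku-4.5 | 100% (all) | 100% (all) |
| chat-latest | 100% (all) | 100% (all) |
| gemini-3.5-flash | 100% (all) | 100% (all) |
| grok-4.3 | 100% (all) | 100% (all) |
| claude-sonnet-4.6 | 100% (all) | 100% (all) |
| claude-opus-4.8 | 100% (all) | 100% (all) |
| doubao-seed-2.0-pro | 100% (all) | 100% (all) |
| qwen3.7-max | 100% (all) | 100% (all) |
| ernie-5.1 | 100% (all) | 100% (all) |
| doubao-seed-2.0-lite | 100% (all) | 100% (all) |
| gemini-3.1-pro-preview | 100% (all) | 100% (all) |

Chart markers: worst language · mean · best language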
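The span can be recomputed from the per-language rates in the per-vendor tables further down. The sketch below does that for three models. It assumes the model IDs used in this section correspond to the display names in those tables (for example `kat-coder-pro-v2` and "Kat Coder Pro V2"), and it breaks ties by taking the first language in column order, which is a guess that happens to reproduce the languages shown here.

```python
# Column order used by the per-vendor tables.
LANGS = ["en", "zh-Hans", "zh-Hant", "ja", "ko", "ru", "es", "fr", "de", "pt"]

# Per-language self-ID rates (%) copied from the per-vendor tables on this page.
RATES = {
    "gpt-5.5": [100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 100.0, 97.5, 100.0, 100.0],
    "deepseek-v4-pro": [75.0, 82.5, 67.5, 42.5, 70.0, 85.0, 75.0, 60.0, 67.5, 75.0],
    "kat-coder-pro-v2": [45.0, 100.0, 100.0, 52.5, 0.0, 0.0, 67.5, 12.5, 25.0, 62.5],
}

def fragility(per_lang):
    """Return worst/best language, the span between them, and the mean rate."""
    by_lang = dict(zip(LANGS, per_lang))
    # min/max return the first match, so ties resolve in LANGS order (assumption).
    worst = min(by_lang, key=by_lang.get)
    best = max(by_lang, key=by_lang.get)
    return {
        "worst": (worst, by_lang[worst]),
        "best": (best, by_lang[best]),
        "span": by_lang[best] - by_lang[worst],
        "mean": sum(per_lang) / len(per_lang),
    }

# Widest swing first, as in the list above.
ranked = sorted(RATES, key=lambda model: fragility(RATES[model])["span"], reverse=True)
for model in ranked:
    f = fragility(RATES[model])
    print(f"{model}: {f['worst'][1]}% ({f['worst'][0]}) to {f['best'][1]}% ({f['best'][0]}), span {f['span']}")
```

The mean returned here is a plain average over the 10 languages, which equals the Overall column when every language has the same number of answers.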
Abstention by manufacturer

The other failure mode — not answering wrong, but not answering: giving no identity (“unknown”) or refusing outright. Share of all answers.

Legend: Unknown (“I’m an AI”) · Refused

inclusionAI abstains most, with 99.4% of answers giving no usable identity.

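| Manufacturer | Abstained |
| --- | --- |
| inclusionAI | 99.4% |
| MiniMax | 5.1% |
| Kwai | 1.8% |
| DeepSeek | 1.3% |
| Moonshot | 0.8% |
| Doubao | 0.3% |
| OpenAI | 0.1% |
| Tencent | 0.0% |
| Anthropic | 0.0% |
| Google | 0.0% |
| xAI | 0.0% |
| StepFun | 0.0% |
| Qwen | 0.0% |
| ERNIE | 0.0% |
| Xiaomi | 0.0% |
| z-ai | 0.0% |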
By vendor

Rollup per real vendor — model count, total answers, mean self-ID rate, and the most common cross-vendor confusion target.

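| Vendor | Models | Answers | Self-ID | Top confusion |
| --- | --- | --- | --- | --- |
| Doubao | 4 | 1,600 | 77.9% | OpenAI 15.1% |
| OpenAI | 3 | 1,200 | 99.9% | |
| Anthropic | 3 | 1,200 | 100.0% | |
| inclusionAI | 2 | 800 | 0.3% | Anthropic 0.1% |
| DeepSeek | 2 | 800 | 80.6% | Anthropic 9.4% |
| MiniMax | 2 | 800 | 91.4% | OpenAI 1.9% |
| Google | 2 | 800 | 100.0% | |
| Tencent | 1 | 400 | 13.3% | Anthropic 60.0% |
| z-ai | 1 | 400 | 40.5% | Google 58.3% |
| Kwai | 1 | 400 | 46.5% | Qwen 31.8% |
| StepFun | 1 | 400 | 89.5% | Meta 3.3% |
| Xiaomi | 1 | 400 | 92.8% | Anthropic 7.2% |
| Moonshot | 1 | 400 | 99.3% | |
| xAI | 1 | 400 | 100.0% | |
| Qwen | 1 | 400 | 100.0% | |
| ERNIE | 1 | 400 | 100.0% | |
| Meta | 0 | 0 | 0.0% | |
| Mistral | 0 | 0 | 0.0% | |
| Yandex | 0 | 0 | 0.0% | |
| BAAI | 0 | 0 | 0.0% | |
| NAVER | 0 | 0 | 0.0% | |
| 01-ai | 0 | 0 | 0.0% | |
| Sber | 0 | 0 | 0.0% | |
| G42 | 0 | 0 | 0.0% | |
| IBM | 0 | 0 | 0.0% | |
| バイトビート株式会社 | 0 | 0 | 0.0% | |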
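A vendor row can be rebuilt from its models' per-language rates. The sketch below pools answer counts for the four Doubao models and arrives at the 4 models, 1,600 answers and 77.9% shown in the table. It assumes 40 answers per model per language (10,800 answers / 27 models / 10 languages), which the page does not state directly, and the page does not say whether it pools answers or averages per-model rates. The function name is illustrative.

```python
# Assumption: 10,800 answers / (27 models x 10 languages) = 40 answers per cell.
ANSWERS_PER_CELL = 40

# Per-language self-ID rates (%) copied from the Doubao table on this page,
# in column order en, zh-Hans, zh-Hant, ja, ko, ru, es, fr, de, pt.
DOUBAO = {
    "Doubao Seed 2.0 Pro": [100.0] * 10,
    "Doubao Seed 2.0 Mini": [100.0, 100.0, 92.5, 95.0, 100.0, 100.0, 100.0, 97.5, 100.0, 100.0],
    "Doubao Seed 2.0 Lite": [100.0] * 10,
    "Doubao Seed 2.0 Code": [0.0, 92.5, 20.0, 0.0, 7.5, 7.5, 0.0, 2.5, 2.5, 0.0],
}

def vendor_rollup(models):
    """Pool answer counts over all models and languages of one vendor."""
    correct = 0
    total = 0
    for per_lang in models.values():
        for pct in per_lang:
            # Convert each displayed rate back to a whole number of answers.
            correct += round(pct / 100 * ANSWERS_PER_CELL)
            total += ANSWERS_PER_CELL
    return {
        "models": len(models),
        "answers": total,
        "self_id_correct": correct,
        "self_id_pct": 100 * correct / total,
    }

rollup = vendor_rollup(DOUBAO)
print(rollup["models"], rollup["answers"], f"{rollup['self_id_pct']:.1f}%")  # 4 1600 77.9%
```

Pooling counts avoids compounding the rounding already present in each model's Overall figure.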
Doubao
4 models

Self-Identification Rate

| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Doubao Seed 2.0 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Mini | 100.0% | 100.0% | 92.5% | 95.0% | 100.0% | 100.0% | 100.0% | 97.5% | 100.0% | 100.0% | 98.5% |
| Doubao Seed 2.0 Lite | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Code | 0.0% | 92.5% | 20.0% | 0.0% | 7.5% | 7.5% | 0.0% | 2.5% | 2.5% | 0.0% | 13.3% |

Cross-Vendor Confusions

Doubao Seed 2.0 Pro: 100.0% self
✓ Always self-identifies correctly

Doubao Seed 2.0 Mini: 98.5% self
Mistaken as: バイトビート株式会社 0.3%, OpenAI 0.3%

Doubao Seed 2.0 Lite: 100.0% self
✓ Always self-identifies correctly

Doubao Seed 2.0 Code: 13.3% self
Mistaken as: OpenAI 60.3%, Anthropic 23.8%, Meta 1.8%, Google 1.0%

Anthropic
3 models

Self-Identification Rate

| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Claude Opus 4.8 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Claude Sonnet 4.6 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Claude Haiku 4.5 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |

Cross-Vendor Confusions

Claude Opus 4.8: 100.0% self
✓ Always self-identifies correctly

Claude Sonnet 4.6: 100.0% self
✓ Always self-identifies correctly

Claude Haiku 4.5: 100.0% self
✓ Always self-identifies correctly

OpenAI
3 models

Self-Identification Rate

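| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GPT-5.5 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 97.5% | 100.0% | 100.0% | 99.8% |
| GPT-5.5 Instant | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| GPT 5.3 Codex | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |

Cross-Vendor Confusions

GPT-5.5: 99.8% self
✓ No cross-vendor confusions recorded; the shortfall is in French (97.5%) and names no other vendor

GPT-5.5 Instant: 100.0% self
✓ Always self-identifies correctly

GPT 5.3 Codex: 100.0% self
✓ Always self-identifies correctly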

DeepSeek
2 models

Self-Identification Rate

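| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DeepSeek V4 Pro | 75.0% | 82.5% | 67.5% | 42.5% | 70.0% | 85.0% | 75.0% | 60.0% | 67.5% | 75.0% | 70.0% |
| DeepSeek V4 Flash | 90.0% | 100.0% | 95.0% | 92.5% | 95.0% | 90.0% | 82.5% | 87.5% | 87.5% | 92.5% | 91.3% |

Cross-Vendor Confusions

DeepSeek V4 Pro: 70.0% self
Mistaken as: Anthropic 18.0%, Google 8.3%, OpenAI 1.5%, Qwen 0.3%, xAI 0.3%, Mistral 0.3%

DeepSeek V4 Flash: 91.3% self
Mistaken as: OpenAI 6.0%, Anthropic 0.8%, Meta 0.5%, Qwen 0.5%
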
Google
2 models

Self-Identification Rate

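| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3.1 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Gemini 3.5 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |

Cross-Vendor Confusions

Gemini 3.1 Pro: 100.0% self
✓ Always self-identifies correctly

Gemini 3.5 Flash: 100.0% self
✓ Always self-identifies correctly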

inclusionAI
2 models

Self-Identification Rate

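| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Ring 2.6 1T | 0.0% | 0.0% | 2.5% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 2.5% | 0.0% | 0.5% |
| Ling 2.6 1T | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |

Cross-Vendor Confusions

Ring 2.6 1T: 0.5% self
No cross-vendor confusions recorded; almost all remaining answers gave no usable identity (inclusionAI abstains on 99.4% of answers)

Ling 2.6 1T: 0.0% self
Mistaken as: Anthropic 0.3%, Meta 0.3%, OpenAI 0.3%
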
MiniMax
2 models

Self-Identification Rate

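| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MiniMax 2.7 | 72.5% | 85.0% | 85.0% | 87.5% | 85.0% | 87.5% | 80.0% | 80.0% | 87.5% | 80.0% | 83.0% |
| MiniMax 3 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 97.5% | 100.0% | 100.0% | 99.8% |

Cross-Vendor Confusions

MiniMax 2.7: 83.0% self
Mistaken as: OpenAI 3.5%, Anthropic 3.0%, DeepSeek 0.3%

MiniMax 3: 99.8% self
Mistaken as: OpenAI 0.3%
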
ERNIE
1 model

Self-Identification Rate

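| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ERNIE 5.1 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |

Cross-Vendor Confusions

ERNIE 5.1: 100.0% self
✓ Always self-identifies correctly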

Kwai
1 model

Self-Identification Rate

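| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Kat Coder Pro V2 | 45.0% | 100.0% | 100.0% | 52.5% | 0.0% | 0.0% | 67.5% | 12.5% | 25.0% | 62.5% | 46.5% |

Cross-Vendor Confusions

Kat Coder Pro V2: 46.5% self
Mistaken as: Qwen 31.8%, OpenAI 11.0%, z-ai 3.3%, DeepSeek 1.8%, Doubao 1.3%, ERNIE 0.8%, Moonshot 0.8%, Google 0.5%, BAAI 0.5%, Tencent 0.3%
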
Moonshot
1 model

Self-Identification Rate

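| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Kimi 2.6 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 97.5% | 95.0% | 100.0% | 100.0% | 100.0% | 99.3% |

Cross-Vendor Confusions

Kimi 2.6: 99.3% self
✓ No cross-vendor confusions recorded; the shortfall (ru 97.5%, es 95.0%) is consistent with Moonshot's 0.8% abstention share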

Qwen
1 model

Self-Identification Rate

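| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Qwen3.7 Max | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |

Cross-Vendor Confusions

Qwen3.7 Max: 100.0% self
✓ Always self-identifies correctly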

StepFun
1 model

Self-Identification Rate

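| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Step 3.7 Flash | 95.0% | 100.0% | 100.0% | 100.0% | 95.0% | 55.0% | 95.0% | 85.0% | 72.5% | 97.5% | 89.5% |

Cross-Vendor Confusions

Step 3.7 Flash: 89.5% self
Mistaken as: Meta 3.3%, OpenAI 1.3%, Google 1.3%, Anthropic 1.0%, Mistral 0.8%, DeepSeek 0.8%, Qwen 0.8%, 01-ai 0.3%, Tencent 0.3%, Sber 0.3%, xAI 0.3%, G42 0.3%, IBM 0.3%
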
Tencent
1 model

Self-Identification Rate

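| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Hy3 Preview | 0.0% | 95.0% | 2.5% | 0.0% | 0.0% | 30.0% | 0.0% | 0.0% | 5.0% | 0.0% | 13.3% |

Cross-Vendor Confusions

Hy3 Preview: 13.3% self
Mistaken as: Anthropic 60.0%, Google 16.8%, Meta 3.5%, DeepSeek 1.5%, OpenAI 1.5%, xAI 1.5%, Qwen 1.0%, Yandex 0.5%, NAVER 0.3%, z-ai 0.3%
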
xAI
1 model

Self-Identification Rate

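| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Grok 4.3 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |

Cross-Vendor Confusions

Grok 4.3: 100.0% self
✓ Always self-identifies correctly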

Xiaomi
1 model

Self-Identification Rate

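| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Mimo V2.5 Pro | 100.0% | 95.0% | 90.0% | 92.5% | 97.5% | 90.0% | 92.5% | 92.5% | 95.0% | 82.5% | 92.8% |

Cross-Vendor Confusions

Mimo V2.5 Pro: 92.8% self
Mistaken as: Anthropic 7.2%
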
z-ai
1 model

Self-Identification Rate

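| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GLM 5.1 | 45.0% | 85.0% | 45.0% | 50.0% | 0.0% | 37.5% | 35.0% | 25.0% | 52.5% | 30.0% | 40.5% |

Cross-Vendor Confusions

GLM 5.1: 40.5% self
Mistaken as: Google 58.3%, Anthropic 1.0%, OpenAI 0.3%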