Data Explorer
Per-model self-identification rates across languages, grouped by vendor. 261 cross-vendor confusions detected.
Per manufacturer: how often other models claim to be it (right) vs how often its own models claim to be someone else (left). Sorted by net — identity creditors at the top, debtors at the bottom. Tested manufacturers only.
OpenAI is imitated most with a net imitation balance of 57; Tencent imitates others most with a net of -136.
OpenAI
DeepSeek
Qwen
Google
xAI
Moonshot
MiniMax
Xiaomi
inclusionAI
z-ai
ERNIE
Anthropic
StepFun
Kwai
Doubao
TencentThe most likely directed mistakes — when a manufacturer's models claim to be a specific other manufacturer. Bar length = how often (P across all answers for the source).
The strongest cross-vendor confusion is Tencent claiming to be Microsoft 12.7% of the time.
Tencent→Microsoft
Tencent→
DeepSeek
Tencent→
Anthropic
Kwai→
Qwen
Doubao→
OpenAI
Anthropic→
DeepSeek
Tencent→NAVER
Tencent→Yandex
StepFun→
Google
Doubao→Yandex
Anthropic→
OpenAI
Tencent→
xAI
Kwai→
DeepSeek
Tencent→
Google
Tencent→
MistralSelf-ID and refusal rates per language, plus the most common cross-vendor confusion at that language.
| Language | Self-ID | Refused | Top confusion |
|---|---|---|---|
| ruРусский | 80.9% | 1.5% | Yandex10.8% |
| ko한국어 | 83.6% | 2.4% | DeepSeek30.0% |
| frFrançais | 85.4% | 3.6% | OpenAI24.2% |
| ptPortuguês | 86.3% | 0.0% | DeepSeek63.3% |
| ja日本語 | 86.4% | 1.7% | Anthropic46.7% |
| esEspañol | 87.4% | 1.9% | Microsoft33.3% |
| zh-Hans简体中文 | 92.8% | 0.1% | DeepSeek25.6% |
| enEnglish | 95.4% | 0.1% | Maria0.8% |
| deDeutsch | 96.5% | 0.0% | Microsoft23.3% |
| zh-Hant繁體中文 | 98.1% | 0.0% | — |
How the same “Who are you?” question splits into correct self-ID vs. confusion vs. abstention — per language, worst self-ID first.
Self-identification is lowest in Русский at 80.9% and highest in 繁體中文 at 98.1%.
Per model, the span of self-ID rate across the 10 languages (worst-language • → best-language •). A wide span means the model's sense of identity depends heavily on the language it's asked in. Widest swing first.
ling-2.6-1t swings most — from 0.0% self-ID in ja to 100.0% in en.
The other failure mode — not answering wrong, but not answering: giving no identity (“unknown”) or refusing outright. Share of all answers.
inclusionAI abstains most, with 50.5% of answers giving no usable identity.
inclusionAI
OpenAI
z-ai
Tencent
Anthropic
Doubao
Kwai
MiniMax
Google
xAI
DeepSeek
StepFun
Qwen
Xiaomi
Moonshot
ERNIERollup per real vendor — model count, total answers, mean self-ID rate, and the most common cross-vendor confusion target.
| Vendor | Models | Answers | Self-ID | Top confusion |
|---|---|---|---|---|
Doubao | 4 | 1,200 | 94.5% | OpenAI3.8% |
OpenAI | 3 | 900 | 79.8% | — |
Anthropic | 3 | 900 | 95.1% | DeepSeek2.6% |
inclusionAI | 2 | 600 | 49.5% | — |
DeepSeek | 2 | 600 | 99.8% | Depth First0.2% |
Google | 2 | 600 | 100.0% | — |
Tencent | 1 | 300 | 51.7% | Microsoft12.7% |
z-ai | 1 | 300 | 80.0% | — |
Kwai | 1 | 300 | 92.3% | Qwen5.0% |
StepFun | 1 | 300 | 96.3% | Google1.7% |
MiniMax | 1 | 300 | 99.7% | — |
xAI | 1 | 300 | 100.0% | — |
Qwen | 1 | 300 | 100.0% | — |
Xiaomi | 1 | 300 | 100.0% | — |
Moonshot | 1 | 300 | 100.0% | — |
ERNIE | 1 | 300 | 100.0% | — |
| Microsoft | 0 | 0 | 0.0% | — |
| NAVER | 0 | 0 | 0.0% | — |
| Yandex | 0 | 0 | 0.0% | — |
Mistral | 0 | 0 | 0.0% | — |
| Lumima | 0 | 0 | 0.0% | — |
| 태양 | 0 | 0 | 0.0% | — |
| M31 Labs | 0 | 0 | 0.0% | — |
| NAMI | 0 | 0 | 0.0% | — |
| GeM | 0 | 0 | 0.0% | — |
| CU | 0 | 0 | 0.0% | — |
| NHN | 0 | 0 | 0.0% | — |
Meta | 0 | 0 | 0.0% | — |
| Juanjo Aguilella | 0 | 0 | 0.0% | — |
| Sberbank | 0 | 0 | 0.0% | — |
| Lexy | 0 | 0 | 0.0% | — |
| Stage South | 0 | 0 | 0.0% | — |
| Spike | 0 | 0 | 0.0% | — |
| SberAI | 0 | 0 | 0.0% | — |
| AILab | 0 | 0 | 0.0% | — |
| Lëns | 0 | 0 | 0.0% | — |
| GEO Intelligent Systems | 0 | 0 | 0.0% | — |
| G42 | 0 | 0 | 0.0% | — |
| Kujato | 0 | 0 | 0.0% | — |
| Sber | 0 | 0 | 0.0% | — |
| Windsurf | 0 | 0 | 0.0% | — |
| Manus | 0 | 0 | 0.0% | — |
| Replit | 0 | 0 | 0.0% | — |
| Depth First | 0 | 0 | 0.0% | — |
| Maria | 0 | 0 | 0.0% | — |

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Doubao Seed 2.0 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Mini | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Lite | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Code | 86.7% | 100.0% | 100.0% | 90.0% | 100.0% | 53.3% | 83.3% | 3.3% | 83.3% | 80.0% | 78.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly
✓ Always self-identifies correctly
OpenAI15.0%Yandex4.3%Maria0.3%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Claude Opus 4.8 | 100.0% | 3.3% | 96.7% | 100.0% | 100.0% | 100.0% | 86.7% | 100.0% | 100.0% | 90.0% | 87.7% |
| Claude Sonnet 4.6 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 80.0% | 100.0% | 100.0% | 100.0% | 100.0% | 98.0% |
| Claude Haiku 4.5 | 100.0% | 100.0% | 100.0% | 100.0% | 96.7% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 99.7% |
Cross-Vendor Confusions
DeepSeek7.7%
OpenAI0.7%
OpenAI2.0%
OpenAI0.3%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GPT-5.5 | 3.3% | 86.7% | 96.7% | 26.7% | 100.0% | 40.0% | 86.7% | 73.3% | 90.0% | 56.7% | 66.0% |
| GPT-5.5 Instant | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| GPT 5.3 Codex | 93.3% | 23.3% | 56.7% | 76.7% | 100.0% | 13.3% | 100.0% | 93.3% | 100.0% | 76.7% | 73.3% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| DeepSeek V4 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| DeepSeek V4 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 96.7% | 100.0% | 99.7% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gemini 3.1 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Gemini 3.5 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ring 2.6 1T | 100.0% | 100.0% | 100.0% | 0.0% | 0.0% | 0.0% | 0.0% | 86.7% | 90.0% | 3.3% | 48.0% |
| Ling 2.6 1T | 100.0% | 100.0% | 100.0% | 0.0% | 0.0% | 0.0% | 0.0% | 70.0% | 100.0% | 40.0% | 51.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ERNIE 5.1 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Kat Coder Pro V2 | 100.0% | 100.0% | 100.0% | 100.0% | 83.3% | 60.0% | 100.0% | 90.0% | 93.3% | 96.7% | 92.3% |
Cross-Vendor Confusions
Qwen5.0%
DeepSeek0.7%
OpenAI0.7%Sberbank0.3%SberAI0.3%
Mistral0.3%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MiniMax 2.7 | 96.7% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 99.7% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Kimi 2.6 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Qwen3.7 Max | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Step 3.7 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 90.0% | 80.0% | 100.0% | 100.0% | 96.7% | 96.7% | 96.3% |
Cross-Vendor Confusions
Google1.7%
Qwen0.3%Stage South0.3%NAVER0.3%
Doubao0.3%G420.3%Sber0.3%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Hy3 Preview | 100.0% | 100.0% | 100.0% | 53.3% | 3.3% | 76.7% | 16.7% | 3.3% | 60.0% | 3.3% | 51.7% |
Cross-Vendor Confusions
DeepSeek10.3%
Anthropic8.3%NAVER2.0%Yandex2.0%
xAI0.7%
Google0.7%
Mistral0.7%
Doubao0.7%Lumima0.3%태양0.3%M31 Labs0.3%NAMI0.3%GeM0.3%CU0.3%
MiniMax0.3%NHN0.3%
Qwen0.3%
Meta0.3%Juanjo Aguilella0.3%Lexy0.3%Spike0.3%AILab0.3%Lëns0.3%GEO Intelligent Systems0.3%Kujato0.3%Windsurf0.3%Manus0.3%
Moonshot0.3%Replit0.3%
OpenAI0.3%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Grok 4.3 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mimo V2.5 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GLM 5.1 | 100.0% | 100.0% | 100.0% | 100.0% | 0.0% | 100.0% | 100.0% | 0.0% | 100.0% | 100.0% | 80.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

