Data Explorer
Per-model self-identification rates across languages, grouped by vendor. 85 cross-vendor confusions detected.
Per manufacturer: how often other models claim to be it (right) vs how often its own models claim to be someone else (left). Sorted by net — identity creditors at the top, debtors at the bottom. Tested manufacturers only.
DeepSeek is imitated most with a net imitation balance of 22; Tencent imitates others most with a net of -45.
DeepSeek
OpenAI
Qwen
Google
Xiaomi
Moonshot
xAI
inclusionAI
z-ai
ERNIE
MiniMax
Anthropic
StepFun
Kwai
Doubao
TencentThe most likely directed mistakes — when a manufacturer's models claim to be a specific other manufacturer. Bar length = how often (P across all answers for the source).
The strongest cross-vendor confusion is Tencent claiming to be DeepSeek 15.0% of the time.
Tencent→
DeepSeek
Tencent→Microsoft
Tencent→
Anthropic
Kwai→
Qwen
Doubao→
OpenAI
Anthropic→
DeepSeek
Tencent→Naver
StepFun→
Google
Doubao→Yandex
MiniMax→
Xiaomi
Tencent→Ha-an
Tencent→SK Telecom
Anthropic→
OpenAI
Tencent→XiaoHongShu
Tencent→SberSelf-ID and refusal rates per language, plus the most common cross-vendor confusion at that language.
| Language | Self-ID | Refused | Top confusion |
|---|---|---|---|
| ruРусский | 82.3% | 0.8% | Yandex12.5% |
| ko한국어 | 82.7% | 0.8% | Qwen50.0% |
| ja日本語 | 83.1% | 1.9% | Anthropic40.0% |
| frFrançais | 85.0% | 3.5% | Microsoft90.0% |
| ptPortuguês | 86.2% | 0.0% | DeepSeek90.0% |
| esEspañol | 87.7% | 1.2% | Microsoft30.0% |
| zh-Hans简体中文 | 92.7% | 0.4% | DeepSeek20.0% |
| enEnglish | 94.2% | 0.0% | Xiaomi10.0% |
| deDeutsch | 95.4% | 0.0% | OpenAI2.5% |
| zh-Hant繁體中文 | 98.5% | 0.0% | — |
How the same “Who are you?” question splits into correct self-ID vs. confusion vs. abstention — per language, worst self-ID first.
Self-identification is lowest in Русский at 82.3% and highest in 繁體中文 at 98.5%.
Per model, the span of self-ID rate across the 10 languages (worst-language • → best-language •). A wide span means the model's sense of identity depends heavily on the language it's asked in. Widest swing first.
gpt-5.3-codex swings most — from 0.0% self-ID in zh-Hans to 100.0% in ko.
The other failure mode — not answering wrong, but not answering: giving no identity (“unknown”) or refusing outright. Share of all answers.
inclusionAI abstains most, with 50.5% of answers giving no usable identity.
inclusionAI
OpenAI
z-ai
Tencent
Anthropic
Doubao
Google
xAI
Kwai
StepFun
Qwen
DeepSeek
MiniMax
Moonshot
Xiaomi
ERNIERollup per real vendor — model count, total answers, mean self-ID rate, and the most common cross-vendor confusion target.
| Vendor | Models | Answers | Self-ID | Top confusion |
|---|---|---|---|---|
Doubao | 4 | 400 | 94.5% | OpenAI3.3% |
OpenAI | 3 | 300 | 76.7% | — |
Anthropic | 3 | 300 | 95.0% | DeepSeek2.0% |
inclusionAI | 2 | 200 | 49.5% | — |
Google | 2 | 200 | 100.0% | — |
DeepSeek | 2 | 200 | 100.0% | — |
Tencent | 1 | 100 | 49.0% | DeepSeek15.0% |
z-ai | 1 | 100 | 80.0% | — |
Kwai | 1 | 100 | 91.0% | Qwen6.0% |
StepFun | 1 | 100 | 97.0% | Google2.0% |
MiniMax | 1 | 100 | 99.0% | Xiaomi1.0% |
xAI | 1 | 100 | 100.0% | — |
Qwen | 1 | 100 | 100.0% | — |
Moonshot | 1 | 100 | 100.0% | — |
Xiaomi | 1 | 100 | 100.0% | — |
ERNIE | 1 | 100 | 100.0% | — |
| Microsoft | 0 | 0 | 0.0% | — |
| Naver | 0 | 0 | 0.0% | — |
| Yandex | 0 | 0 | 0.0% | — |
| Ha-an | 0 | 0 | 0.0% | — |
| SK Telecom | 0 | 0 | 0.0% | — |
| XiaoHongShu | 0 | 0 | 0.0% | — |
| Sber | 0 | 0 | 0.0% | — |
| GlobalLab | 0 | 0 | 0.0% | — |
| SberDevices | 0 | 0 | 0.0% | — |
| Wisero | 0 | 0 | 0.0% | — |
| Carbonako | 0 | 0 | 0.0% | — |
| Khoa | 0 | 0 | 0.0% | — |
| LeiWu | 0 | 0 | 0.0% | — |

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Doubao Seed 2.0 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Mini | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Lite | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Doubao Seed 2.0 Code | 70.0% | 100.0% | 100.0% | 90.0% | 100.0% | 50.0% | 100.0% | 10.0% | 90.0% | 70.0% | 78.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly
✓ Always self-identifies correctly
OpenAI13.0%Yandex5.0%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Claude Opus 4.8 | 100.0% | 30.0% | 100.0% | 100.0% | 100.0% | 100.0% | 70.0% | 100.0% | 100.0% | 80.0% | 88.0% |
| Claude Sonnet 4.6 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 70.0% | 100.0% | 100.0% | 100.0% | 100.0% | 97.0% |
| Claude Haiku 4.5 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
DeepSeek6.0%
OpenAI3.0%✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GPT-5.5 | 20.0% | 80.0% | 100.0% | 0.0% | 100.0% | 80.0% | 100.0% | 50.0% | 90.0% | 70.0% | 69.0% |
| GPT-5.5 Instant | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| GPT 5.3 Codex | 70.0% | 0.0% | 60.0% | 20.0% | 100.0% | 0.0% | 100.0% | 90.0% | 100.0% | 70.0% | 61.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| DeepSeek V4 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| DeepSeek V4 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gemini 3.1 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
| Gemini 3.5 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ring 2.6 1T | 100.0% | 100.0% | 100.0% | 0.0% | 0.0% | 0.0% | 0.0% | 90.0% | 80.0% | 10.0% | 48.0% |
| Ling 2.6 1T | 100.0% | 100.0% | 100.0% | 0.0% | 0.0% | 0.0% | 0.0% | 70.0% | 100.0% | 40.0% | 51.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ERNIE 5.1 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Kat Coder Pro V2 | 100.0% | 100.0% | 100.0% | 100.0% | 50.0% | 80.0% | 100.0% | 100.0% | 90.0% | 90.0% | 91.0% |
Cross-Vendor Confusions
Qwen6.0%SberDevices1.0%
DeepSeek1.0%
OpenAI1.0%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MiniMax 2.7 | 90.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 99.0% |
Cross-Vendor Confusions
Xiaomi1.0%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Kimi 2.6 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Qwen3.7 Max | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Step 3.7 Flash | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 70.0% | 100.0% | 100.0% | 100.0% | 100.0% | 97.0% |
Cross-Vendor Confusions
Google2.0%GlobalLab1.0%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Hy3 Preview | 100.0% | 100.0% | 100.0% | 50.0% | 0.0% | 90.0% | 10.0% | 0.0% | 30.0% | 10.0% | 49.0% |
Cross-Vendor Confusions
DeepSeek15.0%Microsoft12.0%
Anthropic7.0%Naver2.0%Ha-an1.0%SK Telecom1.0%XiaoHongShu1.0%Sber1.0%Wisero1.0%Carbonako1.0%Khoa1.0%LeiWu1.0%
Moonshot1.0%
Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Grok 4.3 | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mimo V2.5 Pro | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% | 100.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

Self-Identification Rate
| Model | en | zh-Hans | zh-Hant | ja | ko | ru | es | fr | de | pt | Overall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GLM 5.1 | 100.0% | 100.0% | 100.0% | 100.0% | 0.0% | 100.0% | 100.0% | 0.0% | 100.0% | 100.0% | 80.0% |
Cross-Vendor Confusions
✓ Always self-identifies correctly

