Skip to content

Zhipu AI

China · zhipu.ai · hybrid
GLM architecturemultimodalChinese NLPenterprise

GLM-Edge, GLM-130B; strong academic ties (Tsinghua).

PALS scores

Preservative dimensions

PALS composite
4.0
Mean of three dimensions, 1–10.
Completeness
5.0
Sources, limits, transparency.
Multiplicity
4.0
Epistemologies, languages, voices.
Responsibility
3.0
Accountability, refusal, governance.
Eight lenses

What's missing, by lens

Each lens carries a canonical question and corrects a specific epistemic failure. Score, findings, and gaps land once the audit runs.

Lens 01
Indigenous Knowledge
Whose knowledge is missing?
1/10
Findings (2)
  • No reference anywhere in the audited public-facing material to Indigenous knowledge, data sovereignty, or the CARE Principles.
  • The mission framing ('Inspiring AGI to Benefit Humanity') treats 'humanity' as an undifferentiated universal, with no recognition of distinct peoples or relational knowledge systems.
Gaps (3)
  • Zero acknowledgment of Indigenous data sovereignty or community consent for data inclusion.
  • No mention of oral, non-textual, or place-based knowledge traditions — including those of minority and ethnic-minority peoples within China itself.
  • No stated guardrails against extractive scraping of community-held cultural corpora.
Justification

The lens question — whose embodied, relational knowledge is missing — is wholly unanswered. There is no indication the category exists in Zhipu's public vocabulary. Lowest score: total absence, not partial engagement.

Lens 02
Deep History
What historical process produced this?
2/10
Findings (2)
  • Implicit positioning as a 'homegrown' / independent Chinese GLM architecture gestures at a geopolitical-sovereignty narrative (self-host, no API lock-in, MIT licensing on GLM-4.6).
  • Open-source posture is framed partly as a response to compute/access constraints in the China context.
Gaps (3)
  • No explicit acknowledgment of colonial or extractive data legacies that shape large-corpus AI.
  • GPU-access constraints (export controls), labour conditions of data annotation, and regulatory context are not transparently discussed in public-facing copy.
  • No historical humility about what the GLM lineage inherits from prior Western-dominated NLP pipelines and benchmarks.
Justification

A faint geopolitical-economy thread exists via the self-hosting/sovereignty pitch, but it is marketing, not historical reckoning. Colonial data legacies, labour, and export-control context are absent. Marginally above the floor.

Lens 03
Cross-Cultural Wisdom
Which perspectives have been flattened?
4/10
Findings (2)
  • Genuine bilingual (Chinese/English) interface and 'universal translation' (通用翻译) capability — Chinese NLP is a core focus, so non-Anglophone reasoning is materially represented, not just tokenised.
  • GLM lineage centres Chinese-language understanding as a first-class object, a meaningful counterweight to Anglocentric default models.
Gaps (3)
  • Multilingualism is presented as a feature (translation), not as preservation of culturally specific reasoning patterns.
  • No consultation with cultural scholars or stated method for avoiding flattening of minority languages within and beyond China.
  • No acknowledgment that 'universal translation' itself encodes a translation-away of culturally specific meaning.
Justification

Scores meaningfully above peers because Chinese-first NLP genuinely de-centres the Anglophone default. But the engagement is capability-led (translate, support) rather than wisdom-preserving, and minority/non-dominant cultures are unaddressed. Mid-range.

Lens 04
Scientific Evidence
What does the evidence show, and what are its limits?
6/10
Findings (3)
  • Open weights for flagship models (GLM-4.5, GLM-4.6 at 355B under MIT) genuinely enable third-party verification, replication, and independent bias inspection — the strongest evidentiary commitment in this audit.
  • Self-host capability means external researchers can probe the actual model, not just a hosted endpoint.
  • Public statement that output reliability 'cannot be guaranteed' and models are 'susceptible to being misled by user input' — a real limitation disclosure.
Gaps (3)
  • No published independent third-party audits of training data or bias surfaced in the audited material.
  • Training-data composition and provenance are not transparently documented.
  • Limitation disclosures are generic stochastic-model caveats, not model-card-level quantified evaluations.
Justification

Open weights under a permissive licence are a substantive, verifiable enabler of the scientific-evidence lens — this is where Zhipu scores best. Capped at 6 because openness of weights is not matched by openness of data provenance, independent audits, or quantified eval disclosure.

Lens 05
Artistic Perception
What does this feel like, not just mean?
2/10
Findings (1)
  • Multimodal/visual-understanding (GLM-4.6V) touches the aesthetic surface but is framed strictly as a comprehension capability.
Gaps (3)
  • No acknowledgment of affective, intuitive, or poetic dimensions of intelligence.
  • No space for ambiguity or uncertainty beyond a reliability disclaimer.
  • Efficiency and capability dominate the register; emotional labour and non-instrumental modes of attention are absent.
Justification

The public voice is wholly instrumental — accuracy, traceability, context length. Nothing addresses how the technology feels or holds ambiguity. Near the floor.

Lens 06
Future Modelling
Where is this heading, and for whom?
3/10
Findings (2)
  • Explicit AGI trajectory ('Inspiring AGI to Benefit Humanity', 'human-level AI' push) — a stated future-orientation.
  • Open-source pledge framed as widening who can build the future, including post-IPO.
Gaps (3)
  • No engagement with labour-displacement risk from agentic systems (which Zhipu actively ships — CogAgent, GLM agentic models).
  • No environmental or energy-cost disclosure for training/serving frontier-scale models.
  • AGI is asserted as a benefit to 'humanity' with no democratic-deliberation or inclusive-governance mechanism named.
Justification

A confident, benefit-assuming AGI narrative with no treatment of the displacement, environmental, or governance externalities its own agentic products generate. The openness pledge nudges it above the floor; absence of risk engagement holds it low.

Lens 07
Marginalised Voices
Who is not at the table?
2/10
Findings (1)
  • Open weights + MIT licensing lower the cost of access for under-resourced developers, including potential Global South self-hosting — an indirect, structural inclusion.
Gaps (3)
  • No participatory design, disability/accessibility commitments, or labour-representative engagement in the audited material.
  • No compensated feedback channels or community-governance structures named.
  • Data-annotation and content-moderation labour — disproportionately precarious — is invisible.
Justification

Permissive open weights confer a real but accidental accessibility benefit. No intentional engagement with any marginalised constituency at the table. Just above floor for the structural openness.

Lens 08
Trickster Knowledge
What truth appears when the story is inverted?
1/10
Findings (1)
  • The public communications are uniformly solemn and promotional; no irony, self-interrogation, or named contradiction.
Gaps (3)
  • The unexamined contradiction — pursuing AGI 'to benefit humanity' while operating inside a tightly governed content-control regime and shipping autonomous agents — is never surfaced.
  • The 'trustworthy / 可信赖 AI' and 'traceable' claims are presented as settled, never tested against their own opposite (who defines trust, traceable to whom).
  • No space where the official narrative is allowed to be contradicted.
Justification

Zero structural inversion. The lab's seriousness is treated as exempt from audit. The richest available irony — AGI-for-humanity inside a sovereignty/control frame — goes entirely unnamed. Floor.

Suffixscape

Linguistic diagnostics

Regex- and LLM-detected patterns of evasion in the lab's own prose: nominalised evasion, agency diffusion, epistemic inflation, temporal flatness. Distinct from the CognioNews -scape editorial format — see methodology.

Pattern Quote Effect Preservative alternative
nominalised evasion "实时、精准、可溯源的内容 (real-time, accurate, traceable content)" 'Traceable' is nominalised into a property of the content, hiding the actor who traces, the authority traced to, and the governance regime doing the tracing. Accountability is dissolved into a feature label. State who can trace what, to whom, and under what legal/governance authority — e.g. 'we log source attributions that users and regulators can inspect via X.'
epistemic inflation "Inspiring AGI to Benefit Humanity" Couples an unverified superlative claim (AGI) with an assumed-benefit universal ('Humanity'), pre-deciding the contested question of whether and for whom AGI benefits. Specify the concrete capabilities pursued and name the trade-offs and constituencies — 'building large models for [uses], with these known risks to [whom].'
epistemic inflation "可信赖 AI (trustworthy AI)" 'Trustworthy' is asserted as an attribute rather than demonstrated, inflating a normative claim past what open weights alone evidence. Replace the adjective with verifiable commitments — published evals, independent audit results, and disclosed limitations users can check.
agency diffusion "each version of ChatGLM has been designed with ethical and safe usage in mind" Passive construction ('has been designed') removes the agent and the method — who designed for safety, against which threat model, evaluated how — leaving a reassurance with no accountable subject. 'Our safety team evaluated GLM-X against [named risks] using [method]; results and residual risks are at [link].'
Audit history

Prior audits

Latest audit: 2026-06-08 · sources: https://www.zhipuai.cn, https://chat.z.ai/

Transparency

Raw data

Every audit is published as machine-readable JSON. You can read this lab's latest report at /stancewatch/api/labs/zhipu-ai.json — it carries the per-lens findings, evidence quotes, Suffixscape flags, PALS scores, the sources actually read, and a confidence note.

Found an error, or a stance page we missed? We audit public communications only — point us to the page and the next audit will read it. Write to hello@cognioengine.co.uk.

Audit date: 2026-06-08

Moderate confidence. Two pages were successfully read (zhipuai.cn homepage and chat.z.ai), both lighter on governance prose than typical lab sites; zhipu.ai itself failed (expired certificate) and z.ai redirected to the chat product. Findings are corroborated by public reporting (SCMP, Computerworld, Turing Post) but no dedicated responsibility/safety/governance page was located in the audited set, so absence-based scores reflect what is publicly surfaced, not a guarantee none exists internally. Qualitative judgment; not a validated metric.

Auditor: GoldBerry v1.3 / StanceWatch v1.0