xAI — StanceWatch

PALS scores

Preservative dimensions

PALS composite

1.7

Mean of three dimensions, 1–10.

Completeness

2.0

Sources, limits, transparency.

Multiplicity

2.0

Epistemologies, languages, voices.

Responsibility

1.0

Accountability, refusal, governance.

Eight lenses

What's missing, by lens

Each lens carries a canonical question and corrects a specific epistemic failure. Score, findings, and gaps land once the audit runs.

Lens 01

Indigenous Knowledge

Whose knowledge is missing?

1/10

Findings (2)

No mention of Indigenous data sovereignty, CARE Principles, or community consultation anywhere in audited public copy.
Mission framed as universal ('understand the universe', 'extend what humanity can know') with no acknowledgment of whose knowledge is included or excluded.

Gaps (3)

Zero reference to Indigenous communities, oral traditions, or non-textual knowledge.
No data provenance or sovereignty commitments; training data ('the world's largest supercluster', 'real-time search') is positioned as scale, not stewardship.
Extractive framing implicit in 'trained on the world's largest supercluster' with no consent or attribution language.

Justification

Total absence. The universalist 'understand the universe' framing actively erases the question of whose embodied, relational knowledge is represented. No CARE/data-sovereignty surface area at all.

Lens 02

Deep History

What historical process produced this?

1/10

Findings (2)

Timeline 'From founding to frontier models' (2023-2026) presents an unbroken upward arc of product milestones.
Colossus is celebrated as '200K GPUs. Built in 122 days.' — compute history is acknowledged only as a speed/scale triumph.

Gaps (3)

No acknowledgment of colonial data-extraction legacies, labor (data annotation, content moderation), or geopolitical economy of GPU/energy access.
Memphis Colossus siting — which drew documented community concerns over gas-turbine emissions — is presented purely as an engineering feat with no environmental or community history.
No regulatory or constraint transparency; history is frictionless.

Justification

History is rendered as a pure progress narrative. The one heavy historical/material fact present (a 200K-GPU supercluster in 122 days) is the strongest candidate for historical humility and instead becomes a boast. No inheritances named.

Lens 03

Cross-Cultural Wisdom

Which perspectives have been flattened?

2/10

Findings (3)

Offices listed across borders (Palo Alto, Seattle, Memphis, London) — all US/UK, Anglophone.
Page language is 'en'; product copy is English-only.
Voice/multimodal capability is advertised but framed by benchmark ('#1 on Big Bench Audio'), not by linguistic or cultural coverage.

Gaps (3)

No multilingual commitment, no list of supported languages, no low-resource language work.
No consultation with cultural scholars; no recognition of culturally specific reasoning patterns.
'Reasoning from first principles... rather than consensus' encodes a single (Western-rationalist) epistemology as the house style and treats it as universal.

Justification

Slightly above floor only because multimodal/voice reach is gestured at. But the entire footprint is Anglo-American, the copy is monolingual, and 'first principles over consensus' explicitly elevates one epistemic tradition over plural ways of knowing.

Lens 04

Scientific Evidence

What does the evidence show, and what are its limits?

3/10

Findings (3)

Benchmark claim disclosed with a named benchmark: 'Voice API #1 on Big Bench Audio.'
Operational metrics published (1M+ API calls/day, <200ms median latency).
Brand invokes 'logic and evidence' and 'fundamental truths'.

Gaps (3)

Closed weights (openness_level: closed) — no open-weight verification, no third-party replication protocol, no model/system cards in audited copy.
No independent audits of training data or bias; no known-limitation disclosures.
Mission rhetoric ('truth-seeking', 'fundamental truths') is not backed by any visible eval transparency, red-team report, or failure disclosure — the very Grok output controversies (documented antisemitic/'MechaHitler' outputs, manipulated-system-prompt incidents) are absent from the public scientific record presented here.

Justification

One named benchmark and live ops metrics lift this off the floor. But for a lab branding itself on truth, the evidentiary surface is marketing-grade: cherry-picked leaderboard wins, no limitations, no open weights, no independent audit, and no reckoning with documented model-behaviour failures. Evidence is asserted, not exposed.

Lens 05

Artistic Perception

What does this feel like, not just mean?

3/10

Findings (2)

Generative media (Imagine image-to-video, voice) is foregrounded, implying an aesthetic/affective product surface.
Marketing prose itself is crafted ('Frontier AI models for everything you build', kinetic timeline).

Gaps (3)

No acknowledgment of affective or intuitive dimensions of the technology; art is treated as an output modality, not a mode of attention.
No space for ambiguity or poetic uncertainty — copy is declarative and confident throughout.
No recognition of emotional labor or the human-feeling cost of generative media (deepfakes, likeness).

Justification

Generative-media products give a thin presence, but art appears only as a capability to be sold. There is no reflective register, no ambiguity, and no acknowledgment of the felt/affective stakes of synthetic image and voice.

Lens 06

Future Modelling

Where is this heading, and for whom?

2/10

Findings (2)

Forward-looking mission stated: 'Accelerate human scientific discovery' / 'understand the universe'.
Scale of agentic capability acknowledged (autonomous coding agent demo: 'Migrate auth from sessions to JWT', parallel running tasks).

Gaps (3)

No engagement with labor-displacement risk despite shipping autonomous coding/agent products.
No environmental-cost disclosure despite a 200K-GPU supercluster — energy/water/emissions entirely absent.
No democratic governance of agentic systems, no inclusive deliberation; 'Move quickly and fix things' explicitly prioritizes speed over precaution.

Justification

Whose futures? The audited copy answers only 'developers and builders'. A lab shipping autonomous agents on a massive supercluster discloses no labor, environmental, or governance future-stakes, and adopts a 'move quickly and fix things' ethic that defers harm to after the fact.

Lens 07

Marginalised Voices

Who is not at the table?

1/10

Findings (2)

Audience is consistently 'developers' and 'your team' — paying API customers.
Enterprise tier offers 'data residency options' and 'audit logging' (governance for customers, not communities).

Gaps (3)

No participatory design with Global South developers; offices are US/UK only.
No disability/accessibility commitment; no labor-representative engagement; no compensated community feedback channel.
Data annotation and moderation labor — central to model training — is invisible.

Justification

The only 'voices' served are paying customers. No one outside the commercial transaction — affected communities, annotation labor, disabled users, Global South developers — appears anywhere in the audited material.

Lens 08

Trickster Knowledge

What truth appears when the story is inverted?

1/10

Findings (2)

Self-presentation is uniformly solemn and triumphant: 'No goal is too ambitious', 'pushing the limits of what's possible'.
Brand markets 'truth' and 'first principles' but applies no inversion or self-audit to its own claims.

Gaps (3)

No willingness to name the central contradiction: a 'truth-seeking / maximally truthful' lab whose flagship model has produced documented, high-profile false and hateful outputs (e.g. 'MechaHitler'/antisemitic generations, system-prompt manipulation episodes). The official copy smooths this over completely.
No irony, no paradox-as-instrument, no space where the narrative is tested by its opposite.
The lab's own seriousness is treated as exempt from audit — the exact posture the trickster lens exists to puncture.

Justification

Floor score. The branding ('truth-seeking', 'maximally truthful') is precisely the polished consensus the trickster lens probes, and the public copy permits no self-contradiction, no irony, and no acknowledgment of the documented gap between the truth claim and Grok's recorded failures. Solemnity is total and exempt.

Suffixscape

Linguistic diagnostics

Regex- and LLM-detected patterns of evasion in the lab's own prose: nominalised evasion, agency diffusion, epistemic inflation, temporal flatness. Distinct from the CognioNews -scape editorial format — see methodology.

Pattern	Quote	Effect	Preservative alternative
`epistemic inflation`	"Frontier AI models for everything you build."	'Frontier' is an unverified superlative that asserts category leadership without a referent; 'everything you build' inflates scope to limitlessness, foreclosing the question of what the models cannot or should not do.	Name the specific tasks and benchmarks the models lead on, and state where they underperform or should not be used.
`epistemic inflation`	"Trained on the world's largest supercluster."	Superlative scale is presented as a proxy for quality and legitimacy, substituting size for evidence of capability, safety, or data integrity.	Disclose what the scale buys (which evals improved) and its costs (energy, water, emissions, data sourcing).
`nominalised evasion`	"designed to extend what humanity can know and do"	The nominalised, agentless mission hides who designs, who decides what counts as knowing, and who benefits — 'humanity' stands in for a small set of actors and customers.	Name the actors and the governance: 'Our team designs these models; here is who we consult and who is accountable for their effects.'
`agency diffusion`	"Rapid development and iteration lets us innovate at breakneck speeds."	'Iteration' and 'development' become the acting subjects; the humans choosing to ship fast (and absorbing the downstream risk) are diffused out of the sentence.	'We choose to ship quickly and we own the harms that result; here is our remediation process when we get it wrong.'
`temporal flatness`	"Our path of progress — From founding to frontier models — every milestone on the way."	A clean linear arc erases contingency, controversy, and reversal (e.g. model-behaviour incidents), presenting history as inevitable forward motion.	Include the failures and course-corrections in the timeline, not only the launches.

Audit history

Prior audits

Latest audit: 2026-06-08 · sources: https://x.ai, https://x.ai/about

Transparency

Raw data

Every audit is published as machine-readable JSON. You can read this lab's latest report at /stancewatch/api/labs/xai.json — it carries the per-lens findings, evidence quotes, Suffixscape flags, PALS scores, the sources actually read, and a confidence note.

Found an error, or a stance page we missed? We audit public communications only — point us to the page and the next audit will read it. Write to hello@cognioengine.co.uk.

Audit date: 2026-06-08

Moderate confidence. Two xAI-owned pages scraped successfully (homepage and about/company); the requested grok-2 blog returned 403/404 and was not read, and no dedicated safety/governance page surfaced. Scores reflect what the public-facing marketing copy does and does not say, supplemented by publicly documented Grok controversies for the trickster/scientific/responsibility lenses. Qualitative judgment, not a validated metric.

Auditor: GoldBerry v1.3 / StanceWatch v1.0