Skip to content

Stability AI

open generative modelsimage/videomultimodalresearch

Stable Diffusion series; strong open-source ethos despite business challenges. [openness: open-leaning, demoted to "open" for v1 schema].

PALS scores

Preservative dimensions

PALS composite
3.0
Mean of three dimensions, 1–10.
Completeness
4.0
Sources, limits, transparency.
Multiplicity
2.0
Epistemologies, languages, voices.
Responsibility
3.0
Accountability, refusal, governance.
Eight lenses

What's missing, by lens

Each lens carries a canonical question and corrects a specific epistemic failure. Score, findings, and gaps land once the audit runs.

Lens 01
Indigenous Knowledge
Whose knowledge is missing?
1/10
Findings (2)
  • No reference anywhere on the homepage or safety page to Indigenous data sovereignty, the CARE Principles, or consultation with Indigenous communities.
  • Data governance is framed entirely as 'screening to exclude illegal content', a legal-compliance frame rather than a relational or sovereignty frame.
Gaps (4)
  • No acknowledgement that open image/video/audio models trained on web-scraped corpora ingest Indigenous visual languages, sacred imagery, and cultural motifs without consent.
  • No CARE (Collective benefit, Authority to control, Responsibility, Ethics) commitments to balance the FAIR/open-data posture the company champions.
  • No mechanism for communities to withdraw or govern their visual heritage from training sets.
  • No recognition of non-textual, oral, or embodied knowledge that generative image models cannot represent and routinely flatten.
Justification

Total silence on Indigenous sovereignty combined with an open-by-default, web-scraped generative pipeline is the highest-risk profile for this lens. Open weights amplify rather than mitigate extraction of cultural imagery. Score floored at 1.

Lens 02
Deep History
What historical process produced this?
2/10
Findings (2)
  • The 'democratising' mission gestures at a power-redistribution narrative ('unlocking the power of open-source generative AI to expand human creativity').
  • Safety principles acknowledge that openness creates risk that must be assessed before distribution, implying some historical learning.
Gaps (4)
  • No acknowledgement of the colonial/extractive lineage of mass web-scraping, nor of the uncompensated labour (artists, photographers, annotators) the training pipeline depends on.
  • No mention of the GPU/compute geopolitics or capital structure that shaped the company (well-documented funding turbulence and leadership change are entirely absent).
  • The material ongoing litigation (Getty Images, and US artist class actions) over training data is nowhere acknowledged, despite being central to the company's history.
  • 'Democratisation' is asserted ahistorically, as if open release neutralises rather than redistributes existing power.
Justification

A self-congratulatory democratisation narrative with zero acknowledgement of its own contested origins (Getty/artist litigation, scraping legacy, funding history). Slightly above floor only because the safety page concedes openness carries risk.

Lens 03
Cross-Cultural Wisdom
Which perspectives have been flattened?
2/10
Findings (2)
  • Technology breadth (image, video, audio, 3D) implies broad applicability across cultural contexts.
  • Partnerships with global music/entertainment majors (UMG, Warner, EA) signal some cross-sector reach.
Gaps (4)
  • No mention of multilingual capability, non-English prompting, or language preservation; the audited surfaces are English-only and culturally unmarked.
  • Image/video models default to a Western, internet-aesthetic visual prior; no acknowledgement of this flattening or of culturally specific representational norms.
  • No consultation with cultural scholars or non-Western artists referenced.
  • C2PA/watermarking provenance is framed technically, not as a tool that could serve communities differently across contexts.
Justification

Cross-cultural concern is absent. Partnerships are corporate-Western. Generative image models silently universalise a particular visual culture and the company does not name this. Score 2.

Lens 04
Scientific Evidence
What does the evidence show, and what are its limits?
5/10
Findings (4)
  • Open weights are a genuine, material evidence-and-verification commitment: 'We release models openly to allow scrutiny, to identify risks.'
  • Stated continuous vulnerability/adversarial testing and expert collaboration for harm prevention.
  • Concrete provenance technique disclosed (C2PA standard adoption, watermarking).
  • Dataset filtering described (removal of unsafe/illegal images during training).
Gaps (4)
  • No published independent third-party audit of training data, bias, or safety filters is cited; claims are self-reported.
  • No quantified limitation disclosures (e.g., model cards with measured failure rates, demographic bias benchmarks) on the audited surfaces.
  • No replication protocol or evaluation methodology shared; 'careful screening' is unquantified.
  • The substantive scientific question raised by the Getty/artist litigation — what is actually in the training data and whether filtering works — is not evidenced.
Justification

Open weights genuinely enable external verification and lift this lens above mid. But filtering and safety claims are self-asserted, unquantified, and lack independent audit — especially material given live litigation about training-data contents. A defensible mid score of 5.

Lens 05
Artistic Perception
What does this feel like, not just mean?
3/10
Findings (2)
  • The mission centres creativity explicitly: 'expand human creativity' and 'empower creators'.
  • Multimodal (image/video/audio/3D) output is the core product, so the affective/aesthetic register is the company's domain.
Gaps (4)
  • Creativity is framed instrumentally — as capability to be 'unlocked' and 'empowered' — with no space for ambiguity, the felt texture of art-making, or the emotional labour of artists.
  • No acknowledgement of the artists whose uncompensated work trains the models, the very community most affected and currently litigating; their perception is the missing 'what this feels like'.
  • No recognition that automating image generation reshapes the affective and economic conditions of creative practice.
  • Tone is corporate/efficiency-led; ambiguity and poetic uncertainty are absent.
Justification

Creativity is invoked as a value proposition but treated as a feature to be delivered, not a felt human practice. The displaced/uncompensated artists — the actual locus of artistic perception — are erased. Score 3, slightly above floor for naming creativity at all.

Lens 06
Future Modelling
Where is this heading, and for whom?
3/10
Findings (3)
  • Pre-distribution safety impact assessment is described ('assessing safety impacts before wider distribution').
  • Child-safety future-harm engagement is concrete: partnership with Thorn.
  • Misuse prohibitions and continuous vulnerability testing show some forward risk posture.
Gaps (4)
  • No engagement whatsoever with labour displacement of creative workers — the most salient future-shaping consequence of generative image/video.
  • No environmental or compute/energy cost disclosure for training or inference.
  • No democratic or participatory governance of model release decisions; release is a company decision framed as a safety judgement.
  • Whose futures are shaped is never asked — futures are implicitly those of 'creators' and enterprise customers.
Justification

Concrete on child-safety and misuse, but silent on labour displacement and environmental cost — the two defining future externalities of this product class. Governance of release stays internal. Score 3.

Lens 07
Marginalised Voices
Who is not at the table?
2/10
Findings (2)
  • A misuse-reporting channel exists (safety@stability.ai) and an appeals process for policy decisions is mentioned.
  • Expert collaboration on harm prevention is referenced.
Gaps (4)
  • Artists — the marginalised group most materially affected and currently in litigation (Getty, US class actions) — have no participatory channel, no compensation mechanism, and no opt-out referenced; a reporting inbox is not representation.
  • No participatory design with Global South developers; no disability/accessibility commitments on audited surfaces.
  • No labour-representative engagement or compensated feedback channel.
  • 'Expert collaboration' centres credentialed specialists, not affected communities.
Justification

A misuse inbox is the thinnest possible form of voice. The communities whose work and likeness power the models — artists, annotators, Global South contributors — have no seat at the table and are instead positioned as litigants outside it. Score 2.

Lens 08
Trickster Knowledge
What truth appears when the story is inverted?
2/10
Findings (2)
  • The safety page concedes the core tension that openness enables both scrutiny and risk — a faint self-inversion ('We release models openly to allow scrutiny, to identify risks').
  • Some willingness to name that open release requires guardrails rather than being unambiguously good.
Gaps (4)
  • No naming of the company's central contradiction: a 'safety-first', 'data integrity' narrative running concurrently with active litigation alleging mass unconsented use of copyrighted and personal images — the official story is never tested against its loudest opposite.
  • No irony, no self-mockery, no acknowledgement that 'democratising creativity' may also mean automating the people it claims to empower.
  • The lab treats its own seriousness as exempt from audit; 'careful screening' and 'ethical standards' are asserted, not interrogated.
  • 'Banning AI Misuse' sits unironically beside a product whose primary contested misuse claim concerns its own training data.
Justification

The single most glaring contradiction — virtue language about data integrity alongside Getty/artist litigation over precisely that data — is smoothed over entirely. The official narrative is never allowed to meet its opposite. Faint structural concession on openness keeps it off the floor. Score 2.

Suffixscape

Linguistic diagnostics

Regex- and LLM-detected patterns of evasion in the lab's own prose: nominalised evasion, agency diffusion, epistemic inflation, temporal flatness. Distinct from the CognioNews -scape editorial format — see methodology.

Pattern Quote Effect Preservative alternative
nominalised evasion "Training data undergoes careful screening to exclude illegal content and maintain ethical standards." 'Careful screening' and 'ethical standards' are nominalised processes with no named actor, no method, and no measure. They hide who screens, against what criteria, and with what error rate — converting a contested empirical claim (currently in litigation) into a settled-sounding noun phrase. Name it: 'Our data team applies [specific filters] to remove [defined categories]; the filters were last audited by [whom] on [date] and miss approximately [X]% of [category]. Known gaps: [list].'
agency diffusion "Unsafe images are removed during training to prevent harmful generation." Passive construction ('are removed') erases the agent and the decision criteria — who defines 'unsafe', who removes, and whether removal is verified. Responsibility is diffused into a process that appears to run itself. 'We remove images we classify as unsafe using [classifier/criteria]; this decision is made by [team] and reviewed by [process]. We publish the removal criteria so others can contest them.'
epistemic inflation "world-class models that are accessible, adaptable, and designed to empower creators" 'World-class' is an unverified superlative; 'empower creators' is asserted while the most affected creators are litigating over uncompensated use of their work. The inflation papers over the gap between claim and contested reality. Replace with verifiable, scoped claims: 'Our models rank [position] on [named benchmark] as of [date]. They are used by [N] creators; we are still resolving how originating creators are credited and compensated.'
temporal flatness "unlocking the power of open-source generative AI to expand human creativity" Presents a frictionless forward arc — power 'unlocked', creativity 'expanded' — that erases the contingent, contested history (scraping, litigation, displaced labour) that produced the models, and the open future questions about whose creativity is expanded versus automated. 'Generative AI redistributes creative capability, with real tradeoffs we are still working through: the corpora that train our models include work made by people who did not consent, and we are actively negotiating what fair attribution and compensation look like.'
Audit history

Prior audits

Latest audit: 2026-06-08 · sources: https://stability.ai, https://stability.ai/safety

Transparency

Raw data

Every audit is published as machine-readable JSON. You can read this lab's latest report at /stancewatch/api/labs/stability-ai.json — it carries the per-lens findings, evidence quotes, Suffixscape flags, PALS scores, the sources actually read, and a confidence note.

Found an error, or a stance page we missed? We audit public communications only — point us to the page and the next audit will read it. Write to hello@cognioengine.co.uk.

Audit date: 2026-06-08

Moderate confidence. Two live sources were read (homepage and /safety); the specified stance_url returned HTTP 404, as did one alternative principles URL, so the safety page was substituted as the primary responsibility surface. Findings draw on those two surfaces plus public knowledge of Stability AI's open-weight model line and the ongoing Getty Images and US artist training-data litigation. Qualitative judgment, not a validated metric.

Auditor: GoldBerry v1.3 / StanceWatch v1.0