Skip to content

Stability AI Research

UK · stability.ai/research · open
generative researchopen sciencemultimodalevaluation

Research arm of Stability AI; focuses on open generative models.

PALS scores

Preservative dimensions

PALS composite
3.7
Mean of three dimensions, 1–10.
Completeness
5.0
Sources, limits, transparency.
Multiplicity
2.0
Epistemologies, languages, voices.
Responsibility
4.0
Accountability, refusal, governance.
Eight lenses

What's missing, by lens

Each lens carries a canonical question and corrects a specific epistemic failure. Score, findings, and gaps land once the audit runs.

Lens 01
Indigenous Knowledge
Whose knowledge is missing?
2/10
Findings (2)
  • Data Integrity commitment screens and excludes illegal content from training data, implying some attention to provenance.
  • Open-weight release model in principle allows downstream communities to inspect and adapt models for their own contexts.
Gaps (4)
  • No mention of Indigenous data sovereignty or the CARE Principles (Collective benefit, Authority to control, Responsibility, Ethics).
  • No consultation with Indigenous communities referenced anywhere in research or safety posture.
  • No preservation of oral, non-textual, or relational knowledge forms; generative outputs are presented as universal creativity tools.
  • Web-scale training-data practices (the LAION-style provenance question central to Stability's history) are not addressed in terms of consent or extraction from culturally specific corpora.
Justification

Data integrity is framed solely as legality screening, not as relational consent or sovereignty. There is zero engagement with Indigenous epistemologies, CARE, or community authority over data. Score reflects near-total absence with only incidental provenance language.

Lens 02
Deep History
What historical process produced this?
3/10
Findings (2)
  • Open-source framing is implicitly positioned against 'proprietary gatekeeping', acknowledging a power dynamic in the AI field.
  • Safety posture references continuous vulnerability testing and expert collaboration, hinting at an evolving rather than static stance.
Gaps (4)
  • No acknowledgment of colonial or extractive data legacies, despite Stability's own well-documented history of contested web-scraped training corpora.
  • No transparency about geopolitical economy: GPU access, compute concentration, or the labour (including data-labelling labour) underpinning the models.
  • Regulatory constraints (e.g. ongoing copyright litigation, EU AI Act, UK posture as a UK-located lab) are not surfaced.
  • Linear progress narrative ('faster inference, higher quality') with no historical humility about prior controversies.
Justification

The page treats the lab's trajectory as a clean innovation story. Given Stability's actual contested history (litigation over training data, leadership turnover), the silence on those inheritances is a meaningful omission. Slightly above floor because anti-gatekeeping framing names one structural dynamic.

Lens 03
Cross-Cultural Wisdom
Which perspectives have been flattened?
2/10
Findings (2)
  • Multimodal breadth (Image, Video, Audio, 3D, Biology) gestures at diverse expressive registers.
  • Large community of 200,000+ members implies some geographic/cultural spread of contributors.
Gaps (4)
  • No multilingual support claims; generative models presented in an implicitly Anglophone, Western-creative frame.
  • No preservation of culturally specific reasoning patterns or aesthetic traditions.
  • No consultation with cultural scholars or non-Western knowledge holders referenced.
  • 'Human creativity' is treated as a universal monolith rather than plural traditions.
Justification

Modality breadth is conflated with epistemic breadth, but there is no language-level or culture-level pluralism. 'Human creativity' as a flat universal flattens the very differences this lens probes.

Lens 04
Scientific Evidence
What does the evidence show, and what are its limits?
6/10
Findings (4)
  • Open-weight release is an explicit verification mechanism: 'Models released openly for scrutiny and risk identification.'
  • Research blog publishes evaluation frameworks (e.g. cinematic evaluation, material decomposition), indicating methodological transparency.
  • Continuous vulnerability testing and dataset filtering are named as ongoing empirical practices.
  • Trust Center and public policy documentation provide a verifiable accountability surface.
Gaps (3)
  • No reference to independent third-party audits of training data or bias.
  • No formal replication protocols or model cards / known-limitation disclosures surfaced on these pages.
  • 'Safety-First Open Source' includes 'selective code restrictions', undercutting full reproducibility without explaining the trade-off criteria.
Justification

This is the lab's strongest lens. Open weights plus published evaluation work plus a Trust Center give genuine, checkable evidence. Held below 7 by the absence of independent audits, model cards, and the unexplained 'selective restrictions' that limit reproducibility.

Lens 05
Artistic Perception
What does this feel like, not just mean?
5/10
Findings (3)
  • The lab's entire raison d'être is creative generation, so affective/aesthetic dimensions are foregrounded by domain.
  • Multimodal output (audio, video, 3D) inherently engages felt, sensory registers beyond pure semantics.
  • C2PA watermarking acknowledges that AI-assisted creative work carries a distinct provenance that audiences may feel differently about.
Gaps (3)
  • Creativity is framed instrumentally ('empower creators... and enterprises'), privileging utility over poetic ambiguity.
  • No acknowledgment of the emotional labour of artists whose work trains the models.
  • No space for uncertainty or non-efficiency modes of attention; framing is capability-and-throughput oriented ('faster inference, higher quality').
Justification

Domain naturally scores mid-range because the product IS aesthetic. But the framing converts feeling into a feature and throughput metric; the emotional labour of source artists is unacknowledged. Mid score honestly placed.

Lens 06
Future Modelling
Where is this heading, and for whom?
3/10
Findings (3)
  • Child-safety partnerships (Thorn, All Tech Is Human) engage one concrete future-harm vector seriously.
  • C2PA / content-authenticity work anticipates a future information environment where AI/human distinction matters.
  • Banning AI misuse and appeals processes gesture at forward-looking harm governance.
Gaps (4)
  • No engagement with labour displacement risk to the creative workforce the lab serves.
  • No environmental / compute cost disclosures despite training large multimodal models.
  • No democratic or participatory governance of the models' downstream agentic uses; governance is board-and-policy, not deliberative.
  • Whose futures benefit (creators vs. 'enterprises') is asserted, not deliberated.
Justification

Real, specific future-harm work exists (CSAM, content authenticity), lifting this off the floor. But the two future risks most proximate to a generative-media lab — creative-labour displacement and environmental cost — are entirely absent, and governance is top-down rather than inclusive.

Lens 07
Marginalised Voices
Who is not at the table?
3/10
Findings (4)
  • Expert Collaboration commitment brings outside specialists into harm-prevention design.
  • Child-protection partnerships represent advocacy for a vulnerable group.
  • A 200,000+ member open community is in principle a low-barrier channel for global contributors.
  • Safety reporting and appeals processes offer a feedback surface (safety@stability.ai).
Gaps (4)
  • No participatory design with Global South developers named, despite open-source rhetoric.
  • No disability-community accessibility commitments.
  • No labour-representative engagement (e.g. artists' or data-workers' collectives) — notable given creative-sector tensions.
  • Feedback channels exist but there is no mention of compensation for community labour or feedback.
Justification

Expert and child-safety engagement is real but narrow and institutional. The constituencies this lens centres — Global South devs, disabled users, organised creative labour, compensated community contributors — are absent. Open community is a structural plus but unpriced and undifferentiated.

Lens 08
Trickster Knowledge
What truth appears when the story is inverted?
2/10
Findings (2)
  • The 'Safety-First Open Source' label embeds an unresolved tension (open yet 'selective code restrictions') that, if named, could be productive self-examination.
  • Publishing biology models alongside creative tools quietly invites questions the page never asks about dual-use.
Gaps (4)
  • No willingness to name the central contradiction of an 'open' lab that selectively restricts and litigates over training data.
  • No irony, paradox, or self-interrogation; the register is uniformly earnest and promotional.
  • The lab's own seriousness is treated as exempt from audit — no space where the official narrative is tested by its opposite.
  • 'Open' and 'safe' are presented as harmonised rather than in genuine, examined tension.
Justification

The official story is fully smoothed. The one live contradiction (openness vs. restriction/control vs. contested data) is asserted as a tidy 'balance' rather than surfaced as tension. No structural inversion or self-mockery. Near floor.

Suffixscape

Linguistic diagnostics

Regex- and LLM-detected patterns of evasion in the lab's own prose: nominalised evasion, agency diffusion, epistemic inflation, temporal flatness. Distinct from the CognioNews -scape editorial format — see methodology.

Pattern Quote Effect Preservative alternative
nominalised evasion "Removal of unsafe training data pre-deployment" The nominalisation 'Removal' deletes the actor and the method: who decides what is 'unsafe', by what criteria, and with what audit trail? The harm-screening becomes an accomplished fact rather than a contestable, ongoing human judgment. Name the agent and process: 'Our trust-and-safety team filters training data against published criteria X, Y, Z before release, and reports filtering coverage in each model card.'
agency diffusion "Models released openly for scrutiny and risk identification" Passive construction ('Models released') hides who releases, who scrutinises, and who bears the risk when identification fails — diffusing responsibility from the lab onto an unnamed collective. 'We release model weights so that external researchers, named partners, and affected communities can scrutinise them; we commit to acting on findings within a stated timeframe.'
epistemic inflation "unlocking the power of open-source generative AI to expand human creativity" 'Unlocking the power' and 'expand human creativity' are unverifiable superlatives that assert a universal benefit while skipping the question of whose creativity, whose data, and at whose cost. 'We build open generative models that some creators and developers use to extend specific workflows; we are studying where they help and where they displace or harm existing creative practice.'
temporal flatness "Recent work addresses both capability (faster inference, higher quality) and responsibility" A flat progress narrative that erases the lab's contested history (training-data litigation, governance turmoil) and presents responsibility as smoothly co-arriving with capability rather than hard-won and contingent. 'Following earlier disputes over training-data provenance, we changed X and Y; current work continues to wrestle with trade-offs between capability gains and unresolved consent questions.'
Audit history

Prior audits

Latest audit: 2026-06-08 · sources: https://stability.ai/research, https://stability.ai/safety

Transparency

Raw data

Every audit is published as machine-readable JSON. You can read this lab's latest report at /stancewatch/api/labs/stability-research.json — it carries the per-lens findings, evidence quotes, Suffixscape flags, PALS scores, the sources actually read, and a confidence note.

Found an error, or a stance page we missed? We audit public communications only — point us to the page and the next audit will read it. Write to hello@cognioengine.co.uk.

Audit date: 2026-06-08

Qualitative judgment; not a validated metric. Based on two successfully fetched pages (stability.ai/research and stability.ai/safety) plus public knowledge of Stability AI's research/open-science posture. The research arm shares infrastructure with the commercial entity, so some safety/governance language audited here is corporate-level; research-specific provenance, audit, and limitation disclosures were not found on the pages read and may exist elsewhere (e.g. individual paper/model-card pages) not captured in this audit. Moderate confidence.

Auditor: GoldBerry v1.3 / StanceWatch v1.0