Skip to content

Together AI

USA · together.ai · open
open model inferencefine-tuningdeveloper tools

Platform for running open models; also trains some custom variants. [openness: open-leaning, demoted to "open" for v1 schema].

PALS scores

Preservative dimensions

PALS composite
2.7
Mean of three dimensions, 1–10.
Completeness
4.0
Sources, limits, transparency.
Multiplicity
2.0
Epistemologies, languages, voices.
Responsibility
2.0
Accountability, refusal, governance.
Eight lenses

What's missing, by lens

Each lens carries a canonical question and corrects a specific epistemic failure. Score, findings, and gaps land once the audit runs.

Lens 01
Indigenous Knowledge
Whose knowledge is missing?
1/10
Findings (2)
  • No reference anywhere to Indigenous knowledge, data sovereignty, or the CARE Principles across homepage or About page.
  • The platform's framing of data is purely operational ('managed storage with zero egress fees'), treating data as a fungible logistics asset rather than as relationship or stewardship.
Gaps (4)
  • No acknowledgment of Indigenous data sovereignty or consent frameworks for training corpora served through inference.
  • No consultation with Indigenous communities mentioned for any of the open-weight models hosted.
  • No preservation of oral, ceremonial, or non-textual knowledge; the open-model marketplace inherits whatever extractive corpora upstream model-builders used, with Together adding no provenance layer.
  • Extractive data practice is structurally invisible: Together is an inference and fine-tuning intermediary, so it disclaims the data-sovereignty question by treating models as already-made artifacts.
Justification

Total absence. As an inference/hosting layer, Together AI never engages the question of whose embodied or relational knowledge is encoded in the weights it serves. The intermediary position actively launders the data-sovereignty question rather than addressing it. Floor score.

Lens 02
Deep History
What historical process produced this?
2/10
Findings (2)
  • Founding date (2022) and named researcher lineage (Tri Dao, Chris Re, Percy Liang, Ce Zhang) give some institutional history.
  • Reference to GPU cluster access ('from self-serve instant clusters to thousands of GPUs') gestures at compute political economy without naming it.
Gaps (4)
  • No acknowledgment of colonial data-extraction legacies or the labor history behind training corpora and RLHF annotation.
  • No transparency on the geopolitics of GPU access (export controls, NVIDIA dependency) despite NVIDIA being named as a partner.
  • No historical humility about AI's inheritances; the narrative is presentist and forward-leaning ('ship faster, scale reliably').
  • Regulatory constraints are unmentioned.
Justification

A thin institutional self-history exists, and the compute-supply framing brushes against political economy, but there is zero reckoning with the historical processes (extraction, annotation labor, hardware geopolitics) that shaped the stack. Slightly above floor only because dates, lineage and the GPU-supply mention provide minimal anchoring.

Lens 03
Cross-Cultural Wisdom
Which perspectives have been flattened?
2/10
Findings (2)
  • Hosting open-weight models from a global supplier set (MiniMax, Black Forest Labs, Cartesia) implies de facto multilingual and multi-origin model availability.
  • Open-source ecosystem framing in principle admits non-Western model contributors to the catalogue.
Gaps (4)
  • No explicit multilingual commitment, language coverage claims, or low-resource-language support beyond incidental catalogue presence.
  • No preservation of culturally specific reasoning patterns; models are presented as interchangeable performance commodities ('2x faster inference').
  • No consultation with cultural or linguistic scholars.
  • Western efficiency logic ('do more with less', 'superior unit economics') is presented as the universal frame of value.
Justification

Cross-cultural plurality, where it exists, is an accidental byproduct of being a multi-vendor marketplace, not a stated value. The dominant register is a culturally specific (Silicon Valley) efficiency-and-throughput logic presented as neutral. Low, with a small credit for catalogue diversity.

Lens 04
Scientific Evidence
What does the evidence show, and what are its limits?
6/10
Findings (4)
  • Strong, specific, verifiable research output named: FlashAttention series, Mamba-3, BASED, Hyena Hierarchy, speculative decoding.
  • Open-weight hosting and open-source contributions enable third-party verification of served models.
  • Named academic partnerships (Princeton, CMU, UC Berkeley) provide an external accountability surface.
  • Founders are publishing systems researchers, lending the technical claims real provenance.
Gaps (4)
  • Performance superlatives ('2x faster', '60% lower cost', '90% faster pre-training') are stated without linked benchmarks, baselines, or methodology on the page.
  • No mention of independent audits of training data or bias for hosted models.
  • No third-party replication protocols or known-limitation disclosures for the inference stack.
  • Open weights are emphasized for capability, not framed as a verification or accountability instrument.
Justification

Genuinely the strongest lens. The systems-research lineage is real, public, and citable, and open weights do permit verification. Capped at 6 because the headline numbers are unsourced marketing superlatives and there is no bias/limitation disclosure or independent-audit posture.

Lens 05
Artistic Perception
What does this feel like, not just mean?
3/10
Findings (2)
  • One affective register exists: 'discover the magic' and 'curiosity-driven discovery' acknowledge wonder and intuition as motivators.
  • Hosting generative-media labs (Black Forest Labs image models, Cartesia audio) places creative practice within the platform's ambit.
Gaps (3)
  • No space for ambiguity or poetic uncertainty; the dominant mode is throughput and optimization.
  • No recognition of the emotional labor of users, annotators, or creative communities.
  • Modes of attention beyond efficiency are absent; 'magic' is deployed instrumentally as developer delight, not as genuine affective inquiry.
Justification

A faint affective vocabulary ('magic', 'curiosity') and a generative-media catalogue earn it above the floor, but these are conscripted into a performance narrative rather than opening real space for feeling, ambiguity, or the felt texture of the work.

Lens 06
Future Modelling
Where is this heading, and for whom?
2/10
Findings (2)
  • 'Model Stewardship' and 'building everything with the purpose of benefiting society' assert a future-facing governance intent.
  • Efficiency framing ('do more with less') implies, but never states, an environmental-cost awareness.
Gaps (4)
  • No engagement with labor-displacement risk from the agentic and inference systems being scaled.
  • No environmental or energy-cost disclosure despite operating GPU fleets 'to thousands of GPUs'.
  • No democratic or participatory governance of the agentic systems Together helps deploy; governance is asserted as a corporate value, not a shared process.
  • 'Benefiting society' is unoperationalized, with no mechanism, metric, or accountability attached.
Justification

The stewardship and 'benefit society' language is the lab's clearest responsibility gesture, but it is entirely unoperationalized, and the two material future-shaping questions for an inference provider, energy footprint and labor displacement, are simply absent. Low.

Lens 07
Marginalised Voices
Who is not at the table?
2/10
Findings (2)
  • 'Democratized AI access' and 'self-serve instant clusters' lower the cost-of-entry barrier, which can incidentally widen access for smaller and Global-South developers.
  • Open-source community framing nominally invites broad contribution.
Gaps (4)
  • No participatory design with Global South developers; access is transactional (pay-per-token), not participatory.
  • No disability-community accessibility commitment.
  • No labor-representative engagement (data annotators, content moderators in the upstream supply chain).
  • No compensated feedback channels; 'community' means open-source users, not represented stakeholders.
Justification

Cost-democratization is real and not nothing, but it conflates cheaper market access with inclusion. None of the named marginalised constituencies, disabled users, annotation labor, Global-South co-designers, has a seat or a compensated channel. Low.

Lens 08
Trickster Knowledge
What truth appears when the story is inverted?
2/10
Findings (2)
  • The catalogue model creates one self-aware-adjacent tension Together could name but does not: it brands itself on 'openness' and 'responsibility' while functioning as a neutral hosting rail that disclaims responsibility for what the weights it serves actually contain.
  • 'Together' as a name against a product whose core promise is 'eliminating infrastructure management', i.e. removing humans from the loop, is an unremarked irony.
Gaps (4)
  • No willingness to name the central contradiction of an 'open and responsible' intermediary that adds no provenance, safety, or data-rights layer to the models it monetizes.
  • No irony, satire, or paradox deployed as instrument; the register is uniformly earnest and promotional.
  • The lab's own seriousness ('discover the magic', 'benefiting society') is treated as exempt from audit.
  • No space where the official narrative is tested by its opposite.
Justification

Polished, frictionless, and entirely un-self-interrogating. The richest available inversion, an 'open/responsible' brand that is structurally a responsibility-disclaiming pass-through, is left untouched. Near floor; a point above only because the contradiction is so legible it half-surfaces itself.

Suffixscape

Linguistic diagnostics

Regex- and LLM-detected patterns of evasion in the lab's own prose: nominalised evasion, agency diffusion, epistemic inflation, temporal flatness. Distinct from the CognioNews -scape editorial format — see methodology.

Pattern Quote Effect Preservative alternative
epistemic inflation "2x faster inference powered by cutting-edge research" Unsourced comparative superlatives ('2x faster', 'cutting-edge') assert quantified superiority with no baseline, benchmark, or methodology, converting a contestable empirical claim into settled fact. State the baseline, workload, and measurement: 'On [model/workload], median latency was X ms vs Y ms for [named baseline]; methodology and benchmark at [link].'
epistemic inflation "60% lower cost with workload-specific optimization" A precise-sounding figure ('60%') floats free of its comparison class, lending false rigor to a marketing claim. 'Compared with [specific alternative configuration], customers in [segment] saw up to 60% lower cost; results vary by workload, see assumptions at [link].'
nominalised evasion "building everything with the purpose of benefiting society" 'Benefiting society' nominalises an actor-and-mechanism question (who decides what benefit means, by what process, accountable to whom) into a vague abstract good, hiding that no mechanism exists. Name the actor and process: 'We commit to [specific governance body / published policy] to assess societal impact, reviewed by [whom] on [cadence].'
agency diffusion "serverless inference eliminating infrastructure management" An inanimate process ('inference') is the grammatical agent that 'eliminates' work, erasing the human labor relocated or displaced rather than abolished. 'Our managed service handles infrastructure operations so your team does not, the operational work is performed by Together engineers and automated systems.'
temporal flatness "ship faster, scale reliably and achieve superior unit economics" A frictionless forward arc that erases contingency, the regulatory, environmental, hardware-supply, and labor constraints that actually shape whether and how one can scale. Acknowledge the contingencies: 'Scaling depends on GPU supply, energy availability, and regulatory context; here is how we navigate each.'
Audit history

Prior audits

Latest audit: 2026-06-08 · sources: https://together.ai, https://together.ai/about

Transparency

Raw data

Every audit is published as machine-readable JSON. You can read this lab's latest report at /stancewatch/api/labs/together-ai.json — it carries the per-lens findings, evidence quotes, Suffixscape flags, PALS scores, the sources actually read, and a confidence note.

Found an error, or a stance page we missed? We audit public communications only — point us to the page and the next audit will read it. Write to hello@cognioengine.co.uk.

Audit date: 2026-06-08

Moderate confidence. Two real pages were fetched (homepage and About), giving genuine quotes for the strongest and weakest lenses; however the marketing surface is thin on responsibility content, and dedicated trust/safety, data, or governance pages were not located, so absence-of-evidence on several lenses may partly reflect un-audited subpages rather than true silence. Qualitative judgment; not a validated metric.

Auditor: GoldBerry v1.3 / StanceWatch v1.0