Graph · Message

Have I Been Trained

01 · In focus

One message, in the field.

The structured facts the source records about Have I Been Trained, the count of declared adjacencies in the corpus, and the federation map zoomed on this node and its neighbours.

message

↑1 declared connection

Kind: Message
Status: active
Confidence: high
Entity ID: msg-have-i-been-trained
Network: View in network

Tags us-based, chapel-hill, north-carolina, berlin, creative-industry, visual-artist, illustrator, professional-community, sector-response, generative-ai, training-data, dataset, laion-5b, opt-out, opt-in, consent, do-not-train, registry, data-dignity, consensual-ai, framing, tool, slogan, artist-organizing, stability-ai, spawning-ai, source-plus, public-diffusion, pd12m, gdpr, eu-ai-act, andersen-v-stability-ai, holly-herndon, mat-dryhurst, jordan-meyer

Have I Been Trained · 1 direct neighbour visible

02 · Connections

1 adjacency, by relation.

Split by direction. Direct links are the ones Have I Been Trained’s source record names; inferred backlinks are records elsewhere in the corpus that point at this entity.

Direct from this record

1 link

Links named in this entity's structured fields.

1 link

Consent, Credit, Compensation (and Transparency) Message

03 · Background

From the source record.

Body prose as it appears in movement-graph’s published markdown for this entity. Links to other corpus entities resolve to their graph page; links to deeper repo paths are kept as text so the page does not invent a route.

Have I Been Trained is the question-as-framing of the visual-artist consent-and-data-dignity register inside the 2022–2024 U.S. and European pushback on generative-AI training-data ingestion. The phrase, fixed by the tool of the same name launched in mid-September 2022 by the Spawning AI team of Jordan Meyer, Mat Dryhurst, and Holly Herndon, names the harm of unconsented training-data ingestion at the level of the individual artist's dignity rather than at the level of the bargaining-table demand: the question itself makes the harm legible — am I in there? — and the Do Not Train registry the tool surfaces instantiates the default the question presupposes, no, until I say so. The framing runs in parallel with, and is the visual-artist data-dignity sister of, the legal-pleading Consent, Credit, Compensation register that the Andersen v. Stability AI class-action complaint and Karla Ortiz's Senate Judiciary testimony anchor.

The tool and the registry

The Have I Been Trained search interface lets any user type a query, paste a URL, or upload an image, and search the LAION-5B image-text dataset — the open ~5.85-billion-image scrape that anchors Stable Diffusion and most other widely deployed open generative-image models — for matches against their own work or likeness. Matched images can then be added to a Do Not Train registry, which Spawning maintains as a portable opt-out list any downstream model-trainer can ingest before its next training run. The tool's first headline find, a patient discovering private medical photographs of herself inside LAION-5B within days of launch, fixed the dignity-harm argument behind the framing: that the question "have I been trained?" is not abstract, because the answer for many people is yes-without-knowledge-or-consent on artefacts ranging from copyrighted artwork to private medical records.

The Stability AI opt-out proof point — December 2022

The framing's working credibility as a campaign — and the registry's existence as a mechanism rather than a posture — was fixed by the December 2022 agreement under which Stability AI committed to honour the Do Not Train registry ahead of training the next iteration of Stable Diffusion. The VentureBeat write-up carried Stability AI's commitment alongside the working two-week opt-out window the partnership opened. By 7 March 2023 Spawning reported that the registry had collected opt-outs on roughly 78 million artworks ahead of the Stable Diffusion 3 training run — a figure independently carried by The Decoder and broken down as approximately 40,000 individual artworks opted out via Have I Been Trained itself plus tens of millions more via partner platforms (the ArtStation NoAI tag, Shutterstock's default opt-out for non-contributing content).

The artistic-practice register and the data-dignity argument

Spawning's working framing of the registry as "a creator consent layer for AI" anchors on an artistic-practice register that distinguishes the Have I Been Trained framing from the parallel U.S. legal-pleading register of Consent, Credit, Compensation. Holly Herndon and Mat Dryhurst — Berlin-based musicians whose prior practice included the Holly+ vocal-deepfake consent project — have framed the company's working posture as pro-AI-and-pro-consent: the position that the right response to the unconsented-ingestion question is not to halt model development but to insist that training is a relationship a creator can decline, accept, or shape. That stance puts the Have I Been Trained framing at a distance from both the SAG-AFTRA existential-threat register (which anchors on displacement of creative work) and the Concept Art Association–led Consent-Credit-Compensation legal register (which anchors on a copyright cause of action) — same wave, distinct argumentative shape.

The contested edge — opt-out as default

The framing's strongest published critique is structural rather than rhetorical. The 3 May 2023 TechCrunch seed-round write-up carried the working European-creator position that opt-out — even backed by a registry as well-built as Spawning's — fails the GDPR actively-given-consent standard and so cannot be the right legal baseline for an EU-applicable training-data regime. The European Guild for Artificial Intelligence Regulation (EGAIR), the organised European creators' coalition, has carried the structurally adjacent position that the underlying harm is the inability to deny consent at all and that opt-in by default — not opt-out by default — is the correct regulatory and ethical baseline. The two positions are working sisters of each other inside the same training-data-consent debate, with Have I Been Trained operating as the working artist-side instantiation of the opt-out posture and the EGAIR / EU AI Act push operating as the opt-in counterweight.

Carrying the framing into consensual training — Source.Plus and Public Diffusion

The framing's working continuation since 2024 has run through Spawning's own move from opt-out (defensive) to opt-in / consensual curation (constructive). The 11 June 2024 Source.Plus launch put roughly 40 million public-domain and Creative-Commons-zero images behind a curation tool that pays creators a tenth-of-a-penny per training download, and the parallel Public Diffusion / PD12M project trained an image generator only on public-domain and CC0 content as proof-of-concept that the data-dignity framing is buildable rather than only refusable. The argumentative arc those two projects close is the through-line of the Have I Been Trained framing as a whole: the question that opened the campaign in September 2022 — have I been trained? — is followed by the registry that lets a creator answer no, and then by the curation tool and the model that show what training looks like when the answer is yes-with-consent.

04 · Sources

Where this came from.

15 sources listed from the pinned corpus. Links are shown only when the source URL is a valid HTTP(S) address.

15 sources linked out
5 body mentions linked into the corpus
0 references kept as text

petapixel.com
Checked 2026-05-19

URL

https://petapixel.com/2022/09/19/you-can-now-check-if-your-photos-were-used-to-train-ai-image-generators/

PetaPixel, 19 September 2022 — earliest dated mainstream piece on the tool's launch, fixing mid-September 2022 as the public-record launch date and naming Spawning AI as the tool's maker
techcrunch.com
Checked 2026-05-19

URL

https://techcrunch.com/2022/09/21/who-fed-the-ai/

TechCrunch, 21 September 2022 — early mainstream-tech-press launch coverage; describes the 5.85-billion-image LAION-5B searchable index and names co-founders Jordan Meyer and Mat Dryhurst
petapixel.com
Checked 2026-05-19

URL

https://petapixel.com/2022/09/26/shocked-artist-finds-private-medical-photos-in-ai-training-data-set/

PetaPixel, 26 September 2022 — primary-source record of the tool's first headline-grabbing find (private medical photographs surfaced inside LAION-5B by a patient using the search interface), the case that fixed the dignity-harm argument behind the framing
technologyreview.com
Checked 2026-05-19

URL

https://www.technologyreview.com/2022/12/16/1065247/artists-can-now-opt-out-of-the-next-version-of-stable-diffusion/

MIT Technology Review, 16 December 2022 — primary mainstream-press source for the Stability AI / Spawning agreement under which the Do Not Train registry would be honoured ahead of Stable Diffusion 3 training, and for the artist-side opt-out window opened ahead of that training run
venturebeat.com
Checked 2026-05-19

URL

https://venturebeat.com/ai/stability-ai-to-honor-artist-opt-out-requests-for-stable-diffusion-3

VentureBeat, 16 December 2022 — parallel mainstream-tech-press coverage of the Stability AI opt-out commitment, with Stability AI public statement on the partnership
spawning.substack.com
Checked 2026-05-19

URL

https://spawning.substack.com/p/spawning-opts-out-78-million-artworks

Spawning blog, 7 March 2023 — primary source for the headline 78-million-artworks opt-out figure announced for the Stable Diffusion 3 training window, breaking down the figure as roughly 40,000 individual artworks via Have I Been Trained plus tens of millions more via partner platforms (ArtStation NoAI tag, Shutterstock default opt-out)
the-decoder.com
Checked 2026-05-19

URL

https://the-decoder.com/artists-remove-80-million-images-from-stable-diffusion-3-training-data/

The Decoder coverage of the opt-out roll-up — independent secondary confirmation of the ~80-million-images figure, useful for the cross-check and for the 3-percent-of-LAION-5B scale caveat that contextualises the headline number
techcrunch.com
Checked 2026-05-19

URL

https://techcrunch.com/2023/05/03/spawning-lays-out-its-plans-for-letting-creators-opt-out-of-generative-ai-training/

TechCrunch, 3 May 2023 — primary source on Spawning AI's 3-million-USD seed round (True Ventures lead), on the company's working framing of itself as the consent layer for AI, and on the strongest published critique of the opt-out model (that opt-out fails the GDPR actively-given consent standard); the seed-round piece anchors the legal-founder framing as Meyer + Dryhurst
spawning.substack.com
Checked 2026-05-19

URL

https://spawning.substack.com/p/our-seed-round-of-funding-to-build

Spawning blog seed-round announcement — primary source for the company's self-description as a creator consent layer for AI and for the trio framing (Jordan Meyer, Mat Dryhurst, Holly Herndon) Spawning uses for itself in art-press and primary materials, alongside the legal-cap-table Meyer-and-Dryhurst framing TechCrunch uses
haveibeentrained.com
Checked 2026-05-19

URL

https://haveibeentrained.com/about

Have I Been Trained own About page — primary-source description of the tool's scope, dataset coverage, and opt-out mechanism, and primary anchor for the registry's working public framing
techcrunch.com
Checked 2026-05-19

URL

https://techcrunch.com/2024/06/11/spawning-wants-to-build-more-ethical-ai-training-data-sets/

TechCrunch, 11 June 2024 — primary mainstream-tech-press source for Source.Plus, Spawning's ~40-million-image public-domain and CC0 curation tool, and for Jordan Meyer's working framing of the move from opt-out (defensive) to opt-in / consensual curation (constructive) as the same data-dignity argument carried into building
spawning.substack.com
Checked 2026-05-19

URL

https://spawning.substack.com/p/announcing-sourceplus-a-curation

Spawning blog Source.Plus launch — primary source for the working framing of the move from opt-out to opt-in curation as the constructive continuation of the consent layer, and for the tenth-of-a-penny per-download fee that operationalises compensation inside the registry's working model
spawning.substack.com
Checked 2026-05-19

URL

https://spawning.substack.com/p/a-fireside-chat-with-the-creators

Spawning blog interview with the Public Diffusion team — primary source for the PD12M dataset and the Public Diffusion image model trained only on public-domain and CC0 content, the working endpoint of the Have-I-Been-Trained framing carried into model-building (consensually trained generative AI as proof-of-concept)
artbasel.com
Checked 2026-05-19

URL

https://www.artbasel.com/stories/ai-holly-herndon-mat-dryhurst-data-training-art-making

Art Basel long-form interview with Holly Herndon and Mat Dryhurst — primary source for the artistic-practice register the framing carries (pro-AI, pro-consent, data-dignity-as-aesthetic-stance) that distinguishes it from the legal-cause-of-action register of the parallel CAA / Andersen v. Stability AI consent-credit-compensation framing
egair.eu
Checked 2026-05-19

URL

https://www.egair.eu/

European Guild for Artificial Intelligence Regulation (EGAIR) — institutional counterweight position, organised European creators arguing that the underlying problem is the structural inability to deny consent at all and that opt-in by default is the correct baseline; useful for naming the framing's contested edge without rebutting it

Source: entities/messages/msg-have-i-been-trained.md — movement-graph pin 5d136ad.