Skip to content
Make AI Good

Graph · Message

Indigenous data sovereignty

01 · In focus

One message, in the field.

The structured facts the source records about Indigenous data sovereignty, the count of declared adjacencies in the corpus, and the federation map zoomed on this node and its neighbours.

message

4 declared connections

Kind
Message
Status
active
Confidence
high
Entity ID
msg-indigenous-data-sovereignty
Network
View in network

Tags oceania, pacific, aotearoa, new-zealand, canada, australia, united-states, global, indigenous, maori, first-nations, aboriginal, care-principles, gida, ocap, framing, data-governance, self-determination, community-governance, ai-governance, data-rights, language-revitalisation, kaitiakitanga, te-mana-raraunga, fnigc, decolonial, be-fair-and-care, undrip

Indigenous data sovereignty · 4 direct neighbours visible

02 · Connections

4 adjacencies, by relation.

Split by direction. Direct links are the ones Indigenous data sovereignty’s source record names; inferred backlinks are records elsewhere in the corpus that point at this entity.

Direct from this record

4 links

Links named in this entity's structured fields.

03 · Background

From the source record.

Body prose as it appears in movement-graph’s published markdown for this entity. Links to other corpus entities resolve to their graph page; links to deeper repo paths are kept as text so the page does not invent a route.

Indigenous data sovereignty (IDS) is the movement framing through which indigenous peoples globally assert their inherent right to govern data generated from and about their communities, territories, languages, and knowledge systems — and, increasingly, AI systems trained on that data. The framing's working premise is that data sovereignty flows from political sovereignty: nations and peoples who hold inherent rights to self-determination also hold rights over information about themselves, and no data-collection, analysis, or training process performed on indigenous peoples' data is legitimate without their ongoing governance of that process. Distinct from the adjacent data colonialism framing that Latin American civil society carries — which diagnoses AI extraction as a structural harm — and from the algorithmic colonialism framing through which African civil society names the imposition of alien AI systems on African populations — IDS is as much a governance-design demand as a critique. It does not primarily ask that data systems harm indigenous peoples less; it asks who holds authority over those systems in the first place, and answers that indigenous peoples themselves must own, control, access, and possess the data that flows from within their communities.

Origins: from OCAP® to an international agenda

The framing's institutional precursor is OCAP® — Ownership, Control, Access, Possession — the governance framework developed in 1998 by the National Steering Committee of the First Nations and Inuit Regional Longitudinal Health Survey, a body of Canadian First Nations leadership that framed data governance as inseparable from nation-to-nation relationships. The four principles asserted that First Nations alone control data collection in their communities, own how information is stored and shared, determine who may access their data, and physically possess it. The First Nations Information Governance Centre (FNIGC), formed in April 2010 as a stand-alone non-profit from that Steering Committee lineage, trademarked OCAP® and built a national certification and training apparatus around the framework. OCAP® is explicitly a framework for First Nations in Canada; the movement's convergence into a shared international agenda came a decade and a half later.

The international academic codification of IDS as a field came with the 2016 ANU Press volume Indigenous Data Sovereignty: Toward an Agenda, edited by Tahu Kukutai (then a Professor of Demography at the University of Waikato, later a founding member of Te Mana Raraunga) and John Taylor of the Australian National University. The book drew on a 2015 Canberra workshop with contributors primarily from CANZUS states — Canada, Australia, New Zealand, United States — and grounded IDS as a rights claim under the United Nations Declaration on the Rights of Indigenous Peoples (UNDRIP), arguing that indigenous peoples hold inherent and inalienable rights relating to the collection, ownership, and application of data about their members, territories, and lifeways, and that no open-data or open-science convention overrides those prior rights. The 2016 volume established the CANZUS network as the framing's early institutional home and set the UNDRIP foundation on which GIDA and the CARE Principles would subsequently build.

The CARE Principles

The globally-propagated codification of the framing is the CARE Principles for Indigenous Data Governance, developed at an International Data Week workshop held on 8 November 2018 in Gaborone, Botswana, co-led by Stephanie Russo Carroll (University of Arizona) and Maui Hudson (University of Waikato, Aotearoa New Zealand), with thirteen additional contributors drawn from the nascent international IDS network. Formally released by GIDA on 6 September 2019 and peer-reviewed published in the Data Science Journal in November 2020, the principles — Collective Benefit, Authority to Control, Responsibility, Ethics — address a structural gap in the open-data movement: where FAIR (Findable, Accessible, Interoperable, Reusable) governs data as an object to be shared and reused, CARE governs data as something generated from and by communities who retain relational and governance rights over it regardless of where the data travels. The GIDA "#BeFAIRandCARE" initiative positioned the two frameworks as complementary rather than competing, with CARE adding people-and-purpose orientation to FAIR's object-and-property orientation. Collective Benefit requires data ecosystems to enable indigenous nations and communities to derive benefit from the data; Authority to Control establishes that indigenous peoples hold rights to govern their data, including data held by external entities about them; Responsibility requires those working with indigenous data to share how data are used and foster respectful relationships; and Ethics requires indigenous peoples' rights and wellbeing to be the primary concern at all stages of the data life cycle.

GIDA, the international body that released the CARE Principles, was itself founded at a workshop in Oñati, Spain on 11–12 July 2019, convened by Maggie Walter and Desi Rodriguez-Lonebear, with founding members from the Maiam nayri Wingara Collective (Australia), Te Mana Raraunga (Aotearoa New Zealand), and the United States Indigenous Data Sovereignty Network (USIDSN). GIDA takes its name from the Basque word for "guide." Each founding member represented a national IDS network that had developed the framing independently before converging under a global coordinating structure.

National networks

Four national networks form the institutional spine of the movement. In Canada, the First Nations Information Governance Centre (FNIGC) holds OCAP® as a standing framework, producing training, certification, and a growing body of guidance applying First Nations data-governance principles to health surveys, census records, and — increasingly — AI systems. In Aotearoa New Zealand, Te Mana Raraunga (the Māori Data Sovereignty Network) is the Pacific anchor of the movement; the idea emerged at a July 2015 workshop in Australia, the inaugural hui was held at Hopuhopu on 19 October 2015, and the charter was ratified at Papakura Marae on 5 April 2016, framing Māori data as a taonga — a treasured resource requiring culturally grounded governance — and developing the Te Kāhui Raraunga Māori Data Governance Model for use across the Aotearoa New Zealand public service. In Australia, Maiam nayri Wingara (the Aboriginal and Torres Strait Islander Data Sovereignty Collective), named in Gooniyandi for "to hold" and in Barrngala for "to shine," is a GIDA founding member and frames IDS as both a governance tool and a mechanism for self-determination aligned with UNDRIP. In the United States, the USIDSN, based at the Native Nations Institute at the University of Arizona, supports tribal nations in developing data-governance frameworks at the tribal level, addressing census data, health data, and increasingly AI training data.

AI-specific applications: the Kaitiakitanga License and the indigenous-AI movement

The framing's most prominent applied form in AI governance is the Kaitiakitanga License developed by Te Hiku Media, the Māori iwi broadcaster based in Kaitaia, Aotearoa New Zealand, that built the first te reo Māori speech-recognition AI under indigenous stewardship terms. The Kaitiakitanga License grounds the data corpus Te Hiku Media curated as a stewardship relationship rather than transferable intellectual property: data is not owned but cared for under kaitiakitanga, and any benefit derived from it flows back to the source community. The licence grants access only to organisations that respect Māori values, remain within the bounds of consent given by speakers, and return benefits to the Māori people, and explicitly prohibits use of the tools for discrimination, surveillance, or tracking. Chief Technology Officer Keoni Mahelona has stated the political stake as "Data is the last frontier of colonization"; Peter-Lucas Jones, Chief Executive, has warned that absent indigenous governance structures, Māori-language data would be used by "the very people that beat that language out of our mouths to sell it back to us as a service". The Kaitiakitanga License operationalises the IDS framing not as a policy statement for state institutions to adopt but as a community-controlled licensing instrument that an indigenous organisation deploys directly over its own AI assets, making it the working example most cited in the indigenous-AI literature.

Beyond Te Hiku Media, the framing has entered AI governance through the Indigenous Protocols for Artificial Intelligence (IP-AI) workshops — held first in Honolulu, Hawai'i in 2019 — which established a community of practice around the question of what it means for indigenous communities to be the makers of AI rather than only its consumers, bringing together indigenous AI practitioners including Peter-Lucas Jones. The Collaboratory for Indigenous Data Governance at the University of Arizona led the development of the IEEE 2890-2025 standard for provenance documentation of Indigenous Peoples' data — establishing a technical specification that any AI pipeline incorporating indigenous data must document that data's provenance and governance lineage. The WAMPUM Lab (University of Waterloo) and the Collaboratory jointly submitted a CARE Statement for Indigenous Data Sovereignty as a formal input to the UN Global Digital Compact negotiations, arguing that any multilateral AI governance framework must incorporate indigenous peoples' authority to control data about themselves as a matter of UNDRIP compliance.

Why it differs from adjacent framings

Three distinctions separate indigenous data sovereignty from the corpus's adjacent decolonial-AI framings. Where data colonialism names the structural asymmetry of accumulation between jurisdictions — a critique whose political demand is regulatory and redistributive — and algorithmic colonialism names the corporate mechanism through which that asymmetry plays out on African populations — a critique whose demand is resistance and redress — IDS names a specific governance authority: the right of sovereign peoples to govern the data apparatus from within, on their own terms. The framing does not ask that data systems harm indigenous peoples less; it asks that indigenous peoples control the systems that handle their data. The Kaitiakitanga License and the CARE Principles are governance instruments, not protest documents.

The IDS framing also differs from feminist AI — which emphasises inclusive design and intersectional accountability within the existing AI-development ecosystem — in that IDS rests on a foundation of sovereignty rather than inclusion. Indigenous peoples hold not merely inclusion rights within a shared data commons but property rights, governance authority, and relational accountability over data that flows from within their communities — claims grounded in UNDRIP, in treaty rights, and in prior, continuing relationships between peoples and their knowledge systems that no open-data convention overrides.

The third distinction is geographic and political: where data colonialism and algorithmic colonialism primarily name harms inflicted by Global North actors on Global South populations, IDS operates across multiple jurisdictions simultaneously — within Canada, Australia, Aotearoa New Zealand, and the United States as much as across the Global South — because the structural position of indigenous peoples within settler-colonial states creates IDS claims inside the very jurisdictions that anchor the AI industry. The CARE Principles and OCAP® are operative inside, not only against, the jurisdictions that build the world's largest AI systems.

04 · Sources

Where this came from.

9 sources listed from the pinned corpus. Links are shown only when the source URL is a valid HTTP(S) address.

  1. fnigc.ca

    Checked 2026-05-28

    First Nations Information Governance Centre (FNIGC) OCAP® training page — primary source for the OCAP® principles first developed in 1998 by the National Steering Committee of the First Nations and Inuit Regional Longitudinal Health Survey as the Canadian First Nations data-governance framework asserting community control and possession of data about First Nations peoples

  2. openresearch-repository.anu.edu.au

    Checked 2026-05-28

    ANU Open Research Repository entry for Tahu Kukutai and John Taylor (eds.), *Indigenous Data Sovereignty: Toward an Agenda* (ANU Press, 2016) — primary source for the framing's first international book-length codification, drawn from a 2015 Canberra workshop, focused on CANZUS states, grounding IDS as a rights claim under UNDRIP, and arguing that indigenous peoples hold inherent and inalienable rights relating to the collection, ownership, and application of data about their members, territories, and lifeways

  3. datascience.codata.org

    Checked 2026-05-28

    Stephanie Russo Carroll et al., "The CARE Principles for Indigenous Data Governance," Data Science Journal, vol. 19, 4 November 2020 — peer-reviewed primary source for the CARE Principles; the authors' framing that indigenous worldviews have centered "people and purpose through governance processes that emphasise collective ownership and control of data"; and the design intent to address the gap where the open-data movement "does not fully engage with Indigenous Peoples rights and interests"

  4. gida-global.org

    Checked 2026-05-28

    GIDA CARE Principles page — primary source for the 6 September 2019 formal release of the four principles (Collective Benefit, Authority to Control, Responsibility, Ethics), their design as people- and purpose-oriented guidance complementing FAIR under the "#BeFAIRandCARE" initiative, and the Gaborone origin at International Data Week on 8 November 2018

  5. gida-global.org

    Checked 2026-05-28

    GIDA Who We Are page — primary source for the founding of the Global Indigenous Data Alliance at a workshop in Oñati, Spain on 11–12 July 2019, convened by Maggie Walter and Desi Rodriguez-Lonebear, with founding members from Maiam nayri Wingara (Australia), Te Mana Raraunga (Aotearoa New Zealand), and USIDSN; and the name "GIDA" as the Basque word for "guide"

  6. maramatanga.ac.nz

    Checked 2026-05-28

    Ngā Pae o te Māramatanga news item on Te Mana Raraunga — primary source for the network's founding timeline: idea from a July 2015 workshop at the Academy of the Social Sciences in Australia; inaugural hui at Hopuhopu on 19 October 2015; charter ratified at Papakura Marae on 5 April 2016; and Tahu Kukutai as a founding member

  7. technologyreview.com

    Checked 2026-05-28

    MIT Technology Review feature (22 April 2022) on Te Hiku Media — primary source for the Kaitiakitanga License as the framing's principal working applied form in indigenous AI governance, for Keoni Mahelona's verbatim "Data is the last frontier of colonization", and for Peter-Lucas Jones's verbatim "the very people that beat that language out of our mouths to sell it back to us as a service"

  8. indigenous-ai.net

    Checked 2026-05-28

    Indigenous AI Working Group profile of Peter-Lucas Jones — primary source for the Jones framing "I don't just think about how AI can be used, I think about how we can be the makers of AI", for the Indigenous Protocols for Artificial Intelligence (IP-AI) workshops held in Hawai'i in 2019 as the inaugural international convening on indigenous self-determination within AI development, and for indigenous communities' positioning as AI producers rather than only consumers

  9. tehiku.nz

    Checked 2026-05-28

    Te Hiku Media blog post on the Kaitiakitanga License — primary source for the licence's grounding principle that data is cared for under kaitiakitanga rather than owned, and for the organisation's critique that "artificial intelligence in its current form is based on the wholesale appropriation of existing culture"

Source: entities/messages/msg-indigenous-data-sovereignty.md in movement-graph at pin 3cc1a36.