Think and Save the World

How International Open-Data Initiatives Create Shared Knowledge Commons

· 6 min read

The Architecture of Open Knowledge

The Human Genome Project as precedent. The Bermuda Principles (1996) established that all human genome sequence data should be released into public databases within 24 hours of generation. This was a deliberate choice against the prevailing trend of patenting genetic sequences. Craig Venter's competing private effort, Celera Genomics, planned to keep its data proprietary. The public consortium's decision to release freely meant that when the genome was completed, it belonged to everyone.

The downstream effects have been enormous. Open access to genomic data accelerated research across thousands of laboratories worldwide, enabled the development of diagnostic tools and therapies, and made possible the rapid sequencing of SARS-CoV-2 in January 2020 — which in turn enabled vaccine development within months. If the genome had been locked behind proprietary licenses, the COVID-19 vaccine timeline would have been measured in years, not months.

The open data charter. The International Open Data Charter, adopted by over 80 governments and organizations, establishes six principles: open by default, timely and comprehensive, accessible and usable, comparable and interoperable, for improved governance, and for inclusive development and innovation. These principles represent a shift in the default assumption about public information: from "closed unless there's a reason to open" to "open unless there's a reason to close."

Creative Commons and open licensing. The legal infrastructure for open data was built largely by the Creative Commons organization, which created a suite of standardized licenses allowing creators to specify how their work can be reused. CC0 (public domain dedication) and CC-BY (attribution only) licenses now apply to billions of works — images, texts, datasets, educational materials. This legal framework solved a critical problem: how to give away knowledge in a legal system designed to enclose it.

---

Major Open Data Ecosystems

Government open data. Over 100 countries have launched national open data portals. The US (data.gov), UK (data.gov.uk), India (data.gov.in), Kenya (opendata.go.ke), and others publish datasets ranging from budget expenditures to health statistics to transportation data. The quality and comprehensiveness vary enormously, but the directional commitment is significant.

Use cases demonstrate the value: journalists using open budget data to track corruption, civic technologists building tools that help citizens navigate government services, researchers combining datasets across countries to study patterns that no single national dataset reveals.

Scientific open access. The open access movement in academic publishing has grown from radical fringe to institutional mandate. Plan S (2018), backed by major European research funders, requires that all publicly funded research be published in open access journals. The US Office of Science and Technology Policy (2022) mandated that all federally funded research be freely accessible by 2025. Similar mandates exist in India, China, and across Latin America.

The Open Science movement goes further, advocating for open data, open methods, open peer review, and open educational resources. The premise: science is a public good, and knowledge produced with public funds should be publicly available.

Earth observation data. The Copernicus program (European Union) provides free, open access to satellite imagery and environmental monitoring data covering the entire planet. NASA's Landsat program has done the same since 2008, when it made its entire archive — decades of Earth imagery — freely available. The impact on climate science, agricultural monitoring, disaster response, and urban planning has been transformative.

A concrete example: Deforestation monitoring. Brazil's PRODES and DETER systems use open satellite data to track Amazon deforestation in near-real-time. When deforestation rates spiked under Bolsonaro's government and the government attempted to suppress the data, the fact that the underlying satellite imagery was openly available from international sources meant that independent researchers could verify the actual rates. Open data served as a check on state power.

Health data commons. The Global Health Observatory (WHO), the Institute for Health Metrics and Evaluation (IHME), and numerous disease-specific databases provide open access to health data that enables cross-country comparisons, epidemiological research, and health system benchmarking. During COVID-19, open data sharing — including real-time case reporting, genomic sequence sharing through GISAID, and open-access preprints — was essential to the global research response.

---

The Tensions and Failures

Data colonialism. Open data is not automatically equitable. When wealthy institutions extract data from lower-income countries — genomic samples, health data, agricultural data — and use it to develop proprietary products without benefit-sharing, open data becomes a mechanism of extraction rather than commons-building. The Nagoya Protocol (2010) attempted to establish benefit-sharing principles for genetic resources, but enforcement is weak and evasion common.

Couldry and Mejias' concept of "data colonialism" argues that the contemporary extraction of human behavioral data by tech platforms replicates the logic of historical colonialism — claiming shared resources as raw material for private accumulation. Open data advocates must grapple with the distinction between genuine commons and extractive openness.

Privacy and surveillance. Open government data can become a surveillance tool. India's Aadhaar system, which creates a biometric identity database for over a billion people, has been simultaneously celebrated as a tool for efficient service delivery and criticized as a surveillance infrastructure. Open data principles must be balanced against privacy rights — the data that governments publish must not enable the identification, tracking, or targeting of individuals.

The capacity gap. Open data is only useful if you can access and analyze it. A satellite dataset released in a specialized format, requiring expensive software and technical expertise to interpret, is technically open but practically enclosed. The open data movement increasingly recognizes that access without capacity is not genuine openness. Investment in data literacy, analytical tools, and local capacity is essential.

Corporate open-washing. Some corporations use open data rhetoric to obscure proprietary practices. Publishing selected datasets while keeping the most commercially valuable data locked is a common strategy. "Open APIs" that provide access to data only within terms controlled by the platform are open in form but not in substance.

---

What a True Knowledge Commons Would Look Like

A genuine knowledge commons — not just open data but open knowledge at planetary scale — would require several things that don't yet fully exist:

Universal digital access. A commons that 2.6 billion people can't reach isn't universal. (See concept law_1_373.)

Multilingual infrastructure. Knowledge published only in English is accessible to roughly 1.5 billion people — 20% of humanity. True open knowledge requires translation, multilingual interfaces, and support for oral knowledge traditions.

Governance structures. A commons without governance is vulnerable to enclosure and exploitation. Models like Wikipedia's community governance, Creative Commons' legal framework, and CERN's open data policies provide templates, but planetary-scale knowledge governance remains nascent.

Benefit-sharing mechanisms. When open data generates commercial value, some portion of that value should flow back to the communities that produced the data. This principle is established in international law for genetic resources but barely implemented for digital data.

Indigenous data sovereignty. The CARE Principles for Indigenous Data Governance (Collective benefit, Authority to control, Responsibility, Ethics) establish that Indigenous communities have the right to govern data about themselves, their lands, and their knowledge systems. This is not a contradiction to open data — it's a correction. Openness without consent is extraction.

---

Exercises

1. The Access Test. Pick a piece of knowledge that matters to your work or community. Try to access it without paying. If you can, trace what made that possible — what policy, what license, what organization. If you can't, ask: who benefits from this knowledge being enclosed? Who is excluded?

2. The Data Literacy Check. Go to your national government's open data portal (if it exists). Find a dataset relevant to your life — crime statistics, air quality, school performance, public spending. Try to understand it. Notice how easy or hard it is. The gap between open data and usable knowledge is where capacity investment matters.

3. The Commons Contribution. Identify one piece of knowledge you hold — a skill, a dataset, a local observation — that could be shared openly. Find a platform to share it: Wikipedia, OpenStreetMap, a community knowledge base, a Creative Commons-licensed blog. The commons grows when people put things in, not just take things out.

4. The Enclosure Inventory. List three types of knowledge that are currently enclosed (paywalled, classified, proprietary) that you believe should be open. For each, identify who benefits from the enclosure and who is harmed by it. What would need to change for the enclosure to end?

---

Key Sources and Further Reading

- National Research Council, Bits of Power: Issues in Global Access to Scientific Data (National Academies Press, 1997) - Wilbanks, J. and Boyle, J., "Introduction to Science Commons," Science Commons (2006) - Couldry, N. and Mejias, U., The Costs of Connection: How Data is Colonizing Human Life and Appropriating It for Capitalism (Stanford University Press, 2019) - Carroll, S.R., et al., "The CARE Principles for Indigenous Data Governance," Data Science Journal 19 (2020) - Open Data Charter, principles and implementation guidelines - European Commission, Copernicus program documentation and data access portals

Cite this:

Comments

·

Sign in to join the conversation.

Be the first to share how this landed.