Open Data Open Data

Open Data

Understanding what open data means and why it matters

Open Data

What is Open?

OPEN KNOWLEDGE IS ANY CONTENT, INFORMATION OR DATA THAT CAN BE USED, REUSED AND DISTRIBUTED WITHOUT ANY LEGAL, TECHNOLOGICAL OR SOCIAL RESTRICTION.

Open Data

Open data is data that can be freely used, reused and redistributed by anyone – subject to attribution to the creators and provided, in turn, under the same terms.

The full definition, found in the Open Definition, offers more information. Summarizing the most important points:

Key Principles

Availability and Accessibility

Data must be available as a whole, at a reasonable reproduction cost, and preferably downloadable via the Internet. It must also be available in a practically readable format.

Reuse and Redistribution

Data must be provided under terms that permit reuse and redistribution, including mixing with other datasets.

Universal Participation

Everyone must be able to use, reuse and redistribute the data. There must be no discrimination against fields of endeavor or against persons or groups.

The Term "Open"

We have defined the standard for truly free and open dissemination of information with the Open Definition.

What is Open Knowledge?

"Open Knowledge" is any content, information or data that people freely use, reuse and redistribute – without any legal, technological or social restriction.

Access

Free and Open access to content

Redistribute

Freedom to redistribute content

Reuse

Freedom to reuse content

No restrictions

No restrictions based on who someone is or their field of endeavor

Open Knowledge is what emerges from open data when it is useful, usable and used. Thus Open Knowledge empowers – it helps us trigger change and improve the world.

Interoperability

Interoperability

If one wonders why it is so important to be clear about what Open Data means and why this definition is useful, there is a simple answer: interoperability.

Interoperability denotes the ability of different systems to work together (interoperate). In this particular case, it refers to the ability to interoperate – or mix – different datasets.

Interoperability is important because it allows different components to work together. This ability to share and connect components is fundamental to building larger and more complex systems.

Without the ability to interoperate this becomes almost impossible – proof is the famous story of the Tower of Babel, where the inability to communicate (interoperate) led to the complete collapse of the construction effort.

Dataset A
+
Dataset B
=
Value

The core of the "common good" found in data (or code) lies in the fact that a piece of open material they contain can be mixed with other open material. This interoperability is the essential key to realizing the main practical advantages of the concept of "Open".

What data are we referring to?

Readers of this handbook have already seen several examples of different types of data that are or can become "Open" – and will see more examples below. However, it would be useful to make, at this point, a brief outline of the types of data that are or can be open – and, equally importantly, to mention which data cannot be "Open".

🔑

Key Point

When "opening" data, the focus must be only on non-personal data, that is, data that does not contain information about specific individuals.

Restrictions

  • Non-personal data: Data that does not contain information about specific individuals
  • National Security: For certain types of government data, national security restrictions may apply

Why do we need a clear definition?

A clear definition for the concept of "Open" guarantees that when someone has two open datasets from two different sources, they will be able to combine them and also guarantees that we avoid our own Tower of Babel: lots of data but limited or no ability to combine them into larger systems, where their true value is revealed.