Using Open Data for projects
Our team relies on publicly available data to create charts and maps that we include in our stories. One of the most useful resources we’ve come across are Open Data portals that contain datasets published by different levels of government and non-governmental agencies like the World Bank, and OECD, and the United Nations. We’ve compiled a list of sources to help you find data for your next projects.
What is open data?
Open data is digital data that can be used freely by anyone, modified, and shared by anyone for any purpose, including commercial purposes. It must be available in a machine-readable format such as CSV, XML, JSON, and Shapefiles that can be accessed by computer applications to generate insights and analyses.
By making valuable datasets publicly available, organizations and governments — City of Toronto, The World Bank, and the OECD, etc. — are enabling people to explore and analyze data, leading to better decision-making and problem-solving. From weather reports to traffic data, crime statistics to socio-economic data, the volume of open data available is staggering.
Open data sources
Our team has compiled a list of sources; 54 sources categorized into 4 regions — Americas, Asia-Pacific, Europe, and Global — that provide free, publicly available datasets that you can use for personal or business purposes. Most of the datasets are available under the open data license. We’ve organized them by region and topics, and have included an overview of the file formats available for download on each website.
We’ll continue to add sources to the list, so check back often. If there is a source you’d recommend adding, please let us know in the comments section below.
-
CSV: Comma-separated values (CSV) files store tabular data (numbers and text) in plain text, where each line of the file typically represents one data record. It can be opened with a spreadsheet application like Excel, Google Sheets, or OpenOffice Calc.
XLS/XLSX: An XLSX or XLS file is an Excel spreadsheet created by Microsoft Excel or exported by another spreadsheet program, such as OpenOffice Calc or Google Sheets.
JSON: A JSON file is a file that stores simple data structures and objects in JavaScript Object Notation (JSON) format, which is a standard data interchange format. Since it is a text file, you can open JSON files using any text editor as well as compilers including Visual Studio Code.
GeoJSON: A GeoJSON file is a JSON file that stores geographical data such as points, lines, polygons, etc. It is a common and compact format for exchanging spatial data between applications. It can be opened using geospatial software such as ArcGIS or QGIS.
XML: An XML file is an XML (Extensible Markup Language) data file. They can be thought of as text-based data files. You can open an XML file in any text editor or web browser. But if you want to edit the files, you’ll need an XML editor — Microsoft XML Notepad.
SHP: Contains a dataset that stores a collection of geographic features, such as streets, points of interest, and zip code boundaries. It can be used to store point, line, or area features. You can open these with mapping software such as QGIS or ArcGIS.
KML: A KML file stores geographic modeling information in the Keyhole Markup Language (KML) including placemarks, points, lines, polygons, and images. KML files can be opened using Google Earth and mapping software.
Benefits of open data
Open data offers numerous benefits that extend to various sectors and stakeholders. Some of the key benefits include:
Transparency and Accountability: Open data enhances transparency in government activities, corporate practices, and other sectors. It holds institutions accountable for their actions and decisions by providing accessible information to the public.
Informed Decision-Making: Open data empowers individuals, organizations, and governments to make well-informed decisions based on accurate and up-to-date information. It aids in evidence-based policy formulation and strategic planning.
Research and Innovation: Researchers can access diverse datasets to conduct studies, validate hypotheses, and generate new insights. Open data fuels innovation by enabling collaboration and the development of new ideas.
Public Services Improvement: Governments can use open data to identify areas for improvement in public services, leading to enhanced service delivery, resource allocation, and responsiveness to citizens' needs.
Citizen Empowerment: Open data provides citizens with the tools to understand their communities, engage in civic discussions, and participate in public affairs. This promotes active citizenship and democratic engagement.
In summary, open data fosters transparency, drives innovation, empowers citizens, and catalyzes positive societal, economic, and environmental changes across various domains. If there are sources you’d recommend adding to our list, please let us know in the comments below. Many thanks in advance!
As the world races to decarbonize and bring on cleaner sources of electricity, it got us wondering how North America’s electricity grid stacks up against the rest of the world. In this story, we explore North America’s electricity mix in 2023 and how it has changed over the years.