OSDG Community Dataset (OSDG-CD)

Created Jan. 12, 2022, 2:12 p.m.
Updated Sept. 17, 2023, 2:30 p.m.

The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts that have been validated with respect to the Sustainable Development Goals (SDGs) by approximately 1,000 OSDG Community Platform citizen scientists from over 110 countries.

The data can be used to derive insights into the nature of SDGs using either ontology-based or machine learning approaches.

The dataset is updated on a quarterly basis. The current version (2022.07) contains 32,431 text excerpts and a total of 217,147 assigned labels.

Publication information

Year of publication: 2022
License: Creative Commons Attribution 4.0 International
DOI: https://doi.org/10.5281/zenodo.6831287

Links with projects and/or organisations
