oecd-data-mining

The Organisation for Economic Co-operation and Development (OECD)
Interface software suite provides a means to discover, download,
and convert OECD SDMX-JSON data sets into CSV files. The files can be further
processed to select a subset according to set criteria (e.g. industries
with electricity), with specific fixed (normalized) column types. The
suite covers:

downloading list of all OECD data set IDs and descriptions;
downloading list of all data set schema;
downloading all OECD SDMX-JSON data sets;
converting all time period data sets to un-pivoted CSV files;
selecting a subset according to set criteria with specific fixed (normalized) column types
concatenating this subset of fixed column CSV files into an overall master CSV file.

There is also the means to work with just OECD frequency dimension data,
which are a subset of the main time period data sets. This suite covers :

identifing frequency dimension (annual/quarterly) supporting schema;
downloading just OECD SDMX-JSON data sets with a frequency dimension;
converting frequency dimension data sets to multi-indexed CSV files.

The OECD Interface software suite is written for Python 3.5, pandas 0.24.2,
and uses the pandasdmx 0.8.2 Python package to convert SDMX-JSON files
to multi-indexed CSV files.

The full online documentation is to be found here, and lays out the
workflow for using these utilities:
https://snatch59.github.io/oecd-data-mining/

snatch59/oecd-data-mining

oecd-data-mining

On this page

Languages

Contributors