In a nutshell

The SDMX Converter is a tool that converts statistical datasets between different formats. The input and output formats supported by the SDMX Converter include (* only as input formats): 

For a complete picture on supported transformations please go to conversions supported by formats.

The SDMX Converter is used extensively in Eurostat and among the members of the European Statistical System (ESS). It is at the core of the Transformation Service. Its purpose, to convert files between different formats, is complementary to that of the SDMX-RI, which can generate SDMX-ML data directly from databases.

To perform conversion, the SDMX Converter requires an input file and the matching SDMX data structure definition (DSD) or a dataflow on its behalf. Optionally, the conversion can include structural validation of the input dataset against the DSD, including code list checks. See the Structural Validation Service.

For data formats other than SDMX-ML, additional structural information and parameters need to be supplied. Most importantly, every SDMX-ML file includes mandatory information within its header, which has to be supplied in case the original file does not contain it, as it is the case for CSV files. The typical examples of these are the CSV conversion parameters, which specify the field delimiter, quoting policy, ordering of columns, whether the file has a header row with concept names or not and all the mandatory information for the SDMX header.

Optionally, the transformation service can perform transcoding, i.e., replacement of the codes found in the source file using a user-supplied lookup table.

The SDMX Converter can be used in several forms:

  • As a web application with an interactive graphical user interface (GUI).
  • As a command-line tool, useful for both interactive and batch processing.
  • As a web service that can be invoked by any SOAP/REST compliant client.
  • As a Java library with its own application program interface (API).

Conversions supported by formats

Input

 

Output

CSV

SDMX

CSV

GESMES TS SDMX GENERIC SDMX COMPACT SMDX UTILITY SDMX CROSS SECTIONAL EXCEL SDMX 2.1 (*****) Message Group (******) FLR
CSV (**) NO YES YES YES YES YES YES(*) YES YES YES YES
SDMX CSV YES NO YES YES YES YES YES YES YES YES YES
GESMES/TS YES YES NO YES YES YES YES(*) YES YES NO NO
SDMX GENERIC YES YES YES NO YES YES YES(*) YES YES NO YES
SDMX COMPACT YES YES YES YES NO YES YES(*) YES YES NO YES
SMDX UTILITY YES YES YES YES YES NO YES(*) YES YES NO YES
SDMX CROSS-SECTIONAL

YES

(*)

YES

YES

(*)

YES

(*)

YES

(*)

YES

(*)

NO NO YES NO YES
SDMX 2.1 YES YES YES YES YES YES

YES

(****)

YES NO NO YES
FLR NO YES NO NO NO NO NO NO NO NO NO

(*) The selected DSD for these files must support cross-sectional messages i.e. having defined cross-sectional measures and cross-sectional attachment levels for its components (see SDMX standard)

(**)When the output format (CSV) is a flat file the attributes, attached in higher level than observations in the input message are omitted.

(****)The provided DSD has to be valid with Time dimension or cross sectional measures.

(*****)With exception of SDMX V2.1 formats no other format supports multiple datasets in a single data message. Consequently Converter only supports conversions of SDMX V2.1 files having a single dataset per message. Also conversion from SDMX 2.1 to SDMX 2.0 formats is possible only when the DSD is compatible with both formats. The compatibility with V2.0 formats is checked in SDMX Source API before conversion begins.

(******)The Message Group input file should have only one dataset reported in the message.

User Guide

Information on the steps needed to convert and / or validate datafiles