Documenting indicators
Metadata schema
Indicators are summary measures that capture key issues or phenomena, derived from observed data. When indicators are presented for a specific geographic area and include a temporal dimension — such as annual, quarterly, monthly, or daily values — they form time series.
To facilitate standardized documentation of indicators, the World Bank developed a metadata standard by compiling and structuring metadata elements commonly used by various organizations, including the World Bank itself, United Nations agencies, the International Monetary Fund (IMF), the Organisation for Economic Co-operation and Development (OECD), and Eurostat.
The World Bank metadata schema aligns with the Statistical Data and Metadata Exchange standard (SDMX), a standard designed to enable machine-to-machine data exchanges.
- The descriptive and reference metadata component of the World Bank schema aligns with SDMX Metadata Structure Definitions (MSDs). MSDs provide a framework for structuring metadata, but do not define the metadata elements to be included. The World Bank’s schema therefore complements SDMX by specifying detailed content that can be incorporated into SDMX MSDs and metadatasets.
- The structural metadata in the World Bank schema, which describes how indicator data are organized in the data file or database, aligns with the SDMX Data Structure Definitions (DSDs).
The Metadata Editor supports this integration by offering tools to:
- Export metadata templates as MSDs – ensuring compatibility with SDMX metadata structures.
- Export indicator metadata as meta-datasets – enabling seamless incorporation of metadata into SDMX-compliant systems.
Metadata schema for documenting databases of indicators
In addition to documenting individual indicators, the World Bank has developed a complementary metadata schema for documenting databases of indicators (i.e., collections of indicators). This schema provides additional metadata at the database level, which enriches the contextual information available for each indicator.
To establish a clear link between an indicator and its associated database, the indicator metadata schema includes a dedicated element that stores the database identifier, enabling seamless association between indicator metadata and database metadata.
In data catalogs like a NADA catalog, the metadata on indicators and the related databases can be combined.

Documenting an indicator
This section describes in detail the process of documenting an indicator, and the various functionalities provided in the Metadata Editor for generating and publishing indicators metadata. The section focuses on the functionalities specific to the documentation of indicators. Refer to the General instructions for guidance on the components of the application that are common to all data types.
Key principles for producing high-quality metadata
The following general principles should be followed to ensure the production of high-quality metadata. They apply broadly across metadata elements used to document statistical indicators. Subsequent sections of this Guide provide detailed instructions and quality criteria for specific metadata elements. These elements and criteria form the foundation of an AI-enabled system for metadata quality assurance and augmentation.
- Standardize terminology. Use consistent terms across all metadata. Where applicable, adopt internationally recognized controlled vocabularies or code lists (e.g., for topics, sectors, geographic areas) to enhance interoperability and comparability.
- Maintain consistency in terminology and jargon. Avoid mixing technical jargon with general terms unless clearly explained. Use consistent language and avoid introducing synonyms or alternative phrases for the same concept without clear justification.
- Link related indicators. Where applicable, establish links among related indicators. Group variants, disaggregations, or derived indicators using relationships such as: Is part of, Has disaggregation, Derived from. This helps users navigate the catalog and understand the relationships among indicators.
- Spell out acronyms and abbreviations. Avoid using unexplained acronyms, especially in definitions. Widely recognized acronyms (e.g., GDP, USD) may be used in titles, but must be spelled out or explained in the metadata definition or notes.
- Use accessible and non-technical language. Write metadata in a clear and accessible style. Avoid excessive technical detail. Complex methodological information should be included in referenced documentation, not embedded in core metadata fields.
Create a new project
The first step in documenting an indicator is to create a new project. You do that by clicking on CREATE NEW PROJECT in the My projects page, then selecting Indicator as data type when prompted. This will open a new, untitled indicator Project page.
In that page, select the template you want to use to document the indicator. A default template is proposed; no action is needed if you want to use the default template. Otherwise, switch to another template by clicking on the template name in the Templates frame. Note that you can at any time change the template used for the documentation of a project. The selected template will determine what you see in the navigation tree and in the metadata entry pages, but switching from one template to another will not impact the metadata that has already been entered; no information will be deleted from the metadata.
Enter information on metadata
The Metadata information section in the navigation tree (in the Project page) contains elements intended to document the metadata being generated, i.e., metadata about the metadata. All content in this section is optional; it is however recommended practice to document the metadata as precisely as possible. This information will not be useful to data users, but it will be to catalog administrators. When metadata is shared across catalogs, the information entered in the Information on metadata provides transparency and clarity on the origin of the metadata.
Enter the indicator description (descriptive metadata)
The documentation of an indicator is a relatively straightforward process. The navigation tree in the Project page provides access to various metadata entry pages where the information about the indicator can be captured. These metadata entry pages are defined by the selected template.
The template includes a description of each metadata element it contains, which can be used as instructions to data curators. Templates can be exported to PDF format, and used as a reference documents by data curators. The instructions contained in templates are also displayed in the metadata entry pages, by clicking on the ? icon shown next to each metadata element's label.
We provide below some guidance on a selection of the metadata elements found in the metadata schema. A small number of these elements are marked as required. When documenting an indicator, it is advised to provide the most comprehensive information possible.
In the list of metadata elements below, the key of each element in the metadata standard is provided between brackets next to the corresponding element's label in the template.
TITLE STATEMENT
Primary ID(idno) A unique identifier (ID) for the indicator. This is a required element. Most agencies will (and should) use a coherent coding convention to generate their indicators identifiersDOI(doi) A Digital Object Identifier (DOI) for the the indicator. See the Documenting data - General instructions section of the User Guide for more information on DOIs.Other identifiers(alternate_identifiers) The elementidnodescribed above is the reference unique identifier for the catalog in which the metadata is intended to be published. But the same indicator/metadata may be published in other catalogs. For example, a data catalog may publish metadata for indicator extracted from the World Bank World Development Indicators (WDI) database. And the WDI itself contains indicators generated and published by other organizations, such as the World Health Organization or UNICEF. Catalog administrators may want to assign a unique identifier specific to their catalog (theidnoelement), but keep track of the identifier of the indicator or indicator in other catalogs or databases. Thealternate_identifierselement serves that purpose. It includes the following sub-elements:Type(type) The type of identifier. For example: "ISBN".Identifier(name) This element will be used to provide the identifier.Database(database) The name of the database (or catalog) where this alternative identifier is used, e.g. "IMF, International Financial Statistics (IFS)".URL(uri) A link (URL) to the database mentioned indatabase.Notes(notes) Any additional information on the alternate identifier.
Name(name) The name (label) of the indicator. Note that a fieldaliasesis provided (see below) to capture alternative names for the indicator.Display name(display_name) The name (label) of the indicator as it should be displayed in a data catalog.Aliases(aliases) An indicator can be referred to using different names. Thealiaseselement is provided to capture the multiple names and labels that may be associated with (i.e synomyms of) the documented indicator.Database ID(database_id) The unique identifier of the database the indicator belongs to. This field must correspond to the elementdatabase_description > title_statement > idnoin the database schema. This is the field that will be used to establish the link between the database metadata and the indicator metadata.Database name(database_name) The name of the database the indicator belongs to (name of the database identified inDatabase ID).Date released(date_released) The date, in ISO format (YYYY-MM-DD) when the indicator was released.
SOURCES, CONCEPTS, AND METHODS
Definition short(definition_short) A short definition of the indicator. The short definition captures the essence of the indicator.Definition long(definition_long) A long(er) version of the definition of the indicator. If only one definition is available (not a short/long version), it is recommended to capture it in thedefinition_shortelement. ALternatively, the same definition can be stored in bothdefinition_shortanddefinition_long.Definition references(definition_references) This element is provided to link to an external resources from which the definition was extracted.Source(source) The source of the definition (title, or label).URL(uri) A link (URL) to the source of the definition.Note(note) This element provides for annotating or explaining the reason the reference has been included as part of the metadata.
Relevance(relevance) This field documents the relevance of an indicator in relation to a social imperative or policy objective.MandateMandate(mandate) Description of the institutional mandate or of a set of rules or other formal set of instructions assigning responsibility as well as the authority to an organization for the collection, processing, and dissemination of statistics for this indicator.Mandate URL(URI) A link to a resource (document, website) describing the mandate.
Data collection(data_collection) This group of elements can be used to document data collection activities that led to or allowed the production of the indicator. This element will typically be used for the description of surveys or censuses. Note: the schema also contains an element "sources". That element will be used to document the organization and/or main data production program from which the indicator is derived.Data source(data_source) A concise and standardized name (label) for the data source, e.g. "National Labor Force Survey, 1st quarter 2022". If multiple data sources were used, they can all be listed here. Note that if an indicator has values obtained from many different sources, the source for each value (or group of values) will not be part of the indicator metadata, but will be stored as an attribute in the data file where the information can be associated with a specific observation ("cell note" or group of observation (e.g. attached to an indicator for avv values for a same year or for a same area).Data collection method(method) Brief information on the data collection method, e.g. :Sample household survey".Data collection period(period) Information on the period of the data collection, e.g. "January to March 2022".Data collection note(note) Additional information on the data collection.Data collection URL(uri) A link to a resource (website, document) where more information on the data collection can be found.
Methodology(methodology) Methodological details on the production of the series or indicator.Methodology references(methodology_references) This element is provided to link to an external resources from which the definition was extracted.Source(source) The source of the information on methodology.URL(uri) A link (URL) to the source of the information on methodology.Note(note) This element provides for annotating or explaining the reason the reference has been included as part of the metadata.
Derivation(derivation) Description of the derivation method (not including imputations, which should be described in element "Imputation").Source(source) The source of the information on derivation.URL(uri) A link (URL) to the source of the information on derivation.Note(note) This element provides for annotating or explaining the reason the reference has been included as part of the metadata.
Derivation references(derivation_references) This element is provided to link to an external resources from which the definition was extracted.Imputation(imputation) Data may have been imputed to account for data gaps or for other reasons (harmonization/standardization, and others). If imputations have been made, this element provides the space for their description.Imputation references(imputation_references) This element is provided to link to an external resources from which the definition was extracted.Source(source) The source of the information on imputation.URL(uri) A link (URL) to the source of the information on imputation.Note(note) This element provides for annotating or explaining the reason the reference has been included as part of the metadata.
Statistical concept(statistical_concept) This element allows to insert a reference of the indicator with content of a statistical character. This can include coding concepts or standards that are applied to render the data statistically relevant.Concept references(concept_references) This element is provided to link to an external resources from which the definition was extracted.Source(source) The source of the information on statistical concepts.URL(uri) A link (URL) to the source of the information on statistical concepts.Note(note) This element provides for annotating or explaining the reason the reference has been included as part of the metadata.
Related concepts(concepts) This repeatable element can be used to document concepts related to the indicators (other than the main statistical concept that may have been entered instatisticsl_concept). For example, the concept of malnutrition could be documented in relation to the indicators "Prevalence of stunting" and "Prevalence of wasting".Name(name) A concise and standardized name (label) for the concept.Definition(definition) The definition of the concept.URL(uri) A link (URL) to a resource providing more detailed information on the concept.
Aggregation method(aggregation_method) Theaggregation_methodelement describes how values can be aggregated from one geographic level (for example, a country) to a higher-level geographic area (for example, a group of country defined based on a geographic criteria (region, world) or another criteria (low/medium/high-income countries, island countries, OECD countries, etc.). The aggregation method can be simple (like "sum" or "population-weighted average") or more complex, involving weighting of values.Aggregation references(aggregation_references) This element is provided to link to an external resources from which the aggregation method was extracted.Source(source) The source of the information on aggregation.URL(uri) A link (URL) to the source of the information on aggregation.Note(note) This element provides for annotating or explaining the reason the reference has been included as part of the metadata.
Sources of data(sources) This element provides information on the source(s) of data that were used to generate the indicator. A source can refer to an organization (e.g., "Source: World Health Organization"), or to a dataset (e.g., for a national poverty headcount indicator, the sources will likely be a list of sample household surveys). Insources, we are mainly interested in the latter. When an indicator in a database is an indicator extracted from another database (e.g., when the World Bank World Development Indicators include an indicator from the World Health Organization in its database), the source organization should be mentioned in theauthoring_entityelement of the schema. Thesourceselement is a repeatable element. Note 1: In some cases, the source of a specific value in a database will be stored as an attribute of the data file (e.g., as a "footnote" attached to a specific cell. If the sources are listed in the data file, they may but do not need to be stored in the metadata. Note 2: the schema also contains an element "data_collection" that would be used to describe a specific data collection activity from which an indicator is derived.Identifier(idno) This element records the unique identifier of a source. It is a required element. If the source does not have a specific unique identifier, a sequential number can be used. If the source is a dataset or database that has its own unique identifier (possibly a DOI), this identifier should be used.Other identifiers(other_identifiers) This repeatable element is used to enter identifiers (IDs) other than the primary ID.Type(type) The type of identifier. For example: “DOI”, or "ISBN".Identifier(identifier) The identifier itself.
Type(type) The type of source, e.g. "household survey", "administrative data", or "external database".Name(name) The name (title, or label) of the source.Organization(organization) The organization responsible for the source data.Authors(authors) Authors (with detailed information on authors)Datasets(datasets)Identifier(idno) The identifier of the dataset.Title(idno) Name (title) of the dataset.URL(idno) Link to a dataset website.
Publisher(publisher)Publication date(publication_date)URL(uri)Date accessed(access_date)Notes(note) This element can be used to provide additional information regarding the source data.
Notes on data source(sources_note) Additional information on the source(s) of data used to generate the indicator or indicator.
STANDARDS AND FRAMEWORKS
Standards(compliance) For some indicators, international standards have been established. This is for example the case of indicators like the unemployment or unemployment rate, for which the International Conference of Labour Statisticians defines the standards concepts and methods. Thecomplianceelement is used to document the compliance of an indicator with one or multiple national or international standards.Name(standard) The name of the standard that the indicator complies with. This name will ideally include a label and a version or a date. For example: "International Standard Industrial Classification of All Economic Activities (ISIC) Revision 4, published in 2007"Abbreviation(abbreviation) The acronym of the standard that the indicator complies with.Custodian(custodian) The organization that maintains the standard that is being used for compliance. For example: "United Nations Statistics Division".URL(uri) A link to a public website site where information on the compliance standard can be obtained. For example: "https://unstats.un.org/unsd/classifications/Family/Detail/27
Frameworks(framework) Some national, regional, and international agencies develop monitoring frameworks, with goals, targets, and indicators. Some well-known examples are the Millennium Development Goals and the Sustainable Development Goals which establish international goals for human development, or the World Summit for Children (1990) which set international goals in the areas of child survival, development and protection, supporting sector goals such as women’s health and education, nutrition, child health, water and sanitation, basic education, and children in difficult circumstances. Theframeworkelement is used to link an indicator to the framework, goal, and target associated with it.Name(name) The name of the framework.Abbreviation(abbreviation) The abreviation of the name of the framework.Custodian(custodian) The name of the organization that is the official custodian of the framework.Description(description) A brief description of the framework.Goal ID(goal_id) The identifier of the Goal that the indicator is associated with.Goal name(goal_name) The name (label) of the Goal that the indicator is associated with.Goal description(goal_description) A brief description of the Goal that the indicator is associated with.Target ID(target_id) The identifier of the Target that the indicator is associated with.Target name(target_name) The name (label) of the Target that the indicator is associated with.Target description(target_description) A brief description of the Target that the indicator is associated with.Indicator ID(indicator_id) The identifier of the indicator, as provided in the framework (this is not theidnoidentifier).Indicator name(indicator_name) The name of the indicator, as provided in the framework (which may be different from the name provided inname)Indicator description(indicator_description) A brief description of the indicator, as provided in the framework.URL(uri) A link to a website providing detailed information on the framework, its goals, targets, and indicators.Notes(notes) Any additional information on the relationship between the indicator and the framework.
QUALITY
Limitation(limitation) This element is used to communicate to the user any limitations or exceptions in using the data. The limitations may result from the methodology, from issues of quality or consistency in the data source, or other.Validation rules(validation_rules) Description of the set of rules (itemized) used to validate values for the indicator, e.g. "Is within range 0-100", or "Is the sum of indicator X + indicator Y".Quality checks(quality_checks) Data may have gone through data quality checks to assure that the values are reasonable and coherent, which can be described in this element. These quality checks may include checking for outlying values or other. A brief description of such quality control procedures will contribute to reinforcing the credibility of the data being disseminated.Quality note(quality_note) Additional notes or an overall statement on data quality. These could for example cover non-standard quality notes and/or information on independent reviews on the data quality.Discrepancies(sources_discrepancies) This element is used to describe and explain why the data in the indicator may be different from the data for the same indicator published in other sources. International organizations, for example, may apply different techniques to make data obtained from national sources comparable across countries, in which cases the data published in international databases may differ from the data published in national, official databases.Adjustments(adjustments) Description of any adjustments with respect to use of standard classifications and harmonization of breakdowns for age group and other dimensions, or adjustments made for compliance with specific international or national definitions.Missing(missing) Information on missing values in the indicator or indicator. This information can be related to treatment of missing values, to the cause(s) of missing values, and others.Errata(errata) This element is used to provide information on detected errors in the data or metadata for the indicator, and on the measures taken to remedy them.Date(date) The date the erratum was published.Description(description) A description of the error and remedy measures.
Acknowledgements(acknowledgements) Itemized list of persons and organizations being acknowledgedName(name) Name of the person or organization being acknowledged.Affiliation(affiliation) Affiliation of the person being acknowledged.Role(role) Role of the person or organization being acknowledged.
Acknowledgement statement(acknowledgement_statement) Overall statement of acknowledgement.Disclaimer(disclaimer) A disclaimer statement that applies to the indicator.
GEOGRAPHIC AND TIME COVERAGE
Time coverage(time_periods) The time period covers the entire span of data available for the indicator. The time period has a start and an end and is reported according to the periodicity provided in a previous element. The dates should be entered in ISO 8601 format (YYYY-MM-DD or YYYY-MM or YYYY).Start(start) The initial date of the indicator in the dataset.End(end) The end date is the latest date for which an estimate for the indicator is available.
Countries(ref_country) A list of countries or economies for which data are available in the indicator. This element is somewhat redundant with the next element (geographic_units) which may also contain a list of countries. Identifying geographic areas of type "country" is important to enable filters and facets in data catalogs (country names are among the most frequent queries submitted to catalogs).Name(name) The name of the country.Code(code) The code of the country. The use of the ISO 3166-1 alpha-3 codes is recommended.
Geographic areas(geographic_units) List of geographic units (regions, countries, states, provinces, etc.) for which data are available for the indicator.Name(name) Name of the geographic unit e.g. "World, "Africa", "Afghanistan", "OECD countries", "Bangkok".Code(code) Code of the geographic unit. The ISO 3166-1 alpha-3 code is preferred when the unit is a country.Type(type) Type of geographic unit e.g. "country", "state", "region", "province", "city", etc.
Bounding box(bbox) This element is used to define one or multiple bounding box(es), which are the rectangular fundamental geometric description of the geographic coverage of the data. A bounding box is defined by west and east longitudes and north and south latitudes, and includes the largest geographic extent of the dataset's geographic coverage. The bounding box provides the geographic coordinates of the top left (north/west) and bottom-right (south/east) corners of a rectangular area. This element can be used in catalogs as the first pass of a coordinate-based search. This element is optional, but if thebound_polyelement (see below) is used, then thebboxelement must be included.West(west) West longitude of the bounding box.East(east) East longitude of the bounding box.South(south) South latitude of the bounding box.North(north) North latitude of the bounding box.
DESCRIPTION
Authoring entity(authoring_entity) This set of five elements is used to identify the organization(s) or person(s) who are the main producers/curators of the indicator. Note that a similar element is provided at the database level. The authoring_entity for the indicator can be different from the authoring_entity of the database. For example, the World Bank is the authoring entity for the World Development Indicators database, which contains indicators obtained from the International Monetary Fund, World Health Organization, and other organizations that are thus the authoring entitis for specific indicators.Name(name) The name of the person or organization who is responsible for the production of the indicator. Write the name in full (use the elementabbreviationto capture the acronym of the organization, if relevant).Affiliation(affiliation) The affiliation of the person or organization mentioned inname.Abbreviation(abbreviation) Abbreviated name (acronym) of the organization mentioned inname.Email(email) The public email contact of the person or organizations mentioned inname. It is good practice to provide a service account email address, not a personal one.URL(uri) A link (URL) to the website of the entity mentioned inname.
Measurement unit(measurement_unit) The unit of measurement. Note that in many databases the measurement unit will also be mentioned in the indicator name. The World Bank's World Development Indicators (WDI) for example, contains indicators named as follows: "CO2 emissions (kg per 2010 US$ of GDP)", "GDP per capita (current US$)", "GDP per capita (current LCU)", or "Population density (people per sq. km of land area)".Dimensions(dimensions) An indicator can be made available with different levels of disaggregation. For example, an indicator containing annual estimates of the "Resident population (mid-year)" can be provided by urban/rural area of residence, by sex, and by age group. Thedimensionelement may be used to document such disaggregations. Note that the element should only be used when theData Structure Definitionof the indicator is not documented (see below the section on structural metadata). Documenting the Data Structured Definition is the preferred way of documenting the dimensions of an indicator.Name(name) The name of the dimension.Label(label) The label of the dimension, for example "sex", or "urban/rural".Description(description) A description of the dimension (for example, if the label is "age group", the description can provide detailed information on the age groups, e.g. "The age groups in the database are 0-14, 15-49, 50-64, and 65+ years old".)
Release calendar(release_calendar) Information on when updates for the indicators can be expected. This will usually not consist of exact dates (which would have to be updated regularly), but of more general information like "Every first Monday of the Month", or "Every year on June 30", or "The last week of each quarter".Periodicity(periodicity) The periodicity of the indicator. It is recommended to use a controlled vocabulary with values like annual, quarterly, monthly, daily, etc.Base period(base_period) The base period for the indicator. This field will only apply to indicators that require a base year (or other reference time) used as a benchmark, like a Consumer Price Index (CPI) which will have a value of 100 for a reference base year.Breaks in series(series_break) Breaks in statistical series occur when there is a change in the standards, sources of data, or reference year used in the compilation of a series. Breaks in series must be well documented. The documentation should include the reason(s) for the break, the time it occured, and information on the impact on comparability of data over time.Keywords(keywords) Words or phrases that describe salient aspects of a data collection's content. Can be used for building keyword indexes and for classification and retrieval purposes. A controlled vocabulary can be employed. Keywords should be selected from a standard thesaurus, preferably an international, multilingual thesaurus.Keyword(name) Keyword (or phrase). Keywords summarize the content or subject matter of the study.Vocabulary(vocabulary) Controlled vocabulary from which the keyword is extracted, if any.URL(uri) The URI of the controlled vocabulary used, if any.
Topics(topics) Thetopicsfield indicates the broad substantive topic(s) that the indicator covers. A topic classification facilitates referencing and searches in electronic survey catalogs. Topics should be selected from a standard controlled vocabulary such as the Council of European Social Science Data Archives (CESSDA) topics classification.ID(id) The unique identifier of the topic. It can be a sequential number, or the ID of the topic in a controlled vocabulary.Label(name) The label of the topic associated with the data.Parent ID(parent_id) When a hierarchical (nested) controlled vocabulary is used, theparent_idfield can be used to indicate a higher-level topic to which this topic belongs.Vocabulary(vocabulary) The name of the controlled vocabulary used, if any.URL(uri) A link to the controlled vocabulary mentioned in fieldvocabulary.
Themes(themes) Themes provide a general idea of the research that might guide the creation and/or demand for the indicator. A theme is broad and is likely also subject to a community based definition or list. A controlled vocabulary should be used. This element will rarely be used (the elementtopicsdescribed below will be used more often).ID(id) The unique identifier of the theme. It can be a sequential number, or the ID of the theme in a controlled vocabulary.Name(name) The label of the theme associated with the data.Parent ID(parent_id) When a hierarchical (nested) controlled vocabulary is used, theparent_idfield can be used to indicate a higher-level theme to which this theme belongs.Vocabulary(vocabulary) The name of the controlled vocabulary used, if any.URL(uri) A link to the controlled vocabulary mentioned in field `vocabulary'.
Disciplines(disciplines) Information on the academic disciplines related to the content of the document. A controlled vocabulary will preferably be used, for example the one provided by the list of academic fields in Wikipedia. This is a block of five elements:ID(id) The ID of the discipline, preferably taken from a controlled vocabulary.Name(name) The name (label) of the discipline, preferably taken from a controlled vocabulary.Parent ID(parent_id) The parent ID of the discipline (ID of the item one level up in the hierarchy), if a hierarchical controlled vocabulary is used.Vocabulary(vocabulary) The name (including version number) of the controlled vocabulary used, if any.URL(uri) The URL to the controlled vocabulary used, if any.
Disaggregation(disaggregation) This element is intended to inform users that an indicator is available at various levels of disaggregation. The related indicator should be listed (by andme and/or identifier). For indicator "Population, total" for example, one may inform the user that the indicator is also available (in other indicators) by sex, urban/rural, and age group (in indicator "Population, male" and "Population, female", etc.).Language(language) The language of the indicator/series metadata. For the language codes and names, the use of the ISO 639-2 standard is recommended.Acronyms(acronyms) Theacronymselement is used to document the meaning of all acronyms used in the metadata of an indicator. If some acronyms are well known (like "GDP", or "IMF" for example), others may be less obvious or could be uncertain (does "PPP" mean "public-private partnership", or "purchasing power parity"?). In any case, providing a list of acronyms with their meaning will help users and make your metadata more discoverable. Note that acronyms should not include country codes used in the documentation of the geographic coverage of the data.Acronym(acronym) An acronym referenced in the indicator metadata (e.g. "GDP").Expansion(expansion) The expansion of the acronym, i.e. the full name or title that it represents (e.g., "Gross Domestic Product").Occurrence(occurrence) This numeric element can be used to indicate the number of times the acronym is mentioned in the metadata. The element will rarely be used.
Related indicators(related_indicators) This element allows to reference indicators that are often associated with the indicator being documented.Code(code) The code for the indicator that is referenced in the document. It will likely be an ID that is used by that indicator.Label(label) The name or label of the indicator that is associated with the indicator being documented.URL(uri) A link to the related indicator.
Indicator group(series_group) The group(s) the indicator belongs to. Groups can be create to organize indicators by theme, producer, or other.Group name(name) The name of the group.Description(description) A brief description of the group.Version(version) The version of the grouping.URL(uri) A link to a public website site where information on the grouping can be obtained.
Notes(notes) This element is open and reserved for explanatory notes deemed useful to the users of the data. Notes should account for additional information that might help: replicate the indicator; access the data and research area, or discoverability in general.note(note) The note itself.
ACCESS AND USE
License(license) The license refers to the accessibility and terms of use associated with the data. Providing a license and a link to the terms of the license allos data users to determine, with full clarity, what they can and cannot do with the data.Name(name) The name of the license, e.g. "Creative Commons Attribution 4.0 International license (CC-BY 4.0)".URL(uri) The URL of a website where the licensed is described in detail, for example "https://creativecommons.org/licenses/by/4.0/".Note(note) Any additional information on the license.
Confidentiality statement(confidentiality) A statement of confidentiality for the indicator.Confidentiality status(confidentiality_status) This indicates a confidentiality status for the indicator. A controlled vocabulary should be used with possible options "public", "official use only", "confidential", "strictly confidential". When all indicators are made publicly available, and belong to a database that has an open or public access policy, this element can be ignored.Confidentiality note(confidentiality_note) This element is reserved for additional notes regarding confidentiality of the data. This could involve references to specific laws and circumstances regarding the use of data.Citation requirement(citation_requirement) This element is used to provide information on how the data should be cited. This can include the preferred citation format, the name of the author, the title of the data, the date of publication, the version of the data, and the URL of the data.Links(links) This element provides links to online resources of any type that could be useful to the data users. This can be links to description of methods and reference documents, analytics tools, visualizations, data sources, or other.Type(type) This element allows to classify the link that is provided.Description(description) A description of the link that is provided.URL(uri) The uri (URL) to the described resource.
Contacts(contacts) Thecontactselement provides the public interface for questions associated with the production of the indicator.Name(name) The name of the contact person that should be contacted. Instead of the name of an individual (which would be subject to change and require frequent update of the metadata), a title can be provided here (e.g. "data helpdesk").Role(role) The specific role of the contact person mentioned inname. This will be used when multiple contacts are listed, and is intended to help users direct their questions and requests to the right contact person.Affiliation(affiliation) The organization or affiliation of the contact person mentioned inname.Email(email) The email address of the person or organization mentioned inname. Avoid using personal email accounts; the use of an anonymous email is recommended (e.g, "helpdesk@....org")Phone(telephone) The phone number of the person or organization mentioned inname.URL(uri) The URI of the agency (typically, a URL to a "Contact us" web page).
API documentation(api_documentation) Increasingly, data are made accessible via Application Programming Interfaces (APIs). The API associated with an indicator must be documented. The documentation will usually not be specific to an indicator, but apply to all indicators in a same database.Description(description) This element will not contain the API documentation itself, but information on what documentation is available.URL(uri) The URL of the API documentation.
Periodicity(periodicity) The periodicity of the series. It is recommended to use a controlled vocabulary with values like annual, quarterly, monthly, daily, ad-hoc, etc.
VERSION STATEMENT
Version(version)Version date(version_date)Version notes(version_notes)Version responsibility(version_resp)
Use of AI to enhance descriptive metadata
You may take advantage of generative AI tools like ChatGPT or equivalent to suggest content for selected fields. For example, generative AI can help:
- Suggest keywords (ask ChatGPT to "Please suggest 20 keywords related to an indicator titled (enter the title of the indicator here) defined as (enter the definition of the indicator here).
- Propose a definition (ask ChatGPT to either Suggest a definition for a statistical indicator titled (...)) or to Improve the following definition of a statistical indicator titled (title) and currently defined as (definition).
- Propose a description of the indicator's relevance (ask ChatGPT to either Please describe the relevance of a statistical indicator titled (...) and defined as (definition). In the description, indicate to whom and for what purposes the indicator is relevant. Do not blindly accept suggestions formulated by AI models. Carefully review and improve (or reject) the proposed content.
Data structure (structural metadata)
The data structure definition (DSD) allows you to describe how the data are organized in the data file. This is the structural metadata for the indicator. The DSD in the Metadata Editor follows the SDMX standard.
In a data file, the indicator is supposed to be stored in long (not wide) format, i.e. in a database type of format where each row corresponds to one observation value. Each row must contain the core information required to define what the observation value represents, complemented by additional (optional) information.
Typically, a data file for an indicator will look like one of the two options below, which use the population by sex of two countries as an example.
Option 1. In option 1, the population by sex is provided as three different indicators: Population, total", "Population, male", and "Population, female". The data file may look like this:

Option 2. In option 2, the information is provided as a single indicator ("Population" with a dimension "sex" with values M, F or T for Male, Female, and Total).

Organizations make their own decisions on how to organize their indicators. Using dimensions has the advantage of reducing the number of indicators. But it makes data discovery (indexing and search in data catalogs) somewhat more complex. In some cases, the use of dimensions is the only practical option. For example, data on population by age group (with 9 age groups + total), sex (2 options + total), and urban/rural (2 options with total) would require 10 x 3 x 3 = 90 different indicators. Maintaining a single indicator "population" with 3 dimensions would be more efficient (one indicator can have multiple dimensions).
Properly documenting the data structure of an indicator provides users (and machines) with the information they need to query and use the data.

A data structure definition consists of providing the following information about the indicator and the way it is organized in the data file. This information will be provided for each column of the data file:
- Name: The column name
- Label: The column label
- Description: A brief description of the column
- Data type: The type of variable, with the following possible values: String, Integer, Float, Date, and Boolean.
- Column type: The type of column, with the following options:
- Dimension: In our Option 2 example, column "sex" is a dimension.
- Time period: The column indicates the time period to which the observation value applies. In our Option 2 example, "year" is a time period.
- Attribute:
- Indicator ID: The column is the indicator unique identifier. Only one column can be an Indicator ID. In our Option 2 example, "IndicatorID" is the Indicator ID.
- Indicator name: The column is the name (or title) of the indicator. In our Option 2 example, "Name" is the indicator name.
- Annotation:
- Geography: The geographic area to which the value corresponds. In our example, column "country" is the geography.
- Observation value: The observation value (the "data" itself). In our example, column "value" is the observation value.
- Periodicity:
- Time period format: Time period format is used to indicate the format of the date in the column identified as Time period. In our Option 2 example, Time period format is "YYYY" as we have data by year.
- Codelist:
- Codelist reference:
All columns in the data file will be documented.

DataCite
See section "Documenting - General instructions".
Provenance
The Provenance container is used to document how and when the dataset was acquired. It is used to ensure traceability. See section "Documenting - General instructions".
Tags
See section "Documenting - General instructions".
External resources
External resources are all materials (and links) that relate to the indicator. This may include documents on methodology, scripts, photos and videos, and any other resource available in digital format. These materials and links are added to the documentation of an indicator in the External resources container. Click on External resources in the navigation tree, then on CREATE RESOURCE. Enter the relevant information on the resource (at least a title), then provide either a filename (the file will then be uploaded on the server that hosts the Metadata Editor) or a URL to the resource.
External resources that have already been created for another project can also be imported. To do that, they must first be exported as JSON or RDF from the other project. The click on IMPORT in the External resources page, and select the file.
External resources will be part of the project ZIP package (when the ZIP package is generated - See the main menu).
Export and publish
The indicator metadata can be saved as a JSON file, or exported as a ZIP package. The metadata can also be published in a NADA catalog using the Publish to NADA option.