Skip to the content.

Overview

In our commitment to enhancing reproducibility, it is essential to address the inherent challenges posed by manual steps in Excel-based packages. We aim to bridge this gap through automation and comprehensive documentation. The following guidelines are designed to structure Excel files to improve transparency, minimize manual intervention, and streamline the verification process for replicators.

In our experience, we’ve encountered three common cases when it comes to packages that use Excel:

Setup

Ensure that the package contains only the materials used in the outputs to maintain clarity and conciseness. Exclude any irrelevant data or files.

Contents of the package:

  1. README
  2. Manuscript
  3. Data
    • Raw, unaltered data from primary or secondary sources.
  4. Outputs
    • Excel workbooks with input data, calculations applied, and outputs

The reproducibility package should not include:

Contents of the README:

1. Overview

2. Data Availability Statement

3. List of Excel Workbooks/Sheets

Setting up Excel: Best Practices to Follow

1. Preserve Original Data:

2. Organize Sheets Logically:

3. Optimize Data Management and References:

4. Ensure All Changes are Traceable

What will be published?

If the raw or Intermediate data can be republished:

If the raw or Intermediate data cannot be republished:

Using Excel as a Secondary Software

If Excel is used alongside statistical software like Stata or R (e.g., for creating figures or tables in Excel after processing data in the software), follow these guidelines:

1. Automate as much as possible in the statistical software. The only tasks in Excel should be formatting or creating figures/tables with data exported from the software.

2. Use formulas for any required calculations—avoid manual calculations to minimize errors and ensure transparency.

3. Document any additional steps clearly so that replicators can easily follow the process.

Published packages that use Excel

1. Reproducibility package for The 2022 global food price shock in Chile and Colombia

2. Reproducibility package for Ensuring an equal start for all Pakistani children: What will it cost?

3. Reproducibility package for Estimating Value Added Tax (VAT) and Corporate Income Tax (CIT) Gaps in Indonesia