You are here

Data

Research Opportunities
The Research Data Centers provide restricted access to non-public Census Bureau data in a secure research environment to qualified researchers for statistical purposes. Projects that can be completed with public use data are not appropriate for the RDCs. In addition, the RDCs are not appropriate for research projects whose output consists primarily of tabulations of data.

A wide range of data collected by the US Census Bureau are potentially available for research projects at the MCRDC. For a list of datasets currently available for use at the RDCs, please see the list provided by the Center for Economic Studies.

Census Data

Economic Data: Firms and Establishments
Economic data refer to the Economic Census of establishments and various surveys and data for establishments and firms. With few exceptions, public use versions of these files are limited to data presented in aggregate form. Click here for a full list of Economic datasets available, including information on the frequency of collection, the level of enumeration, and the years currently available in the RDCs.

Demographic Data: Households and Individuals
Demographic data refer to the Decennial Census and other surveys of individuals and households administered by the Census Bureau. Compared to their public-use counterparts, the non-public files include more detailed geographic information, generally to the block level for the Decennial Census and census tract level for surveys, as well as less restrictive top-coding. The non-public versions of surveys also contain all the individuals surveyed, rather than subsamples published in the public use microdata sets. PLEASE NOTE: individual identifiers such as name, address, and social security numbers are NOT included. In many cases, the additional information in these files allows researchers to perform innovative research. A full list of available files is found here.

Because of the availability of detailed tabulations and public use microdata sets of many of these censuses and surveys, it is particularly important for prospective researchers to make sure they cannot accomplish their research project using these public-use data.

Note: Use of these data files may result in significant disclosure risks. This is especially true for studies of small populations (even with the increased sample sizes that may be available), and even more if the project studies small populations classified by geography and by population characteristics such as age, race, or sex. Moreover, the addition of contextual data also may increase disclosure risks. Researchers should keep these risks in mind in writing their proposals. To reduce the disclosure risks, proposed research projects should emphasize models, not tabulations.

LEHD Program
The Longitudinal Employer-Household Dynamics (LEHD) project provides snapshots of several of the LEHD infrastructure data files to qualified researchers with approved projects in the RDCs. Information on these data can be found here. Because the LEHD is a joint project between the US Census Bureau and the US States, RDC projects requesting LEHD data require state review in addition to Census review. There are public-use data products available from the LEHD project, with more information available here.

Combining Economic and Demographic Data
Projects at the RDCs have combined economic and demographic data or matched demographic data from different surveys and censuses based on geographic identifiers.

Combining Census Bureau Data with Non-Census Bureau Data
Researchers with outside data such as administrative records may seek to enrich the information available to them by linking their data with Census Bureau data files. The MCRDC supports this kind of data development and innovation. However, such projects are subject to additional scrutiny and the review process will require more time because it is necessary to assess carefully possible disclosure risks, to obtain any permissions required to use the outside data and link the data sets, and to assess the costs and feasibility of data set construction.

Health Data
The US Census Bureau partners with two federal health agencies to make restricted data from those agencies available to qualified researchers through the Research Data Center (RDC) network. In all cases, researchers will still need to obtain Special Sworn Status in order to use these data at one of the Census RDCs.

Agency for Healthcare Research and Quality
Please see the AHRQ website for information on the non-public data available in the RDCs.

National Center for Health Statistics
Please see the NCHS website for information on the non-public data available in the RDCs.

Available Data from the Federal Statistical Research Data Center