Princeton University Library
Pliny Fisk Library of Economics & Finance


HEALTH DATA




1. Electronic Data (Highlights)

Some additional sources can be found on the Data and Statistical Services Health Page.

OECD Health Data
This dataset examines national health systems from 1960 forward for OECD member countries in a general, demographic, economic, and social context.

World Health Organization Statistical Information System (WHOSIS)
Guide to health and health-related epidemiological and statistical information available from the World Health Organization. Most WHO technical programmes make statistical information available, and they will be linked from here.

AIDS Public Data Set
This data set contains counts of AIDS (Acquired Immune Deficiency Syndrome) cases reported to CDC by state and local health departments from 1981+. Case counts can be retrieved by demographics, case-definition, date of diagnosis, date of report, HIV exposure group (risk factors), and mortality.

Annual Survey of Hospitals
Producer: American Hospital Association
Years covered: 1975-2006
Scope: surveys for-profit and not-for-profit hospitals on utilization and cost related matters
Sample Size and Makeup: profit and not-for-profit hospitals
How segmented: by state, type of authority, type of service, bed size, metropolitan statistical area size
Where is the documentation? (DSS) RA981.A2.A32
Where do I get the data? 1975-1995, 1996-2006 is available on CD under DSS Study # 457. Ask at the SSRC Desk.
Sample Questions:
How many doctors, nurses, etc. does each hospital have? Which hospitals provide hospice care? Cardiac facilities? Does your hospital provide an HMO? PPO? Indemnity Plan? What were the total Medicare and Medicaid discharges? What were the employee benefits?

Public Patient Discharge Data
Producer:
California. Office of Statewide Health Planning and Development
Former Titles: California Patient Discharge Data (1983-2000); Also called California Hospital Discharge.
Years Covered: Annual data holdings begin in 1983.
Sample Size and Makeup: each inpatient discharged from California acute care hospitals which includes General Acute Care Hospitals, Acute Psychiatric Hospitals, Chemical Dependency Recovery Hospitals, Psychiatric Health Facilities, and State-operated hospitals.
How segmented: hospital facility number; age, race, sex, and 5-digit ZIP code; length of stay, day of the week on which patient was admitted along with the quarter of the year and the year admitted; source of admission; type of admission; principal diagnosis and up to 4 other diagnoses; principal procedure along with a few other procedures; disposition of the patient; expected principal source of payment; days from admission to each procedure; Diagnosis Related Group (DRG); Major Diagnostic Category (MDC); total charges; principal external cause of injury and up to 4 other external causes of injury; and the patient's county of residence.
Where is the documentation: (DSS) RA981.C3 C35 and RA981.C3 P824
Where do I get the data? DSS Study #2947 is restricted and stored on a secure server. Access will be granted after authorization. Please sign the form and give to the Data Librarian.
Questions: How do people pay for health care? Which diagnoses and treatments are linked to which illnesses? What is the average length of stay for a visit?

Healthcare Cost and Utilization Project
Producer:
United States. Agency for Health Care Policy and Research
Years covered: 1988+
Sample Size and Makeup: uses a stratified probability sample of community hospitals, with sampling probabilities proportional to the number of U.S. community hospitals in each stratum. Sampling weights were used to obtain national estimates for discharges, average length of stay, and average total charges. Includes American Hospital Association codes.
How segmented: region, location, teaching status, ownership/control, and bed size
Where is the documentation: (DSS) RA981.A2 H38
Where do I get the data? See the Main Catalog for study numbers and access

ICPSR Health and Medical Care Archive
The HMCA preserves and disseminates health care data collected by researchers. Subjects covered include health care providers, cost/access to health care, substance abuse and health, chronic health conditions, and others.

Medical Expenditure Panel Survey
The Medical Expenditure Panel Survey, or MEPS as it is commonly called, is the third (and most recent) in a series of national probability surveys conducted by AHRQ on the financing and utilization of medical care in the United States. MEPS is the most recent in a series of medical expenditure surveys that began in 1977 as the National Medical Care Expenditure Survey and later became the National Medical Expenditure Survey (NMES). The last study in this series was conducted in 1987. This new survey provides critically needed updates to the 1987 data.

continues:

National Medical Expenditure Survey (NMES)
Years covered:
1977, 1980, 1987.
Scope: provides information on health expenditures by or on behalf of families and individuals, the financing of these expenditures, and each person's use of services.
Sample Size and Makeup: (1) Households: national probability sample of the noninstitutionalized USA civilian population (2) Institutional Population: sample of nursing and personal care homes and facilities for the mentally retarded and residents admitted to those facilities. (3) American Indians and Alaskan Natives living on or near federal reservations.
How segmented: Age, Marital status, student status, veteran status, race, ethnicity
History: 1977: National Medical Care Expenditure Survey, 1980: National Medical Care and Utilization Survey, 1987: National Medical Expenditure Survey. Updated by the Medical Expenditure Panel Survey (see earlier in guide)
Where is the documentation: (DSS) RA410.53.N38
Where do I get the data? See ICPSR Direct
Questions: What is the average length of wait to see a doctor in a given specialty? Do physicians follow their own advice about smoking? Does a person's income determine how often they get a mammogram? How often do doctors only recommend necessary surgery? What is the cost of X procedure?

Medicare current beneficiary survey : cost and use, 1994-2003
Link Medicare claims to survey-reported events and provides complete expenditure and source of payment data on all health care services, including those not covered by Medicare. Expenditure data were developed through a reconciliation process that combines information from survey respondents and Medicare administrative files. Files are highly restricted. To apply for use, go to http://www.cms.hhs.gov/apps/mcbs/FileAval.asp .

National Center for Health Statistics
The National Center for Health Statistics (NCHS) is a part of the Centers for Disease Control and Prevention (CDC), U.S. Department of Health and Human Services. The mission of the NCHS is to provide statistical information that will guide actions and policies to improve the health of the American people. NCHS data systems include data on vital events as well as information on health status, lifestyle and exposure to unhealthy influences, the onset and diagnosis of illness and disability, and the use of health care. These data are used by policy makers in Congress and the Administration, by medical researchers, and by others in the health community. Princeton participates in the NCHS Public-Use data program. Below is a sampling of some of the more commonly used NCHS datasets.

National Ambulatory Medical Care Survey Series
Years covered:
1973-1981, 1985, 1989+.
Scope: contains data on medical care provided in physician's offices; Obtains data on the number of office visits and by selected physician characteristics. Data describing the nature of office visits include the patient's problem, prior visit status, referral status, major reason for the visit, physician's diagnosis, diagnostic and therapeutic services provided, and duration of the visit.
Sample Size and Makeup: continuously sampled survey based on a nationwide multistage probability sample of patient records. The physicians from which records are obtained are nonfederally-employed and are primarily involved in office-based patient care, but not engaged in the specialties of radiology, pathology, or anesthesiology.
How segmented: patient (age, race, sex of patient); physician (geographic location, type of practice, specialization)
Where is the documentation: (DSS) RA407.3.N37
Where do I get the data? See ICPSR Direct ICPSR is usually a year or two behind. Latest available on CD-ROM in Documents Division in SSRC.
Questions: How do patients pay? Does the patient smoke? Does drug or alcohol abuse impact the # of doctor's visits? Does race, sex, or age impact the # of doctor's visits? Which demographic groups tend to use X prescription? Which are the most popular specialties? What % of the population is most likely to see a physician for X illness?

National Hospital Ambulatory Medical Care Survey Series
Years covered:
1992+.
Scope: contains data on visits to hospital outpatient departments and emergency departments. Obtains data on demographics, triage, complaints, diagnosis, services, medications and immunizations provided, and waiting time.
Sample Size and Makeup:Visits to the emergency and outpatient departments of noninstitutional general and short-stay hospitals within the 50 states and the District of Columbia, which had an average length of stay of less than 30 days, or to hospitals whose specialty was general (medical or surgical) or children's general. Excluded were federal hospitals, hospital units within institutions, and hospitals with less than six beds staffed for patient use.
How segmented: patient (age, race, sex of patient); geographic location of facility.
Where do I get the data and documentation? See ICPSR Direct

National Health Interview Survey Series
Years covered:
1975- . For 1970-1975, see Health Interview Surveys.
Scope: basic purpose is to obtain information about the amount and distribution of illness, its effects in terms of disability and chronic impairments, and the kinds of health services people receive. Information on the utilization of medical care facilities is also available in the form of data on medical and dental care, hospitalization, preventive care, nursing care, prosthetic appliances, and self-care. The Core variables are contained in the files for household, person, condition, doctor visit, and hospital data. Each year additional batteries of questions are asked which focus on specific topics.
Sample Size and Makeup: representative sample of the civilian, noninstutionalized population of the USA
How segmented: type of living quarters, size of family, geographic region, age, sex, race, marital status, veteran status, education, income, industry, occupation codes, and limits on activity.
Where is the documentation: (DSS) RA407.3.N38
Where do I get the data? See ICPSR Direct. Also see the Integrated Health Interview Series which integrates 1969-2005.
Questions: Questions: Does income status impact health service received? Does the patient have Medicare, Medicaid, private health insurance? Does race impact prevalence of certain illnesses? Do certain areas of the country have higher incidences of certain diseases? What variables (age, race, sex, education, etc) impact one's health? Does your occupation impact your health?

National Hospital Discharge Survey
Years covered:
1964- .
Scope: part of a continuing sample of hospital discharge records that supplies medical and demographic information used to calculate statistics on hospital utilization. The data collection consists of data abstracted from the face sheets of the medical records for sampled inpatients discharged from a national sample of nonfederal short-stay hospitals. The variables include information on the patient's demographic characteristics (sex, age, date of birth, race, marital status), dates of admission and discharge, status at discharge, diagnoses, procedures performed, source of payment, and hospital characteristics, such as bedsize, ownership, and region of the country.
Sample Size and Makeup: national probability sample of visits to the emergency and outpatient departments of noninstitutional general and short-stay hospitals. USA hospitals that had an average length of stay of less than 30 days, or hospitals whose specialty was general (medical or surgical) or children's general were eligible. Excluded were federal hospitals, hospital units within institutions, and hospitals with less than 6 beds staffed for patient use.
How segmented: See scope note
Where is the documentation: (DSS) RA407.3.N374
Where do I get the data? See ICPSR Direct
Questions: Questions: How does one pay for hospital visits? Does payment coverage differ by age, sex, region? How long does one stay in the hospital? Does treatment differ by type of hospital?

For additional data sets, see the DSS Health Guide.

2. Additional Sources of Health Statistics

Secondary Data Sources for Public Health . Cambridge University Press, 2007.
(SSRC) RA409.B66 2007
Excellent place for overview of major datasets related to health.

Drug Topics Red Book. Thomson PDR, 2002+.
(SXF) HD9666.1.D75
Standard source for prescription drug prices.

Global Health Statistics: A Compendium of Incidence, Prevalence, and Mortality Estimates for Over 200 Conditions. World Health Organization, 1996.
(SSRC Ready Reference) RA407.M87 1996

International Health Data Reference Guide. Centers for Disease Control and Prevention, 2001.
(SSRC) RA407.A58 2001

World Health Statistics. (Various titles) World Health Organization, 1939+
Includes health and demographic data and causes of death, life tables, and mortality trends. Note in some years the data was split into various titles and there is more than one call number. Available online for 1997+.
(Firestone) HB1321.W89q for 1939-1946
(Firestone) HB1321 .W89 for 1947-1961
(Firestone and Annex B) HB1321 .W89 for 1962-1978
(Firestone and Stokes) RA960 .W67q for 1980
(Firestone and Stokes) RA643 .W67q for 1979-1981
(Firestone) RA407.A1 W67q for 1979-1982
(SSRC) RA651.A85q for 1983-1996
(UN) RA651.W646q A85q for 2005+

World Health Report . (Various titles) World Health Organization, 1939+
combines an expert assessment of global health, including statistics relating to all countries, with a focus on a specific subject. Available online for 1995+.
(SSRC) RA8 .A27

Health in the Americas. Pan American Health Organization , 1977+
Published irregularly. Volume I describes the health situation from a regional perspective and contains a series of chapters that identify and analyse the most important general, demographic, and mortality trends of the period; major determinants of health as seen from a variety of political, social, economic, and financial standpoints; principal trends and characteristics of the ongoing health sector reform movement; the role of individuals and communities in promoting health; the association between the physical environment and human living conditions; principal diseases and health problems; the extent of human and technological resources available to address health issues today; and the status and achievements of international and bilateral cooperation within the context of the growing tendency toward trade globalization. Volume II consists of a breakdown of these same topics by country and covers the 45 nations and territories of the Americas. Also from PAHO is the Regional Core Health Data Initiative includes 108 essential health indicators to quantitatively characterize the health situation of  48 countries and territories of the Region of the Americas for the period 1995+. With this system, you can construct tables by any combination of indicator(s)/country(ies)/year(s). Also see their electronic books collection.

Health and healthcare in the United States: county and metro area data. NationsHealth Corporation, 2001.
(SXF Ready Reference) RA407.3.H424 2001

Health United States. U.S. Dept. of Health and Human Services, 1975- .
Presents national trends in United States health status.
RA407.3.U57a (Current volume on SSRC Reference)

Hospital Statistics. American Medical Association, 1972-1983,1985-1986,1990- .
Summary Book of the data compiled from the American Hospital Association annual surveys.
RA981.A2A6234 (Current volume on SSRC Ready Reference)

Source Book of Health Insurance Data. Health Insurance Association of America, 1959-2002. (Ceased publication)
Collection of statistical data on health insurance and medical care in the United States. Includes health insurance coverage, managed care, the insurance market, medical care costs, utilization, medical care providers, and morbidity and mortality trends.
HG9396 .S72 (2002 volume in SXF)

Physician Socioeconomic Statistics. American Medical Association, 1999- .
Data on physician's weeks and hours of practice; utilization of physician services; hospital utilization by physicians; fees; professional expenses; and income. Continues Socioeconomic Characteristics of Medical Practice (1983-1998) R729.S62 and Profile of Medical Practice (1971-1977, 1980) RA410.7.R44
R729.P487 (Current volume on SSRC Ready Reference)

Physician Characteristics and Distribution in the US. American Medical Association, 1974- . (various titles)
Data on physician's activities, specialties, school of graduation, and demographics.
RA410.7.D47 (Current volume on SSRC Ready Reference)

Portrait of Health in the United States. Bernan, 2001.
Charts the health status of the United States presenting data on health correlates, conditions, care, and consequences. Also includes a guide to other resources.
(SSRC Ready Reference) RA407.3.P67

Reforming the Health Care System: State Profiles
. American Association of Retired Persons, 1990- .
State data on issues of concern to the elderly.
RA407.3.R43 (Older issues)

American Health: Demographics of Spending of Health Care Consumers. New Strategist, 2007.
Health care trends for addiction, aging, alternative medicine, attitudes towards health care, births, coverage and costs,deaths, disability, diseases and conditions, health care visits, hospital care, mental health, sexual attitudes and behavior, and weight and exercise..
(SXF) RA445.A442 20074

State-Level Databook on Health Care Access and Financing. Urban Institute, 1998.
Provides state level data on health insurance coverage; the uninsured; indices of health status; and health care costs, access and utilization.
RA410.53.L67 1998

Global Insight's Health Care and Pharmaceutical Library
Market research on the health care and pharmaceutical industries.

3. Health-Related WEB Sites

WEB Sites with health related data and information can be found at http://firestone.princeton.edu/econlib/health.html


Bobray Bordelon
Last updated June 6, 2008