Survey of Earned Doctorates

Survey Overview (2018 survey cycle)

Purpose. The Survey of Earned Doctorates (SED) collects data on the number and characteristics of individuals receiving research doctoral degrees from U.S. academic institutions.

Data collection authority. The information collected by the SED is solicited under the authority of the National Science Foundation Act of 1950, as amended, and the America COMPETES Reauthorization Act of 2010. The Office of Management and Budget control number is 3145-0019, expiration date 31 May 2020.

Survey contractor. RTI International.

Major changes to the recent survey cycle. None.

Key Survey Information

Frequency. Annual.

Initial survey year. Academic year 1957–58.

Reference period. Academic year 2017–18 (1 July 2017 to 30 June 2018).

Response unit. Individuals.

Sample or census. Census.

Population size. 55,195.

Sample size. Not applicable.

Key variables. Key variables of interest are listed below.

  • Academic institution of doctorate
  • Baccalaureate-origin institution (U.S. and foreign)
  • Birth year
  • Citizenship status at graduation
  • Country of birth and citizenship
  • Disability status
  • Educational attainment of parents
  • Educational history in college
  • Field of each degree earned (N = 334 fields)
  • Graduate and undergraduate educational debt
  • Marital status, as well as the number and age of dependents
  • Postgraduation plans (e.g., work, postdoc, other study or training)
    • Primary and secondary work activities
    • Source and type of financial support for postdoctoral study or research
    • Type and location of employer
    • Basic annual salary
  • Race and ethnicity
  • Sex
  • Sources of financial support during graduate school
  • Type of academic institution (e.g., historically black colleges and universities, Carnegie codes, public or private) awarding the doctorate

Survey Design

Target population. The population for the 2018 SED consists of all individuals receiving a research doctorate from a U.S. academic institution in the 12-month period beginning 1 July 2017 and ending 30 June 2018. A research doctorate is a doctoral degree that (1) requires completion of an original intellectual contribution in the form of a dissertation or an equivalent culminating project (e.g., musical composition) and (2) is not primarily intended as a degree for the practice of a profession. The most common research doctorate degree is the PhD. Recipients of professional doctoral degrees, such as MD, DDS, DVM, JD, DPharm, DMin, and PsyD, are not included in the SED.

Sampling frame. The total universe in 2018 included 55,195 persons from 431 institutions that conferred research doctorates.

Sample design. The SED is a census.

Data Collection and Processing

Data collection. Three modes of data collection are used in the SED: self-administered Web survey, self-administered paper questionnaire, and computer-assisted telephone interviewing (CATI). The self-administered Web survey is the primary mode of SED completion. The proportion of SED completions using the Web has increased each year since it was introduced in 2001, and it reached 95.5% in 2018.

For a small number of institutions, the institutional coordinator distributes the paper questionnaires to students receiving research doctorates. The institutional coordinators then collect the completed questionnaires and return them to the survey contractor for editing and data entry.

Both the Web survey and paper questionnaire are used in follow-up contacts via e-mail and mail to nonrespondents. If the series of follow-up emails and mailings is unsuccessful, the survey contractor attempts to reach nonrespondents to complete an abbreviated survey by CATI. Approximately 2% of SED completions each year are from CATI. At the end of data collection phase, institutional coordinators are contacted to obtain information on a small number of critical SED data items for nonrespondents from their institution.

Data processing. The data collected in the SED are subject to automated editing and coding procedures. Imputation is not performed for missing survey data items except for the month value used in calculation of the age at doctorate and time to degree variables.

Estimation techniques. The survey is a census, which does not require any sampling; weighting is not used to adjust for nonresponse.

Survey Quality Measures

Sampling error. Not applicable because the SED is a census.

Coverage error. Due to the availability of comprehensive lists of doctorate-granting institutions and the institutions’ high levels of participation in the survey, coverage error of institutions is minimal. Because the graduate schools collect the survey data from degree recipients at the time of doctorate completion, coverage error for the universe of doctorate recipients is also minimal. Comparisons of the institutions and the number of research doctorate recipients covered by the SED with the total number of doctorate recipients (including nonresearch doctorate degree recipients) reported by institutions to the National Center for Education Statistics confirm that there is minimal coverage error of doctorate recipients. In 2018, a small number of doctorate-granting institutions declined to fully enumerate their academic year 2018 doctorate recipients, resulting in a small percentage (approximately 0.1%) of undercoverage in the universe.

Nonresponse error.

Unit nonresponse. Of the 55,195 individuals granted a research doctorate in 2018, 92.1% completed the SED. Limited number of SED critical data items (doctoral institution, year of doctorate, field of doctorate, type of doctorate, and, if available, baccalaureate institution, master’s degree institution, and sex) are constructed for nonrespondents from commencement programs, graduation lists, and other similar public records. Nonresponse was concentrated in certain institutions with 43 institutions accounting for 70% of the total unit nonresponse.

Item nonresponse. Among the 55,195 individuals who received a research doctorate in 2018, item nonresponse rates for the five key SED demographic variables—sex, citizenship, country of citizenship, race and ethnicity, and location after graduation—range from 0.1% for sex to 7.0% for location after graduation.

Measurement error. Measurement error in the SED is attributable to several sources including errors in respondent reporting and errors that occur during data processing. Data reported by respondents about their educational history, including degree institutions and field of study, that are not coded within the survey instrument are reviewed and coded by trained coders. Average coding error rates were 0.12% for institution coding, 0.10% for fields of study coding, and 0.06% for “Other–specify” back coding.

Data Comparability

Data availability. Each year’s survey data are compiled into the Doctorate Records File (DRF) and trend data are available back to 1957–58; more limited information (sex, institution, field, and year of doctorate) is contained on the DRF for PhDs who graduated in 1920–56.

Data comparability. Because of procedural changes implemented during the 1990 survey cycle to improve the completeness of race, ethnicity, and citizenship data, the data from 1990 and later years are not directly comparable to data before 1990.

Data Products

Publications. The data from this survey are published annually in a series reporting on all fields of study, the latest edition of which is Doctorate Recipients from U.S. Universities: 2018 (https://www.nsf.gov/statistics/sed/).

Additional data from this survey for earlier years are published in Science and Engineering Doctorates: 1960–91 (NSF 93-301). Also available is the interagency report U.S. Doctorates in the 20th Century, which provides an overview of the development of a national resource—the American system of doctoral education—from 1900 to 1999.

Information from the survey is also included in the series Science and Engineering Degrees, in Science and Engineering Degrees, by Race/Ethnicity of Recipients, in Science and Engineering Indicators, and in Women, Minorities, and Persons with Disabilities in Science and Engineering.

Electronic access. Access to tabular data on selected variables from 1958 onward is now available in an interactive data tool from the National Center for Science and Engineering Statistics (NCSES). Users can create custom tables of the number of doctorate recipients by academic discipline and institutional characteristics of either the doctorate-granting institution or the baccalaureate-granting institution of doctorate recipients.

NCSES created the SED Tabulation Engine in 2007 to strengthen the confidentiality protections applied to SED data while still meeting the needs of SED data users. This tool provides users with limited access to race and ethnicity, sex, and citizenship data from 2007 onward in a way that does not disclose personally identifiable information. Content from the SED Tabulation Engine will eventually transition to the NCSES interactive data tool.

Restricted access. Access to restricted data for researchers interested in analyzing microdata can be arranged through a licensing agreement. For more information, see the NCSES Licensing Page (https://www.nsf.gov/statistics/license/).

Contact Information

For additional information about this survey, or the methodology report, please contact:

Kelly H. Kang
SED Project Officer
Human Resources Statistics Program
National Center for Science and Engineering Statistics
National Science Foundation
2415 Eisenhower Avenue, Suite W14200
Alexandria, VA 22314
Phone: (703) 292-7796
E-mail: kkang@nsf.gov