Immigrant Mortality before and during the COVID-19 Pandemic in the United States ================ Eugenio Paglino and Irma T. Elo # Immigrant Mortality during the COVID-19 Pandemic We used death records for 2017-2020 from the National Center for Health Statistics (NCHS) under a data user agreement. We classified deaths by sex, age, bridged race, Hispanic origin, nativity, and cause of death. Deaths of US-born residents include deaths of individuals who were born and resided in the 50 US states, and deaths of foreign-born residents include deaths of foreign-born individuals residing in the 50 US states. We obtained US population counts by age, bridged race, Hispanic origin, and sex for 2017-2020 from CDC WONDER. To estimate population by nativity we pooled the 2017-2019 American Community Survey (ACS) 1-year files to estimate the proportion foreign-born by age, sex, race, and Hispanic origin. We then applied these proportions to the population counts to obtain populations by 10-year age groups, sex, race, Hispanic origin, and nativity. Because the pandemic affected the 2020 ACS data collection, we applied the 2019 proportions to the 2020 Census population estimates. Supplementary Tables 1a and 1b present the number of deaths and populations by nativity, race, Hispanic origin, and sex for broad age groups in 2017-2019 and 2020. We included nine exhaustive and mutually exclusive cause-of-death categories based on the underlying cause of death. These are respiratory diseases, circulatory diseases, cancers, Alzheimer disease and other dementias, diabetes, COVID-19, external causes, and all other causes combined (Supplementary Table 2). We chose these causes because mortality from them increased during the pandemic. Using the average 2017-2020 age distribution as the standard, we calculated age-standardized death rates (ASDR) by sex, race, Hispanic-origin, and nativity for all causes combined, COVID-19, and all causes other than COVID-19 at ages 25+, 25-64, and 65+ and by the more detailed causes at ages 25+ for 2017-2019 and 2020. We pooled three pre-pandemic years to adjust for year-to-year fluctuations in death rates. For each ASDR, we computed standard errors (SD) and coefficients of variation (SD/ASDR), which are included in the Supplementary Material. Because the size our smallest group (US-born Asian females in 2020) exceeds one million, all standard errors are small, and the largest coefficient of variation is 3.75%. Based on these data, we examined the contribution to COVID-19 and all other causes of death, other than COVID-19, to the change in all-cause mortality between 2017-2019 and 2020 by sex, nativity, and race/ethnicity at ages 25+, 25-64 and 65+. We further decomposed the cause-of-death contributions by the more detailed cause-of-death categories to US-born and foreign-born difference in ASDRs at ages 25+ by race/ethnicity and sex in 2017-2019 and 2020 and their contributions to the change in these differences between 2017-2019 and 2020. ## Repository Structure In this repository we stored all the codes needed to reproduce the figures and tables in the paper as well as supplementary material. All codes were written with the R language, version 4.2.1. Here is the repository structure. ## /Users/eugeniopaglino/Dropbox/Upenn/GitHub/pandemic-immigrant-mortality ## ├── R ## │ ├── compWithWhitesPlots.Rmd ## │ ├── contributionsPlots.Rmd ## │ ├── createComorbTable.Rmd ## │ ├── createComorbTableComp.Rmd ## │ ├── createFinalTable.Rmd ## │ ├── createMortTable.Rmd ## │ ├── createPopTableCDC.Rmd ## │ ├── createPropTable.Rmd ## │ ├── mortalityDifferentialsPlots.Rmd ## │ ├── mortalityDifferentialsPlotsNoRatios.Rmd ## │ ├── mortalityRatiosPlots.Rmd ## │ ├── relativeComorbTable.Rmd ## │ └── sampleAndSummaryTables.Rmd ## ├── README.Rmd ## ├── README.md ## ├── data ## │ ├── input ## │ │ ├── ACS20171Year ## │ │ │ ├── ACS20171Year.dat ## │ │ │ └── ACS20171Year.xml ## │ │ ├── ACS20181Year ## │ │ │ ├── ACS20181Year.dat ## │ │ │ └── ACS20181Year.xml ## │ │ ├── ACS20191Year ## │ │ │ ├── ACS20191Year.dat ## │ │ │ └── ACS20191Year.xml ## │ │ ├── ACSDescStats ## │ │ │ ├── ACS17to19.dat ## │ │ │ └── ACS17to19.xml ## │ │ ├── CDCPopEstimates ## │ │ │ ├── CDCPopEstimates2017-2020.txt ## │ │ │ └── CDCPopEstimates2021.txt ## │ │ ├── CDCPopEstimatesSingleRace ## │ │ │ ├── SingleRacePop2017-2020.txt ## │ │ │ └── SingleRacePop2021.txt ## │ │ ├── ICD10CMCodes ## │ │ │ ├── ICD10CMCodesAll.txt ## │ │ │ └── ICD10CMCodesDetailed.txt ## │ │ ├── mort2017 ## │ │ │ ├── 2014 Mortality History File Contents.doc ## │ │ │ ├── MULT2017.USPSAllCnty.zip ## │ │ │ ├── MULT2017.USPSAllCnty.zip.exe ## │ │ │ ├── MULT2017USAllCnty.txt ## │ │ │ └── Mort2005ToPresentAllCntyLayout_NAPHSIS.doc ## │ │ ├── mort2018 ## │ │ │ ├── MULT2018.USPSAllCnty.zip.exe ## │ │ │ ├── MULT2018USAllCnty.txt ## │ │ │ ├── Mort2005ToPresentAllCntyLayout_NAPHSIS.doc ## │ │ │ └── Mort2018PS.AllCnty.txt ## │ │ ├── mort2019 ## │ │ │ ├── MULT2019.USPSAllCnty.zip.exe ## │ │ │ ├── MULT2019PS.AllCnty.txt ## │ │ │ ├── MULT2019USAllCnty.txt ## │ │ │ └── Mort2005ToPresentAllCntyLayout_NAPHSIS.doc ## │ │ └── mort2020 ## │ │ ├── MULT2020.PSAllCnty.txt ## │ │ ├── MULT2020.USPSAllCnty.zip ## │ │ ├── MULT2020.USPSAllCnty.zip.exe ## │ │ ├── MULT2020USAllCnty.txt ## │ │ └── Mort_2005ToPresentAllCntyLayout_NAPHSIS.doc ## │ └── output ## │ ├── ageStdRates.csv ## │ ├── ageStdRates.xlsx ## │ ├── ageStdRatesAbove65.csv ## │ ├── ageStdRatesAbove65.xlsx ## │ ├── ageStdRatesBelow65.csv ## │ ├── ageStdRatesBelow65.xlsx ## │ ├── ageStdRatesForPrint.xlsx ## │ ├── ageStdRatesSens.csv ## │ ├── ageStdRatesSens.xlsx ## │ ├── comorbTableUCD.csv ## │ ├── comorbTableUCDComp.csv ## │ ├── exampleData.csv ## │ ├── finalTable.csv ## │ ├── finalTable.feather ## │ ├── mortTable.feather ## │ ├── popTable.feather ## │ ├── proportionsData.feather ## │ ├── relativeComorbTable.csv ## │ ├── standardPop.csv ## │ ├── statsForAbstract.xlsx ## │ ├── summaryTableDataFemales.csv ## │ └── summaryTableDataMales.csv ## ├── figures ## │ ├── ICDCodesTable.docx ## │ ├── USBornForeignDiffFemalesColorSafePlot.svg ## │ ├── USBornForeignDiffFemalesColorSafePlotSens.svg ## │ ├── USBornForeignDiffMalesColorSafePlot.svg ## │ ├── USBornForeignDiffMalesColorSafePlotSens.svg ## │ ├── absDiffTrendsByAgeHBarPlotRR.svg ## │ ├── absDiffTrendsSimplePlot.svg ## │ ├── absDiffTrendsSimplePlotSens.svg ## │ ├── ageStdRatesForPrint.pdf ## │ ├── contributionsTableFemales.docx ## │ ├── contributionsTableMales.docx ## │ ├── diffTrendsPlot.svg ## │ ├── diffTrendsPlotSens.svg ## │ ├── diffTrendsTable.docx ## │ ├── popTable.png ## │ ├── ratioTrendsPlot.svg ## │ ├── ratioTrendsPlot2.svg ## │ ├── ratioTrendsPlotSens.svg ## │ ├── ratioTrendsTable.png ## │ ├── relDiffTrendsHBarPlot.svg ## │ ├── relDiffTrendsHBarPlotRR.svg ## │ ├── relDiffTrendsHPlot.svg ## │ ├── relDiffTrendsSimpleHPlot.svg ## │ ├── relDiffTrendsSimpleHPlotSens.svg ## │ ├── sampleTable2Females.docx ## │ ├── sampleTable2Males.docx ## │ ├── sampleTableFemales.docx ## │ ├── sampleTableMales.docx ## │ ├── summaryTableFemales.docx ## │ └── summaryTableMales.docx ## └── paper ## └── manustricp_DR_SM.docx The `data/output` folder contains all the datasets produced in the project. The most important ones are the `ageStdRates*.csv` files that store the age standardized crude death rates by sex, race/ethnicity, nativity, cause of death, and period together with uncertainty measures for each rate. The structure of these files is described below. The remaining files are produced in the project as intermediate steps. The `figures` folder contains all figures and tables of the project. Finally, the `R` folder contains all the code used to clean and analyse the data. This repository does not contain the individual level death records because they are not publicly available. The public version of these data, in which information on place of birth has been masked, can be obtained from the [Vital Statistics Online Data Portal](https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm#Mortality_Multiple). They have the same structure as the data we used. ## Content of Main Data Files Refer to the formatted readme file in the GitHub repository for the project at https://github.com/eugeniopaglino/pandemic-immigrant-mortality. ## Description of R Code Below we briefly describe each of the R files. Ideally, they would be run in the order in whcih they are listed below. - `createMortTable.Rmd`: Reads and cleans the death records creating death counts. - `createPopTableCDC.Rmd`: Reads and cleans the population data from CDC. - `createPropTable.Rmd`: Reads and cleans the ACS data to compute proportion foreign-born by sex, race, Hispanic origin, age, and year to be applied to the CDC population estimates. - `createFinalTable.Rmd`: Combines the death and population counts in a unique cleaned file. - `sampleAndSummaryTables.Rmd`: Computes age-specific and age-standardized rates and creates the sample and summary tables (Supplementary Tables 1a, 1b, 3a, and 3b). - `mortalityDifferentialsPlotsNoRatios.Rmd`: Creates Figure 1 and Figure 2. - `compWithWhitesPlots.Rmd`: Creates Figure 3 and the corresponding Supplementary Table 4. - `contributionsPlots.Rmd`: Creates Figure 4a and 4b and the corresponding Supplementary Tables 5a and 5b. As a note to help readers navigate the scripts, we used a functional approach, defining the functions to create each figure and table at the start of the file and then running the functions and saving the output below.