Visualizing Lab and Phenotype Associations Using PheWAS and Electronic Health Records 1 Emerson ,

2 Goldman ,

3 Kolli

Brenda Miriam Sahiti University of Michigan Big Data Summer Institute 2017 1. Bowling Green State University, 2. Arizona State University, 3. University of Illinois

6. Results

1. Background and Objective

4. Methods - Significance

Digitization of patient records is becoming much more common, 86.9% of doctors use electronic health records (EHR). Records include information on diagnoses, lab values, and treatment history. Electronic health records provide longitudinal data over a large sample size and are relatively easy to obtain. Data often has entry errors, missing information on care outside of the system, and incomplete consent.

• A large number of association tests requires an FDR (False Discovery rate) cut off of .0001 to choose a p-value cut off

Objective: • Look for novel phenotype associations with mean lab values • For a large proportion of labs, not all patients were tested so we wanted to test whether the presence of a lab has associations with a phenotype • Creating an interactive application to visual the associations between labs and phenotype

Acknowledgments Philip S. Boonstra, Ph.D. Research Assistant Professor; Zhenke Wu, Ph.D., Assistant Professor; Matt Zawistowski, Ph.D., Research Area Specialist; Xutong Zhao, Graduate Student Instructor; University of Michigan Big Data Summer Institute; Michigan Genomics Initiative

• This provided a p-value of 5e-05 for the mean test and 1.63e-05 for the presence test. This result was verified by running a permutation test. 4e-4 the total permuted values were significant with that p-value respectively. This is the line that appears in the shiny app. FDR Histogram p−value density histogram

Table 2: Potentially Novel Associations PheCode Phenotype

Lab Lipase (LIP)

385.5

Prolactin (PRL) B-Type Natriuretic Peptide (BNP)

-log(p-value) 10.229

41.2

Tympanosclerosis and middle ear disease related to otitis media Streptococcus infection

145.1

Cancer of the lip

18.747

7.75

4

3

Variables density

Density

2. The Data

q−values local FDR ^ 0 = 0.808 π

2

Figure 3: PheWAS plot for lab values, showing Lipase Lab

1

• High levels of lipase are related to problems with the pancreas. We did not expect to see that tympanosclerosis and middle ear disease was significant (Figure 3).

^ 0 = 0.808 π

0

• Electronic health records obtained through the Michigan Genomics Initiative • Analyzed 197 numeric labs present in at least 5% of the data • Phenotype Wide Association Studies (PheWAS) analysis are run on ICD9 codes which are diagnosis codes for each patient 3. Methods - Model • A PheWas package to create a matrix of codes for phenotypes and whether or not patients were diagnosed with that phenotype was used • Created two linear models each controlling for age and sex. The first model tested the association between mean lab values for each patient for each given lab and resulting phenotype. The second model tested whether presence of a lab has an effect on resulting phenotype and subsetted the data for those who had followup times greater than one year • Initially, a logistic model was used, but due to the expense of running 350,000+ tests with 18,402 observations, a linear model was switched to

0.00

0.25

Linear Model -log(p-values)

1.00

• Cancer of the lip was highly related to the B-Type Natriuretic Peptide (BNP) test. BNP is released when the heart is working hard to pump blood, so high levels of BNP are used to detect and diagnose heart failure (Table 2).

5. Methods - Visualization • To visualize the data, an interactive shiny web application in R was created, which includes two PheWAS plot, one for each model • When looking at one graph, the user can choose which lab code they wish to see plotted, can select or hover over the points they are interested in, and can search a specific PheCode and have the p-value returned • The application displays two data tables ordered from most significant p-value to least significant p-value; one for each model. These data tables only include the points with p-values higher than the cutoff line. • This application gives researchers the opportunity to quickly see data and results relevant to labs or diseases of study Lab

Figure 1: The p-values for both the linear and logistic models were similar. Since we are only looking for associations, and not trying to predict the phenotype a linear model does not interfere with the conclusions.

0.75

p-value Figure 2: Large number of significant tests, and null test uniformly distributed

Linear model vs Logistic Model -log(p-value)

Logistic Model - log(p-values)

0.50

p−value

Hemoglobin A1C (A1C) Hemoglobin A1C (A1C) Body Mass Index (BMI) Creatinine Level (CREAT) Glucose Level (GLUC) Hemoglobin (HGB) Urea Nitrogen (UN) Red Cell Distribution Width (RDW)

Top Hits Table PheCode Phenotype 250.1 250.2 278.1 585 250 285 285.2 280

-log(p-value)

Type 1 Diabetes Type 2 Diabetes Obesity Renal Failure Diabetes mellitus Other anemias Anemia of chronic disease

>325 > 325 > 325 > 325 > 325 > 325 320.171

Iron Deficiency Anemias

289.63

Table 1: A number of most significant points from PheWAS, many of these were expected, such as A1C and Diabetes.

Figure 4: PheWAS plot for missing values showing Cholesterol Lab for those with followup greater than one year.

• Figure 4 coincides with what we would expect for cholesterol labs as the significant values are mostly for those with respiratory or circulatory problems Disscussion: In the future we would like to do a meta analysis and combine the two models. More research into our unanticipated results would have to be done to confirm any relationships; follow up studies are required to confirm these results. 7. Bibliography 1. Maisel, Alan S., et al. ”Rapid measurement of B-type natriuretic peptide in the emergency diagnosis of heart failure.” New England Journal of Medicine, 347.3 (2002): 161-167. 2. Winkler, F. K., A. D. Arcy, and W. Hunziker. ”Structure of human pancreatic lipase.” Nature 343.6260 (1990): 771.

EHR - Emerson, Goldman, Kolli.pdf

Connect more apps... Try one of the apps below to open or edit this item. EHR - Emerson, Goldman, Kolli.pdf. EHR - Emerson, Goldman, Kolli.pdf. Open. Extract.

1MB Sizes 13 Downloads 223 Views

Recommend Documents

Goldman Sachs
Sep 17, 2015 - and ADI, many with a historical background in both analog and communications semi chips). Finally, there will be continued growth in many of the ...... In early 2014, AT&T announced initiatives including a connected car center in Atlan

Goldman Sachs
Sep 17, 2015 - Redbend offers connected car software management, firewalls between infotainment systems and other ECUs, risk assessments consulting, zero-day vulnerability mitigation, sandboxing, and ECU consolidation using virtualization technology.

Nurse EHR Rankings 2018.pdf
6 EPIC SYSTEMS 80% 95% 93% 83% 88%. 7 HEALTHLAND ... 4 OPERATING ROOM 84% 81% 86% 91% 86% ... Displaying Nurse EHR Rankings 2018.pdf.

EHR - Olson, Gui, Madapoosi.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. EHR - Olson ...

Emerson 14-15 SIP Plan.pdf
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Emerson 14-15 SIP Plan.pdf. Emerson 14-15 SIP Plan.pdf. Open.

Emerson Elementary 2015-2016 SIP.pdf
Alliance Public Schools. Mrs. Susan M. Cummings, Principal. 700 Black Hills. Alliance, NE 69301. Document Generated On February 28, 2017. Page 1 of 13 ...

EHR - Olson, Gui, Madapoosi.pdf
Operating on a patient with hepatitis C. Canadian Journal of Surgery,52(4), ... data and genome-wide association study data. ... Type 2 Diabetes 3e-10 0.77.

emerson lake palmer 1974.pdf
Loading… Page 1. Whoops! There was a problem loading more pages. emerson lake palmer 1974.pdf. emerson lake palmer 1974.pdf. Open. Extract. Open with.

Lio and Goldman
This article cites 66 articles, 30 of which can be accessed free at: service ...... be most successful. Yang (1994a) showed that for large data sets the general ...

Robert Goldman - Montgomery Housing Partnership
Master's degree in Real Estate from Johns Hopkins University, and his Bachelor's degree in. Public Policy from Duke University. A native Washingtonian ...

Emerson, The Method of Nature.pdf
Emerson, The Method of Nature.pdf. Emerson, The Method of Nature.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Emerson, The Method of ...

ralph waldo emerson books pdf
Page 1 of 1. File: Ralph waldo emerson books pdf. Download now. Click here if your download doesn't start automatically. Page 1 of 1. ralph waldo emerson books pdf. ralph waldo emerson books pdf. Open. Extract. Open with. Sign In. Main menu. Displayi

Emerson, The Young American.pdf
goods in the United States? This rage for road building is beneficent for America, where vast distance is so main a consideration in our. domestic politics and ...

TSS 2018 Emerson WS.pdf
Feb 25, 2018 - Lylli Anthon has been a grades teacher at the Halton Waldorf School near Toronto, Ontario for over 25 years. She. has taken one class from grades 2 to 4 and another from grades 1 to 8, and then concentrated on grades 6, 7, and 8,. havi

Goldman Salvatore - Caudillismos Rioplatenses.PDF
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Goldman ...