You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A semi-supervised approach for rapidly creating clinical biomarker phenotypes in the UK Biobank, across different primary care EHR and clinical terminology systems.
The UK Biobank is making primary care electronic health records available for the entire cohort for COVID-19 research. This repository provides machine-readable versions (CSV files) of electronic health record phenotyping algorithms for 31 commonly-measured biomakers, many of which are associated with COVID-19 comorbidities such as body mass index, cardiovascular disease and respiratory disease.
Details on how these algorithms were bootstrapped and validated can be found in the publication:
Spiros Denaxas, Anoop D Shah, Bilal A Mateen, Valerie Kuan, Jennifer K Quint, Natalie Fitzpatrick, Ana Torralbo, Ghazaleh Fatemifar,
Harry Hemingway, A semi-supervised approach for rapidly creating clinical biomarker phenotypes in the UK Biobank using
different primary care EHR and clinical terminology systems, JAMIA Open, Volume 3, Issue 4, December 2020, Pages 545–556,
https://doi.org/10.1093/jamiaopen/ooaa047
Serum cholesterol measurent including total cholesterol measurements but excluding plasma-derived measurements. The phenotype includes fasting and non-fasting measurements.
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
ALP = value1
ELSE IF data_provider = Scotland (2)
ALP = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
ALP = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
ALT = value1
ELSE IF data_provider = Scotland (2)
ALT = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
ALT = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Albumin = value1
ELSE IF data_provider = Scotland (2)
Albumin = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Albumin = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Basophills = value1
ELSE IF data_provider = Scotland (2)
Basophills = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Basophills = value1
Serum C reactive protein level excluding plasma-derived measurements.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
CRP = value1
ELSE IF data_provider = Scotland (2)
CRP = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
CRP = value1
Serum calcium measurement. Excludes ionized calcium level measurements. The phenotype does not include corrected/adjusted levels.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Calcium = value1
ELSE IF data_provider = Scotland (2)
Calcium = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Calcium = value1
Serum cholesterol measurent including total cholesterol measurements but excluding plasma-derived measurements. The phenotype includes fasting and non-fasting measurements.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Cholesterol = value1
ELSE IF data_provider = Scotland (2)
Cholesterol = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Cholesterol = value1
Creatinine levels including corrected levels. Algorithm includes serum, plasma, and unspecified source measurements.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Creatinine = value1
ELSE IF data_provider = Scotland (2)
Creatinine = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Creatinine = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Eosinophills = value1
ELSE IF data_provider = Scotland (2)
Eosinophills = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Eosinophills = value1
Forced expired volume in 1 second. Excludes predicted values and measurements post-bronchodialation.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
FEV1 = value1
ELSE IF data_provider = Scotland (2)
FEV1 = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
FEV1 = value1
Forced vital capacity measurement. Excludes predictd values and measurements post-bronchodialation.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
FVC = value1
ELSE IF data_provider = Scotland (2)
FVC = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
FVC = value1
Blood-based glucose level derived from plasma. Excludes fasting measurements.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Glucose = value1
ELSE IF data_provider = Scotland (2)
Glucose = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Glucose = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
HDL = value1
ELSE IF data_provider = Scotland (2)
HDL = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
HDL = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Haematocrit perc = value1
ELSE IF data_provider = Scotland (2)
Haematocrit perc = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Haematocrit perc = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Haemoglobin conc = value1
ELSE IF data_provider = Scotland (2)
Haemoglobin conc = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Haemoglobin conc = value1
HbA1c level recorded using International Federation of Clinical Chemistry and Laboratory Medicine or Diabetes Control and Complications Trial (DCCT).
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
HbA1c = value1
ELSE IF data_provider = Scotland (2)
HbA1c = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
HbA1c = value1
Terminology
Read code
Read term
read2
42W5.00
Haemoglobin A1c level - IFCC standardised
read2
42W4.00
HbA1c level (DCCT aligned)
read2
42W..00
Hb. A1C - diabetic control
read2
42W..11
Glycosylated Hb
read2
42W..12
Glycated haemoglobin
read2
42W2.00
Hb. A1C 7-10% - borderline
read2
42W1.00
Hb. A1C < 7% - good control
read2
42WZ.00
Hb. A1C - diabetic control NOS
read2
42W3.00
Hb. A1C > 10% - bad control
ctv3
XaPbt
Haemoglobin A1c level - International Federation of Clinical Chemistry and Laboratory Medicine standardised
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Lymphocytes = value1
ELSE IF data_provider = Scotland (2)
Lymphocytes = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Lymphocytes = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
MCHb conc = value1
ELSE IF data_provider = Scotland (2)
MCHb conc = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
MCHb conc = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
MCV = value1
ELSE IF data_provider = Scotland (2)
MCV = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
MCV = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Monocytes = value1
ELSE IF data_provider = Scotland (2)
Monocytes = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Monocytes = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Neutrophills = value1
ELSE IF data_provider = Scotland (2)
Neutrophills = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Neutrophills = value1
Platelet count. Excludes platelet distribution width measurements.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Platelets = value1
ELSE IF data_provider = Scotland (2)
Platelets = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Platelets = value1
Red blood cell (RBC) count excluding nucleated RBC values.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
RBC = value1
ELSE IF data_provider = Scotland (2)
RBC = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
RBC = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Total bilirubin = value1
ELSE IF data_provider = Scotland (2)
Total bilirubin = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Total bilirubin = value1
Serum triglycerides measurements. Algorithm excludes plasma-derived measurements. Includes fasting/non-fasting/random values.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Triglycerides = value1
ELSE IF data_provider = Scotland (2)
Triglycerides = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Triglycerides = value1
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
Urea = value1
ELSE IF data_provider = Scotland (2)
Urea = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
Urea = value1
Total white cell count. Excludes polymorphonuclear leukocyte count.
Version
alpha
Implementation
IF Read v2 code = read_2
IF data_provider = England Vision (1) OR data provider = Wales (4)
WBC = value1
ELSE IF data_provider = Scotland (2)
WBC = value2
units = value3
ELSE IF CTV3 code = read_3
IF data_provider = England TPP (3)
WBC = value1