Introduction
This article describes creating a Pharmacokinetics (PK) Non-compartmental analysis (NCA) ADaM (ADNCA/ADPC) or a Population PK ADaM (ADPPK). The first part of the article describes the NCA file creation while the second part describes Population PK. This initial steps for both files are very similar and could be combined in one script if desired.
Programming PK NCA (ADPC/ADNCA) Analysis Data
The Non-compartmental analysis (NCA) ADaM uses the CDISC
Implementation Guide (https://www.cdisc.org/standards/foundational/adam/adamig-non-compartmental-analysis-input-data-v1-0).
This example presented uses underlying EX
and
PC
domains where the EX
and PC
domains represent data as collected and the ADPC
ADaM is
output. However, the example can be applied to situations where an
EC
domain is used as input instead of EX
and/or ADNCA
or another ADaM is created.
One of the important aspects of the dataset is the derivation of relative timing variables. These variables consist of nominal and actual times, and refer to the time from first dose or time from most recent reference dose. The reference dose for pre-dose records may be the upcoming dose. The CDISC Implementation Guide makes use of duplicated records for analysis, which allows the same record to be used both with respect to the previous dose and the next upcoming dose. This is illustrated later in this vignette.
Here are the relative time variables we will use. These correspond to the names in the CDISC Implementation Guide.
Variable | Variable Label |
---|---|
NFRLT | Nom. Rel. Time from Analyte First Dose |
AFRLT | Act. Rel. Time from Analyte First Dose |
NRRLT | Nominal Rel. Time from Ref. Dose |
ARRLT | Actual Rel. Time from Ref. Dose |
MRRLT | Modified Rel. Time from Ref. Dose |
Note: All examples assume CDISC SDTM and/or ADaM format as input unless otherwise specified.
ADPC
Programming Workflow
Read in Data
To start, all data frames needed for the creation of
ADPC
should be read into the environment. This will be a
company specific process. Some of the data frames needed may be
PC
, EX
, and ADSL
.
Additional domains such as VS
and LB
may be
used for additional baseline variables if needed. These may come from
either the SDTM or ADaM source.
For the purpose of example, the CDISC Pilot SDTM and ADaM datasets—which are included in pharmaversesdtm—are used.
library(dplyr, warn.conflicts = FALSE)
library(admiral)
library(pharmaversesdtm)
library(lubridate)
library(stringr)
library(tibble)
data("admiral_adsl")
data("ex")
data("pc")
data("vs")
data("lb")
adsl <- admiral_adsl
ex <- convert_blanks_to_na(ex)
# Load PC
pc <- convert_blanks_to_na(pc)
# Load VS for baseline height and weight
vs <- convert_blanks_to_na(vs)
# Load LB for baseline lab values
lb <- convert_blanks_to_na(lb) %>%
filter(LBBLFL == "Y")
# ---- Lookup tables ----
param_lookup <- tibble::tribble(
~PCTESTCD, ~PARAMCD, ~PARAM, ~PARAMN,
"XAN", "XAN", "Pharmacokinetic concentration of Xanomeline", 1,
"DOSE", "DOSE", "Xanomeline Patch Dose", 2,
)
At this step, it may be useful to join ADSL
to your
PC
and EX
domains as well. Only the
ADSL
variables used for derivations are selected at this
step. The rest of the relevant ADSL
variables will be added
later.
In this case we will keep TRTSDT
/TRTSDTM
for day derivation and TRT01P
/TRT01A
for
planned and actual treatments.
In this segment we will use derive_vars_merged()
to join
the ADSL
variables and the following admiral
functions to derive analysis dates, times and days:
derive_vars_dtm()
, derive_vars_dtm_to_dt()
,
derive_vars_dtm_to_tm()
, derive_vars_dy()
. We
will also create NFRLT
for PC
data based on
PCTPTNUM
. We will create an event ID (EVID
) of
0 for concentration records and 1 for dosing records. This is a
traditional variable that will provide a handy tool to identify records
but will be dropped from the final dataset in this example.
adsl_vars <- exprs(TRTSDT, TRTSDTM, TRT01P, TRT01A)
pc_dates <- pc %>%
# Join ADSL with PC (need TRTSDT for ADY derivation)
derive_vars_merged(
dataset_add = adsl,
new_vars = adsl_vars,
by_vars = exprs(STUDYID, USUBJID)
) %>%
# Derive analysis date/time
# Impute missing time to 00:00:00
derive_vars_dtm(
new_vars_prefix = "A",
dtc = PCDTC,
time_imputation = "00:00:00"
) %>%
# Derive dates and times from date/times
derive_vars_dtm_to_dt(exprs(ADTM)) %>%
derive_vars_dtm_to_tm(exprs(ADTM)) %>%
derive_vars_dy(reference_date = TRTSDT, source_vars = exprs(ADT)) %>%
# Derive event ID and nominal relative time from first dose (NFRLT)
mutate(
EVID = 0,
DRUG = PCTEST,
NFRLT = if_else(PCTPTNUM < 0, 0, PCTPTNUM), .after = USUBJID
)
Next we will also join ADSL
data with EX
and derive dates/times. This section uses the admiral
functions derive_vars_merged()
,
derive_vars_dtm()
, and
derive_vars_dtm_to_dt()
. Time is imputed to 00:00:00 here
for reasons specific to the sample data. Other imputation times may be
used based on study details. Here we create NFRLT
for
EX
data based on VISITDY
using
dplyr::mutate()
.
# ---- Get dosing information ----
ex_dates <- ex %>%
derive_vars_merged(
dataset_add = adsl,
new_vars = adsl_vars,
by_vars = exprs(STUDYID, USUBJID)
) %>%
# Keep records with nonzero dose
filter(EXDOSE > 0) %>%
# Add time and set missing end date to start date
# Impute missing time to 00:00:00
# Note all times are missing for dosing records in this example data
# Derive Analysis Start and End Dates
derive_vars_dtm(
new_vars_prefix = "AST",
dtc = EXSTDTC,
time_imputation = "00:00:00"
) %>%
derive_vars_dtm(
new_vars_prefix = "AEN",
dtc = EXENDTC,
time_imputation = "00:00:00"
) %>%
# Derive event ID and nominal relative time from first dose (NFRLT)
mutate(
EVID = 1,
NFRLT = case_when(
VISITDY == 1 ~ 0,
TRUE ~ 24 * VISITDY
)
) %>%
# Set missing end dates to start date
mutate(AENDTM = case_when(
is.na(AENDTM) ~ ASTDTM,
TRUE ~ AENDTM
)) %>%
# Derive dates from date/times
derive_vars_dtm_to_dt(exprs(ASTDTM)) %>%
derive_vars_dtm_to_dt(exprs(AENDTM))
Expand Dosing Records
The function create_single_dose_dataset()
can be used to
expand dosing records between the start date and end date. The nominal
time will also be expanded based on the values of EXDOSFRQ
,
for example “QD” will result in nominal time being incremented by 24
hours and “BID” will result in nominal time being incremented by 12
hours. This is a new feature of
create_single_dose_dataset()
.
Dates and times will be derived after expansion using
derive_vars_dtm_to_dt()
and
derive_vars_dtm_to_tm()
.
For this example study we will define analysis visit
(AVISIT)
based on the nominal day value from
NFRLT
and give it the format, “Day 1”, “Day 2”, “Day 3”,
etc. This is important for creating the BASETYPE
variable
later. DRUG
is created from EXTRT
here. This
will be useful for linking treatment data with concentration data if
there are multiple drugs and/or analytes, but this variable will also be
dropped from the final dataset in this example.
# ---- Expand dosing records between start and end dates ----
ex_exp <- ex_dates %>%
create_single_dose_dataset(
dose_freq = EXDOSFRQ,
start_date = ASTDT,
start_datetime = ASTDTM,
end_date = AENDT,
end_datetime = AENDTM,
nominal_time = NFRLT,
lookup_table = dose_freq_lookup,
lookup_column = CDISC_VALUE,
keep_source_vars = exprs(
STUDYID, USUBJID, EVID, EXDOSFRQ, EXDOSFRM,
NFRLT, EXDOSE, EXDOSU, EXTRT, ASTDT, ASTDTM, AENDT, AENDTM,
VISIT, VISITNUM, VISITDY, TRT01A, TRT01P, DOMAIN, EXSEQ, !!!adsl_vars
)
) %>%
# Derive AVISIT based on nominal relative time
# Derive AVISITN to nominal time in whole days using integer division
# Define AVISIT based on nominal day
mutate(
AVISITN = NFRLT %/% 24 + 1,
AVISIT = paste("Day", AVISITN),
ADTM = ASTDTM,
DRUG = EXTRT,
) %>%
# Derive dates and times from datetimes
derive_vars_dtm_to_dt(exprs(ADTM)) %>%
derive_vars_dtm_to_tm(exprs(ADTM)) %>%
derive_vars_dtm_to_tm(exprs(ASTDTM)) %>%
derive_vars_dtm_to_tm(exprs(AENDTM)) %>%
derive_vars_dy(reference_date = TRTSDT, source_vars = exprs(ADT))
Find First Dose
In this section we will find the first dose for each subject and
drug, using derive_vars_merged()
. We also create an
analysis visit (AVISIT
) based on NFRLT
. The
first dose datetime for an analyte FANLDTM
is calculated as
the minimum ADTM
from the dosing records by subject and
drug.
# ---- Find first dose per treatment per subject ----
# ---- Join with ADPC data and keep only subjects with dosing ----
adpc_first_dose <- pc_dates %>%
derive_vars_merged(
dataset_add = ex_exp,
filter_add = (EXDOSE > 0 & !is.na(ADTM)),
new_vars = exprs(FANLDTM = ADTM),
order = exprs(ADTM, EXSEQ),
mode = "first",
by_vars = exprs(STUDYID, USUBJID, DRUG)
) %>%
filter(!is.na(FANLDTM)) %>%
# Derive AVISIT based on nominal relative time
# Derive AVISITN to nominal time in whole days using integer division
# Define AVISIT based on nominal day
mutate(
AVISITN = NFRLT %/% 24 + 1,
AVISIT = paste("Day", AVISITN)
)
Find Reference Dose Dates Corresponding to PK Records
Use derive_vars_joined()
to find the previous dose data.
This will join the expanded EX
data with the
ADPC
based on the analysis date ADTM
. Note the
filter_join
parameter. In addition to the date of the
previous dose (ADTM_prev)
, we also keep the actual dose
amount EXDOSE_prev
and the analysis visit of the dose
AVISIT_prev
.
# ---- Find previous dose ----
adpc_prev <- adpc_first_dose %>%
derive_vars_joined(
dataset_add = ex_exp,
by_vars = exprs(USUBJID),
order = exprs(ADTM),
new_vars = exprs(
ADTM_prev = ADTM, EXDOSE_prev = EXDOSE, AVISIT_prev = AVISIT,
AENDTM_prev = AENDTM
),
join_vars = exprs(ADTM),
join_type = "all",
filter_add = NULL,
filter_join = ADTM > ADTM.join,
mode = "last",
check_type = "none"
)
Similarly, find next dose information using
derive_vars_joined()
with the filter_join
parameter as ADTM <= ADTM.join
. Here we keep the next
dose analysis date ADTM_next
, the next actual dose
EXDOSE_next
, and the next analysis visit
AVISIT_next
.
# ---- Find next dose ----
adpc_next <- adpc_prev %>%
derive_vars_joined(
dataset_add = ex_exp,
by_vars = exprs(USUBJID),
order = exprs(ADTM),
new_vars = exprs(
ADTM_next = ADTM, EXDOSE_next = EXDOSE, AVISIT_next = AVISIT,
AENDTM_next = AENDTM
),
join_vars = exprs(ADTM),
join_type = "all",
filter_add = NULL,
filter_join = ADTM <= ADTM.join,
mode = "first",
check_type = "none"
)
Use the same method to find the previous and next nominal times. Note
that here the data are sorted by nominal time rather than the actual
time. This will tell us when the previous dose and the next dose were
supposed to occur. Sometimes this will differ from the actual times in a
study. Here we keep the previous nominal dose time
NFRLT_prev
and the next nominal dose time
NFRLT_next
. Note that the filter_join
parameter uses the nominal relative times,
e.g. NFRLT > NFRLT.join
.
# ---- Find previous nominal time ----
adpc_nom_prev <- adpc_next %>%
derive_vars_joined(
dataset_add = ex_exp,
by_vars = exprs(USUBJID),
order = exprs(NFRLT),
new_vars = exprs(NFRLT_prev = NFRLT),
join_vars = exprs(NFRLT),
join_type = "all",
filter_add = NULL,
filter_join = NFRLT > NFRLT.join,
mode = "last",
check_type = "none"
)
# ---- Find next nominal time ----
adpc_nom_next <- adpc_nom_prev %>%
derive_vars_joined(
dataset_add = ex_exp,
by_vars = exprs(USUBJID),
order = exprs(NFRLT),
new_vars = exprs(NFRLT_next = NFRLT),
join_vars = exprs(NFRLT),
join_type = "all",
filter_add = NULL,
filter_join = NFRLT <= NFRLT.join,
mode = "first",
check_type = "none"
)
Combine PC and EX Records and Derive Relative Time Variables
Combine PC
and EX
records and derive the
additional relative time variables. Often NCA data will keep both dosing
and concentration records. We will keep both here. Sometimes you will
see ADPC
with only the concentration records. If this is
desired, the dosing records can be dropped before saving the final
dataset. We will use the admiral function
derive_vars_duration()
to calculate the actual relative
time from first dose (AFRLT
) and the actual relative time
from most recent dose (ARRLT
). Note that we use the
parameter add_one = FALSE
here. We will also create a
variable representing actual time to next dose (AXRLT
)
which is not kept, but will be used when we create duplicated records
for analysis for the pre-dose records. For now, we will update missing
values of ARRLT
corresponding to the pre-dose records with
AXRLT
, and dosing records will be set to zero.
We also calculate the reference dates FANLDTM
(First
Datetime of Dose for Analyte) and PCRFTDTM
(Reference
Datetime of Dose for Analyte) and their corresponding date and time
variables.
We calculate the maximum date for concentration records and only keep the dosing records up to that date.
# ---- Combine ADPC and EX data ----
# Derive Relative Time Variables
adpc_arrlt <- bind_rows(adpc_nom_next, ex_exp) %>%
group_by(USUBJID, DRUG) %>%
mutate(
FANLDTM = min(FANLDTM, na.rm = TRUE),
min_NFRLT = min(NFRLT_prev, na.rm = TRUE),
maxdate = max(ADT[EVID == 0], na.rm = TRUE), .after = USUBJID
) %>%
arrange(USUBJID, ADTM) %>%
ungroup() %>%
filter(ADT <= maxdate) %>%
# Derive Actual Relative Time from First Dose (AFRLT)
derive_vars_duration(
new_var = AFRLT,
start_date = FANLDTM,
end_date = ADTM,
out_unit = "hours",
floor_in = FALSE,
add_one = FALSE
) %>%
# Derive Actual Relative Time from Reference Dose (ARRLT)
derive_vars_duration(
new_var = ARRLT,
start_date = ADTM_prev,
end_date = ADTM,
out_unit = "hours",
floor_in = FALSE,
add_one = FALSE
) %>%
# Derive Actual Relative Time from Next Dose (AXRLT not kept)
derive_vars_duration(
new_var = AXRLT,
start_date = ADTM_next,
end_date = ADTM,
out_unit = "hours",
floor_in = FALSE,
add_one = FALSE
) %>%
mutate(
ARRLT = case_when(
EVID == 1 ~ 0,
is.na(ARRLT) ~ AXRLT,
TRUE ~ ARRLT
),
# Derive Reference Dose Date
PCRFTDTM = case_when(
EVID == 1 ~ ADTM,
is.na(ADTM_prev) ~ ADTM_next,
TRUE ~ ADTM_prev
)
) %>%
# Derive dates and times from datetimes
derive_vars_dtm_to_dt(exprs(FANLDTM)) %>%
derive_vars_dtm_to_tm(exprs(FANLDTM)) %>%
derive_vars_dtm_to_dt(exprs(PCRFTDTM)) %>%
derive_vars_dtm_to_tm(exprs(PCRFTDTM))
For nominal relative times we calculate NRRLT
generally
as NFRLT - NFRLT_prev
and NXRLT
as
NFRLT - NFRLT_next
.
Derive Analysis Variables
Using dplyr::mutate
we derive a number of analysis
variables including analysis value (AVAL
), analysis time
point (ATPT
) analysis timepoint reference
(ATPTREF
) and baseline type (BASETYPE
).
We set ATPT
to PCTPT
for concentration
records and to “Dose” for dosing records. The analysis timepoint
reference ATPTREF
will correspond to the dosing visit. We
will use AVISIT_prev
and AVISIT_next
to
derive. The baseline type will be a concatenation of
ATPTREF
and “Baseline” with values such as “Day 1
Baseline”, “Day 2 Baseline”, etc. The baseline flag ABLFL
will be set to “Y” for pre-dose records.
Analysis value AVAL
in this example comes from
PCSTRESN
for concentration records. In addition we are
including the dose value EXDOSE
for dosing records and
setting BLQ (Below Limit of Quantitation) records to 0 before the first
dose and to 1/2 of LLOQ (Lower Limit of Quantitation) for records after
first dose. (Additional tests such as whether more than 1/3 of records
are BLQ may be required and are not done in this example.) We also
create a listing-ready variable AVALCAT1
which includes the
“BLQ” record indicator and formats the numeric values to three
significant digits.
We derive actual dose DOSEA
based on
EXDOSE_prev
and EXDOSE_next
and planned dose
DOSEP
based on the planned treatment TRT01P
.
In addition we add the units for the dose variables and the relative
time variables.
# ---- Derive Analysis Variables ----
# Derive ATPTN, ATPT, ATPTREF, ABLFL and BASETYPE
# Derive planned dose DOSEP, actual dose DOSEA and units
# Derive PARAMCD and relative time units
# Derive AVAL, AVALU and AVALCAT1
adpc_aval <- adpc_nrrlt %>%
mutate(
PARCAT1 = PCSPEC,
ATPTN = case_when(
EVID == 1 ~ 0,
TRUE ~ PCTPTNUM
),
ATPT = case_when(
EVID == 1 ~ "Dose",
TRUE ~ PCTPT
),
ATPTREF = case_when(
EVID == 1 ~ AVISIT,
is.na(AVISIT_prev) ~ AVISIT_next,
TRUE ~ AVISIT_prev
),
# Derive baseline flag for pre-dose records
ABLFL = case_when(
ATPT == "Pre-dose" ~ "Y",
TRUE ~ NA_character_
),
# Derive BASETYPE
BASETYPE = paste(ATPTREF, "Baseline"),
# Derive Actual Dose
DOSEA = case_when(
EVID == 1 ~ EXDOSE,
is.na(EXDOSE_prev) ~ EXDOSE_next,
TRUE ~ EXDOSE_prev
),
# Derive Planned Dose
DOSEP = case_when(
TRT01P == "Xanomeline High Dose" ~ 81,
TRT01P == "Xanomeline Low Dose" ~ 54
),
DOSEU = "mg",
) %>%
# Derive relative time units
mutate(
FRLTU = "h",
RRLTU = "h",
# Derive PARAMCD
PARAMCD = coalesce(PCTESTCD, "DOSE"),
ALLOQ = PCLLOQ,
# Derive AVAL
AVAL = case_when(
EVID == 1 ~ EXDOSE,
PCSTRESC == "<BLQ" & NFRLT == 0 ~ 0,
PCSTRESC == "<BLQ" & NFRLT > 0 ~ 0.5 * ALLOQ,
TRUE ~ PCSTRESN
),
AVALU = case_when(
EVID == 1 ~ EXDOSU,
TRUE ~ PCSTRESU
),
AVALCAT1 = if_else(PCSTRESC == "<BLQ", PCSTRESC, prettyNum(signif(AVAL, digits = 3))),
) %>%
# Add SRCSEQ
mutate(
SRCDOM = DOMAIN,
SRCVAR = "SEQ",
SRCSEQ = coalesce(PCSEQ, EXSEQ)
)