Clinical trial design: overview

RCT

Notes related to clinical trial design.

Author

Chi Zhang

Published

June 1, 2023

Considerations: choice of comparator and trial outcome measures (due to the grrowing number of treatmenet options, standard care would change over time), annd definition of target patient population (e.g. molecular profiling makes it possible to identify smaller subgroups of patient with defined tumor type).

Phases

Typically takes 5-15 years from inception to being recommeded in routine care

Phase 1: 10-30, identify tolerable dose, safety, information on drug metabolism, extretion and toxicity. Often not controlled and open-label.

Phase 2: 30-100 (per group), efficacy, further assessment of safety and side effects, can be controlled or single arm

Phase 3: 100+, definite evidence, must be randomized

Phase 4: 1000+, (postmarketing surveillance) monitor efficacy and safety in the population, may identify uncommon adverse events unseen in 2 and 3. Not usually randomized

Patients

Major criteria for patient recruitment (oncology)

diagnosis of cancer(s) of interest
adequaute fitness to tolerate the treatment, determined by performance status PS (often correlated with overall survival)
absense of relevant comobidity
no previous exposure to treatment similar to the experiment treatment

Types of design

Population have the disease outcome of interest; not healthy voluteers vs diseased.

Randomisation unit: persons, two eyes of a person, or groups of persons

Comparison structure: parallel, crossover, group allocation

Parallel: simultaneous treatment and control groups, subjects randomly assigned to one group.
Crossover: randomize of order in which treatments are received; TC or CT. Each patient is his/her own control. Washout period: time between two treatments.
- Variability reduced because less variability within patient than between patients. Fewer patients needed.
- Disadvantages: only certain treatments can use crossover design, treatment can’t have permanent effects. Carry-over effects from first period; washout needs to be long enough. Dropouts more significant, analysis may be more difficult: correlated outcomes.
- Constant intensity of underlying disease: chronic diseases (e.g. asthma, hypertension, arthritis) + short-term treatment effects (relief of signs or symptoms)
- e.g. morning dose vs evening dose
Group allocation: a group of subjects (community, school, clinic).

Extensions of the parallel design: factorial, large simple

Factorial: two interventions tested simultaneously. Can be presented in a 2 by 2 table (treatment A +-, treatment B +-); or 3 by 2 etc.
- Interested in main effect (if no interaction expected). A vs no A; B vs no B. The other treatment doesn’t matter.
Large simple: large number of patients, possibly from many study sites.

Tests other than superiority

Equivalency: intervention response is close to control group response
Non-inferiority: Treatment A (new) is at least as good as B (established). One-sided test, if A is worse than B, one can be rejected. Does not require as big sample size.

Adaptive design

Possible adaptations

randomization probabilities
sample size (e.g. group sequential methods)
visit schedule: shorten/lengthen follow-up time, change number of timing of visits, treatments (dose/duration, concomitant meds)
hypothesis tested

Randomisation and masking

Rationale:

avoid selection bias: prognostic factors related to treatment assignment
tends to produce comparable treatment groups

Schemes

Simple randomization, restricted randomization, adaptive randomization

Simple rz

Each assignment is unpredictable, number of patients in each group should be equal in the long run.

Risks: imbalances in number assigned to treatment groups, or confounding factors (gender, disease severity) -> reduced power

Restricted rz

Schemes with constraints to produce expected assignment ratio

blocking
stratification

Blocking. Block of size 2 with treatment allocation ratio 1:1: A,B. Size 4: 2As, 2Bs. Need to be permuted: AABB, ABAB, … in total 6 combinations. Then choose one of the permutations.

Stratification. Ensure balance in treatment assignments with subgroups defined before rz. Limit to a few variables (highly related to outcome and/or logistical): e.g. clinic in a multicenter trial, surgeon (skills, procedures), stage of disease, demographic such as gender and age.

Use these two together.

Adaptive rz

Probability of assignment does not remain constant, but determined by the current balance and composition of the groups.

minimization: choose the design that gives the smallest imbalance.
play the winner: change allocation ratio or favor the better treatment based on the primary outcome. Need to evaluate outcomes relatively quickly.

Masking (blinding)

Treatment assignment is not known after rz.

patient, clinical personnel, evaluators, data processors, …
single (only participant), double (+ investigator), triple (+ data processors, …), quadruple …

Purpose: remove bias related to treatment effects.

Different levels of masking protects to different extent against bias in different aspects

data reporting
data collection / follow-up
testing, behaviors
outcome assessment

Decision to mask treatments

ethical?
possible? can you make the treatment seem identical so the participants do not know?
trial design features: more important to mask subjective ones (e.g. alive or dead is the least subjective, hence wouldn’t benefit much; however if participants need to report effects that are not objectively measureable, they might report that treatment is better in contrast to placebo group)
feasible? cost-benefit, practicality (adherence)

Sometimes investigators in a double blind study might know which treatment is being assigned to participants, if the effect of drug is very obvious (both good or bad).

Unmasking

Planned: inform participants once the trial finished
Unplanned (discouraged): in the event of adverse event

Outcome / Endpoints

Outcome: endpoint. It is a quantitaive measure.

efficacy / effectiveness
patient reported outcome (QoL)
safety (adverse evvennts, toxicity)
process, adherence to treatment
costs

Example: evaluate treatment for asthma

Outcomes: exhaled nitrous oxide, lung function (spirometry measures), asthma symptoms (wheezing, night awakenings), …

Example: evaluate a procedure to reduce perioperative morbidity

Outcome considerations: time window (what is postoperative), specific events to be considered an outcome, procedures to establish outcomes, …

Metrics for events as outcomes

dichotomous: 1/0 for presence absense, normal abnormal; clinical state or cut-off value
time-to-event: in addition to dichotomous, add time dimension; allow for censoring. More powerful than dichotomous.
rates: 1/0 but allow for repeats, analyze count or rate. Events within a person are usually not independent, need to account for it.
continuous variables: value or change from baseline; standard units (lab values, scores). Need to define an important difference. Distributional assumptions more important.
ordinal scale: ranked categories (e.g. adverse event grading, 1-5). Difference between categories is usually qualitative.

Common time-to-event outcomes (in cancer)

Ouutcome	Definition of event
Overall survival (OS)	Death from any cause
Disease (relapse-free) survival	First recurrennce / relapse of the cancer; death from any cause
Progression-free survival, duratinon of response, duration of clinical benefit	First sign of cancer progression; death from any cause
Cancer-specific survival	Death from the cancer of interest
Time to treatment failure (TTF)	First sign of cancer progression; death from any cause; stopped trial treatment

Patients reported outcomes are subjective

health status / change in status, e.g. pain relief, quality of life
masking is more important
hawthorne / placebo effect: effect of being studies, usually positive
quantify with standardized scales

Influence of outcomes on design

Efficacy vs effectiveness:

In a vaccine trial, efficacy is the clinical case with lab confirmation; effectivenenss is the clinical case of influenza in a larger population, may or may not be confirmed.

In asthma, efficacy is FEV1, effectiveness is the decrease of the hospitalizations/steroid courses.

Considerations (3Bs)

biology: does outcome reflect a clinically relevant fact/change
biostatistics: detectable difference between groups is plausible and practical
budget: afford total N and can measure it reliably in every participant

Example: HIV trial outcomes

survival (deaths; AIDS status)
immunologic response
virologic response
change in patient status (e.g QoL)
specified toxicity
other side effects

Choice of primary outcome depends on the objectives or stage of research

phase 1, emphasis on safety
phase 2, short-term efficacy
phase 3, long-term efficacy
phase 4, long-term effectiveness

Statistical analysis

Intention to treat ITT

Cross-overs after rz: some patients might have a treatment (yes or no) beyond what they were assigned to, e.g. refuse surgery or medical treatment

Non-adherence during followup: some in treatment group refuse of can not tolerate certain treatment; while some in placebo group require medication or take on their own

Subgroup analysis

Stratified analysis: estimate treatment effect separately in subgroups. Does not tell difference across different subgroups

Test for interaction: use of main effect and interaction.

Issue of multiple testing when doing a series of analyses

inflate sample size to plan for subgroup analysis
report number of subgroup analyses performed
possibly adjust for multiple comparisons
report CI instead of just p-values

Resources

Coursera course Design and interpretation of clinical trials by Johns Hopkins University
Book Fast Facts: Clinical trials in oncology: The fundamentals