Clinical trial design: overview
Notes related to clinical trial design.
Considerations: choice of comparator and trial outcome measures (due to the grrowing number of treatmenet options, standard care would change over time), annd definition of target patient population (e.g. molecular profiling makes it possible to identify smaller subgroups of patient with defined tumor type).
Phases
Typically takes 5-15 years from inception to being recommeded in routine care
Phase 1: 10-30, identify tolerable dose, safety, information on drug metabolism, extretion and toxicity. Often not controlled and open-label.
Phase 2: 30-100 (per group), efficacy, further assessment of safety and side effects, can be controlled or single arm
Phase 3: 100+, definite evidence, must be randomized
Phase 4: 1000+, (postmarketing surveillance) monitor efficacy and safety in the population, may identify uncommon adverse events unseen in 2 and 3. Not usually randomized
Patients
Major criteria for patient recruitment (oncology)
- diagnosis of cancer(s) of interest
- adequaute fitness to tolerate the treatment, determined by performance status PS (often correlated with overall survival)
- absense of relevant comobidity
- no previous exposure to treatment similar to the experiment treatment
Types of design
Population have the disease outcome of interest; not healthy voluteers vs diseased.
Randomisation unit: persons, two eyes of a person, or groups of persons
Comparison structure: parallel, crossover, group allocation
- Parallel: simultaneous treatment and control groups, subjects randomly assigned to one group.
- Crossover: randomize of order in which treatments are received; TC or CT. Each patient is his/her own control. Washout period: time between two treatments.
- Variability reduced because less variability within patient than between patients. Fewer patients needed.
- Disadvantages: only certain treatments can use crossover design, treatment can’t have permanent effects. Carry-over effects from first period; washout needs to be long enough. Dropouts more significant, analysis may be more difficult: correlated outcomes.
- Constant intensity of underlying disease: chronic diseases (e.g. asthma, hypertension, arthritis) + short-term treatment effects (relief of signs or symptoms)
- e.g. morning dose vs evening dose
- Group allocation: a group of subjects (community, school, clinic).
Extensions of the parallel design: factorial, large simple
- Factorial: two interventions tested simultaneously. Can be presented in a 2 by 2 table (treatment A +-, treatment B +-); or 3 by 2 etc.
- Interested in main effect (if no interaction expected). A vs no A; B vs no B. The other treatment doesn’t matter.
- Large simple: large number of patients, possibly from many study sites.
Tests other than superiority
Equivalency: intervention response is close to control group response
Non-inferiority: Treatment A (new) is at least as good as B (established). One-sided test, if A is worse than B, one can be rejected. Does not require as big sample size.
Adaptive design
Possible adaptations
- randomization probabilities
- sample size (e.g. group sequential methods)
- visit schedule: shorten/lengthen follow-up time, change number of timing of visits, treatments (dose/duration, concomitant meds)
- hypothesis tested
Randomisation and masking
Rationale:
- avoid selection bias: prognostic factors related to treatment assignment
- tends to produce comparable treatment groups
Schemes
Simple randomization, restricted randomization, adaptive randomization
Simple rz
Each assignment is unpredictable, number of patients in each group should be equal in the long run.
Risks: imbalances in number assigned to treatment groups, or confounding factors (gender, disease severity) -> reduced power
Restricted rz
Schemes with constraints to produce expected assignment ratio
- blocking
- stratification
Blocking. Block of size 2 with treatment allocation ratio 1:1: A,B. Size 4: 2As, 2Bs. Need to be permuted: AABB, ABAB, … in total 6 combinations. Then choose one of the permutations.
Stratification. Ensure balance in treatment assignments with subgroups defined before rz. Limit to a few variables (highly related to outcome and/or logistical): e.g. clinic in a multicenter trial, surgeon (skills, procedures), stage of disease, demographic such as gender and age.
Use these two together.
Adaptive rz
Probability of assignment does not remain constant, but determined by the current balance and composition of the groups.
- minimization: choose the design that gives the smallest imbalance.
- play the winner: change allocation ratio or favor the better treatment based on the primary outcome. Need to evaluate outcomes relatively quickly.
Masking (blinding)
Treatment assignment is not known after rz.
- patient, clinical personnel, evaluators, data processors, …
- single (only participant), double (+ investigator), triple (+ data processors, …), quadruple …
Purpose: remove bias related to treatment effects.
Different levels of masking protects to different extent against bias in different aspects
- data reporting
- data collection / follow-up
- testing, behaviors
- outcome assessment
Decision to mask treatments
- ethical?
- possible? can you make the treatment seem identical so the participants do not know?
- trial design features: more important to mask subjective ones (e.g. alive or dead is the least subjective, hence wouldn’t benefit much; however if participants need to report effects that are not objectively measureable, they might report that treatment is better in contrast to placebo group)
- feasible? cost-benefit, practicality (adherence)
Sometimes investigators in a double blind study might know which treatment is being assigned to participants, if the effect of drug is very obvious (both good or bad).
Unmasking
- Planned: inform participants once the trial finished
- Unplanned (discouraged): in the event of adverse event
Outcome / Endpoints
Outcome: endpoint. It is a quantitaive measure.
- efficacy / effectiveness
- patient reported outcome (QoL)
- safety (adverse evvennts, toxicity)
- process, adherence to treatment
- costs
Example: evaluate treatment for asthma
Outcomes: exhaled nitrous oxide, lung function (spirometry measures), asthma symptoms (wheezing, night awakenings), …
Example: evaluate a procedure to reduce perioperative morbidity
Outcome considerations: time window (what is postoperative), specific events to be considered an outcome, procedures to establish outcomes, …
Metrics for events as outcomes
- dichotomous: 1/0 for presence absense, normal abnormal; clinical state or cut-off value
- time-to-event: in addition to dichotomous, add time dimension; allow for censoring. More powerful than dichotomous.
- rates: 1/0 but allow for repeats, analyze count or rate. Events within a person are usually not independent, need to account for it.
- continuous variables: value or change from baseline; standard units (lab values, scores). Need to define an important difference. Distributional assumptions more important.
- ordinal scale: ranked categories (e.g. adverse event grading, 1-5). Difference between categories is usually qualitative.
Common time-to-event outcomes (in cancer)
Ouutcome | Definition of event |
---|---|
Overall survival (OS) | Death from any cause |
Disease (relapse-free) survival | First recurrennce / relapse of the cancer; death from any cause |
Progression-free survival, duratinon of response, duration of clinical benefit | First sign of cancer progression; death from any cause |
Cancer-specific survival | Death from the cancer of interest |
Time to treatment failure (TTF) | First sign of cancer progression; death from any cause; stopped trial treatment |
Patients reported outcomes are subjective
- health status / change in status, e.g. pain relief, quality of life
- masking is more important
- hawthorne / placebo effect: effect of being studies, usually positive
- quantify with standardized scales
Influence of outcomes on design
Efficacy vs effectiveness:
In a vaccine trial, efficacy is the clinical case with lab confirmation; effectivenenss is the clinical case of influenza in a larger population, may or may not be confirmed.
In asthma, efficacy is FEV1, effectiveness is the decrease of the hospitalizations/steroid courses.
Considerations (3Bs)
- biology: does outcome reflect a clinically relevant fact/change
- biostatistics: detectable difference between groups is plausible and practical
- budget: afford total N and can measure it reliably in every participant
Example: HIV trial outcomes
- survival (deaths; AIDS status)
- immunologic response
- virologic response
- change in patient status (e.g QoL)
- specified toxicity
- other side effects
Choice of primary outcome depends on the objectives or stage of research
- phase 1, emphasis on safety
- phase 2, short-term efficacy
- phase 3, long-term efficacy
- phase 4, long-term effectiveness
Statistical analysis
Intention to treat ITT
Cross-overs after rz: some patients might have a treatment (yes or no) beyond what they were assigned to, e.g. refuse surgery or medical treatment
Non-adherence during followup: some in treatment group refuse of can not tolerate certain treatment; while some in placebo group require medication or take on their own
Subgroup analysis
Stratified analysis: estimate treatment effect separately in subgroups. Does not tell difference across different subgroups
Test for interaction: use of main effect and interaction.
Issue of multiple testing when doing a series of analyses
- inflate sample size to plan for subgroup analysis
- report number of subgroup analyses performed
- possibly adjust for multiple comparisons
- report CI instead of just p-values
Resources
Coursera course Design and interpretation of clinical trials by Johns Hopkins University
Book Fast Facts: Clinical trials in oncology: The fundamentals