Research
Research
design and
design and
statistics (II)
statistics (II)
Observational studies
Observational studies
Wei-Chu Chie
Wei-Chu Chie
Purposes of
Purposes of
clinical/preventive research
clinical/preventive research
– Therapeutic (treatment) efficacy evaluation – Prognostic/predictive factors identification
• Possible factors associated with an clinical/preventive outcome
– Descriptive
– Causal inference
• hypothesis testing • prediction
Therapeutic efficacy
Therapeutic efficacy
evaluation
evaluation
• Exposure (cause): a certain
treatment
– drug, procedure, education, ...
• Outcome:
– the end result of care, or a measurable change in the health status or behavior of patients
Prognostic/predictive factors
Prognostic/predictive factors
identification
identification
– Exposure: certain characteristics of patient, disease, ...
• Prognosis: a prediction of the future course of disease following its onset
• Prognostic/predictive factors: conditions associated with an outcome of disease
– Outcome:
• the end result of care, or a measurable
change in the health status or behavior of patients
Major designs
Major designs
– Experimental:• exposure (treatment) is manipulated • analog to laboratory work
• gold standard: randomized controlled trials
– Observational:
• no any manipulation of exposure (treatment) • natural observation
Common enemies of causal
Common enemies of causal
inference
inference
• What we pursue?
– Real causal effect
• What we may have otherwise?
– Bias
– confounding – chance
Bias
Bias
• Source: systemic error
– selection: two definitions! – information
• prevention/avoidance
– better design (RCT)
• evaluation and analysis
– additional data
Chance
Chance
• Source: random error
• Prevention/avoidance
– increase sample size/power of test – more accurate measurement
• Analysis and evaluation
– p value/ confidence intervals – meta-analysis
Confounding
Confounding
• Source: other factors associated with
both exposure and outcome
• prevention/avoidance
– better design (RCT) (*matching is not suitable)
• analysis and evaluation restriction
– restriction – stratification
Reverse causal association
Reverse causal association
• Source: cross-sectional data collection
• Prevention/avoidance
– proper time sequence: exposure before outcome
• Analysis and evaluation
– biological plausibility – consistency
美國預防醫學特別委員會
美國預防醫學特別委員會
判斷標準
判斷標準
– Review of Evidence:
– literature retrieval and exclusion criteria – evaluating the quality of the evidence
(Appendix A)
• grade I: RCT (randomized controlled trials) • grade II-1: CT without R
• grade II-2: well-designed cohort or case-control studies, multi-center preferable
美國預防醫學特別委員會
美國預防醫學特別委員會
判斷標準
判斷標準
• grade II-3: multiple time-series with or without intervention, dramatic results of uncontrolled experiments
• grade III: opinion of respected authorities, based on clinical
• experiences; descriptive studies and case reports; case reports of expert committees
– cost-benefit, utility and effectiveness analysis – meta-analysis and synthesis of research
results
美國預防醫學特別委員會
美國預防醫學特別委員會
判斷標準
判斷標準
• Translating science into clinical practice (Recommendation in Appendix A)
– level A: good evidence to support the recommendation that the condition be specifically considered in a periodic health examination
– level B: fair …
– level C: insufficient … but recommendations may be made on other grounds
– level D: fair evidence … be excluded – level E: good evidence … be excluded
Grade definition
Grade definition
• Good:
– evidence includes consistent results from well-designed, well-conducted studies in representative populations that directly assess effects on health outcomes
Grade definition
Grade definition
• Fair:
– evidence is sufficient to determine effects on health outcomes, but the strength of the evidence is limited by the number, quality, or consistency of the individual studies
Grade definition
Grade definition
• Poor:
– evidence is insufficient to assess the effects on health outcomes because of limited number or power of studies,
important flaws in their design or
conduct, gaps in the chain of evidence, or lack of information on important
Ideal and reality
Ideal and reality
– Ideal: experimental• good for causal inference/fewer bias and conf ounders but not always generalizable
• more ethical concerns and costly • therapeutic efficacy evaluation
– Reality: observational
• easier to implement or data ready to use
• fewer ethical concerns but more bias or confo unders
Observational studies
Observational studies
• Cohort, prospective
– variations of prospective cohort
• Case-control
• Cross-sectional
The major difference
The major difference
• Time/timing between measurement
of exposure and outcome
• Strength in causal inference
Cohort study
Cohort study
• Original definition of a cohort 羅馬
軍團
• Prospective cohort study:
– the most classical design and attractive nature of epidemiology
– causal inference without experiment – the best in observational studies
• Variants:
Prospective cohort study in
Prospective cohort study in
outcome research
outcome research
• Assemble the cohort
– inception cohort: onset of disease/zero tim e
• Measure predictor variables (prognostic
/predictive)
• Follow-up and measure outcomes
– time to event (incidence): change of status – surrogate, qol, …: change of value
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths:
– proper time sequence: predictors (exposures measured before outcomes)
– fewer bias: information and selection
– time-dependent variables available if measured – binary: rates obtainable/ non-binary: value/change
• Weakness:
– inefficient for rare outcomes
– expensive, time consuming in maintenance/follow-up
Variant 1: retrospective
Variant 1: retrospective
cohort study
cohort study
– Identify a suitable cohort
– Collect data about predictor variables – Collect data about outcomes at a later
time
• basically also a cohort or follow-up study • only difference: time of measurement
• common in clinical studies/data linkage
• not necessarily collecting outcomes “later” but at a later time than the occurrence of the exposure
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths:
– same as prospective cohort
– less costly and time consuming
• Weakness:
– same as prospective cohort except for cost & time
– no QA/QC for data collection
Variant 2: case-cohort study
Variant 2: case-cohort study
• Identify a cohort with adequate
samples
• select a sub-cohort as comparison:
representative to the full cohort
• identify cases at the end of follow-up
• measure predictors on baseline
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths
– same as prospective cohort except non-binary outcomes
– most useful for costly analyses of specimens • biochemistry, biomarkers, …
– time-dependent factors available
– rate and non-binary outcomes obtainable
• Weakness
Variant 3: nested
Variant 3: nested
case-control study
control study
• Identify a cohort with adequate
samples
• identify cases at the end of follow-up
• select matched controls from the
cohort
• measure predictors on baseline
samples from cases and controls
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths
– same as prospective cohort except non-binary outcomes
– most useful for costly analyses of specimens • biochemistry, biomarkers, …
– controls from cohort: representative and time-matched
• Weakness
– same as prospective cohort except for cost – specimen storage/use and re-use
Variant 4: double cohort and
Variant 4: double cohort and
external controls
external controls
• Identify cohorts with different
exposures
• Determine outcomes
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths– same as prospective cohort – good for rare exposures
– time-dependent factors available
– rates, non-binary outcomes obtainable
• Weakness
– same as prospective cohort – more confounding
Case-control (reference)
Case-control (reference)
study
study
• An important breakthrough of
epidemiologic study
• classical definition
• new perspective
– control as a sample of hypothetical
population from which cases came from – can be seen as a variant of cohort study
Case-control study in
Case-control study in
outcome research
outcome research
– Draw a sample of new (incident) cases (outcome +)
– Draw a sample of controls (outcome - at a certain time)
• a sample of hypothetical population from which cases came from
– Measure the predictor variables
• usually at the time when cases and controls are drawn
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths– efficient for rare outcomes: time and cost
• Weakness
– not always proper time sequence – bias: selection and information – confounding
– non-binary outcomes not obtainable
Cross-sectional study
Cross-sectional study
• The most easy type
• usually by surveys
• current status/prevalence and
prevalence ratios only
Cross-sectional study in
Cross-sectional study in
outcome research
outcome research
• Select a sample from population
• measure the predictor variables and
the outcomes at the same time
– case/non-case (not controls) – exposure/non-exposure
Strengths and weakness in
Strengths and weakness in
outcome research
outcome research
• Strengths
– time saving
– get prevalence/status data both binary and non-binary
• Weakness
– no proper time sequence: poor in causal inference – inefficient in rare outcomes
– bias: selection and information/confounding – binary outcomes: only prevalence ratio
Differences of case-control and
Differences of case-control and
cross-sectional studies
cross-sectional studies
• Nature of cases
– case-control: incident (newly onset) cases – cross-sectional: prevalent cases
• the problem of using prevalent cases: duration
• Ratio of case to non-case/controls
– case-control: equal or of a certain ratio/efficient – cross-sectional: depends on the ratio in the
Differences of case-control and
Differences of case-control and
cross-sectional studies
cross-sectional studies
• Nature of non-cases/controls
– case-control: a representative sample of cohort – cross-sectional: a group of non-cases in the
sample of population at the time of sampling • subject to the duration, recovery of disease
• Time frame of two parts
– case-control: better matched by time of onset – cross-sectional: usually not related to onset
Similarity of case-control and
Similarity of case-control and
cross-sectional studies
cross-sectional studies
• Collect data at the time when the
cases/non-cases or controls identified
• not easy to get proper and true time
Serial surveys or panels
Serial surveys or panels
• Follow-up a single population
– Serial surveys: like multiple cross-sectional studies
– Panel: like cohort studies
Sample size estimation
Sample size estimation
– Purpose: adequate power of test– basic formula and necessary components
• alpha (one or two-sided) and beta error
– usually alpha = 0.05, beta = 0.2 – then power = 1-beta = 0.8
• effective size: mean, difference, ratio, ... • standard deviation
– from prior information or other related source
Statistical methods
Statistical methods
– Purpose:• descriptive or inference
• not to confuse or cheat readers!
– Rules:
• a good design is much more important than fancy statistical methods
• never dream of using statistical technique to compensate the results of a poor design
Statistical methods
Statistical methods
• Exposure (independent variable) X
outcome (dependent variable)
• classification of variables
– nominal or categorical
• binary/dichotomous, time to event • other
– ordinal or rank
exposure as categorical
exposure as categorical
• outcome as binary
– chi-square/proportion (Z) – logistic regression
• outcome as binary, time to
event/‘censored’
– survival :
• Kaplan-Meier, log-rank
exposure as categorical
exposure as categorical
• Outcome as other categorical
– chi-square – proportion
– logistic regression for poly-chotomous o utcomes vs. dichotomous
exposure as categorical
exposure as categorical
• Outcome as rank
– non-parametric methods
– Wilconxon’s rank sum test – Kruskal-Wallis test
exposure as categorical
exposure as categorical
• Outcome as interval
– t-test – ANOVA
– regression with dummy independent variable
Exposure as rank
Exposure as rank
• outcome as categorical:
– chi-square• outcome as rank:
– non-parametric correlation• outcome as interval
– non-parametric correlation – parametric correlation/regressionExposure as interval
Exposure as interval
• Outcome as categorical
– treat as reverse relation
• outcome as rank
– non-parametric correlation
– parametric correlation/regression
• outcome as interval
Repeated measurements
Repeated measurements
• Two times
– Categorical variables: • McNemar’s chi-square – Rank variables:• Wilcoxon’s signed rank test
– Interval variables
Repeated measurements
Repeated measurements
• Multiple times
• Advanced statistic models/methods
– GEE
– mixed effects regression model
– Markov chain and transitional probability of more than two states/status