Statistics
A folder of projects on Biostatistics and Statistics
This folder commits to the application of biostatistics knowledge on clinical, randomized trials and observational studies. Goals include:
- write a good statistical analysis plan
- understand the obstacles when dealing real-world data
- apply biomedical/research statistical analyses
- excel various analytical models
Code and raw datasets are placed in this Github repository
- Logistic Regression:
- analytics: logistic regression model, build & interpretation
- data source: Framingham longitudinal heart study
- research question: Does quitting smoking reduce the likelihood of incident cardiovascular disease?
- Survey Analysis:
- analytics: survey data analysis with weights
- data source: National Health and Nutrition Examination Survey (NHANES) 2017-2018 data
- research question: Is vaping associated with increased blood lead levels? Is there effect modification by age or gender?
- Longitudinal Analysis:
- analytics: longitudinal data analysis & missing data imputation
- data source: longitudinal data from study “A Phase 3 placebo-controlled, double-blind, multi-site trial of the alpha-2-adrenergic agonist, lofexidine, for opioid withdrawal”
- research question:
- Is there group difference between the treatment group and placebo group with respect to MHOWS (parallel group response profiles)?
- Is there any time effect?
- Propesity Score Matching Analysis:
- analytics: propensity score matching and outcome analysis
- data source: National Health and Nutrition Examination Survey Data I Epidemiologic Follow Up Study (NHEFS) 1971&1982 follow-up data
- research question: Is there an effect of smoking cessation on weight gain?
- Survival Analysis:
- analytics: survival analysis, Cox proportional hazards model
- data source: simulated Ebola epidemic data
- research question: is gender, age a risk factor for Ebola survivability?