Data Analyst
South Jordan, UT
I build KPI's dashboards, data pipelines and ML models that turn messy data into clear decisions.
Machine Learning & Healthcare
Built an end-to-end ML pipeline on 790,000+ ICU patient records to detect sepsis a median of 23 hours before clinical diagnosis. Achieved AUC of 0.885 using XGBoost with SHAP explainability.
Data Engineering
Built an end-to-end ETL pipeline that extracts drug adverse event reports from the FDA's OpenFDA API, transforms nested JSON into three structured tables, and loads them into Snowflake using idempotent MERGE statements. Processed 51,265 reports in a 16-day window, surfacing fentanyl and alcohol as the highest death-rate drugs reported to the FDA.
Data Wrangling & ML
Built a Random Forest model to predict next-day S&P 500 direction using Yahoo Finance and FRED API data with feature engineering across 5 time horizons. Backtested a model-guided dollar-cost averaging strategy against passive DCA, both returning ~15% on $27.5K invested.
Data Wrangling & ML
Built a market stress early warning system using S&P 500 data. Engineered risk features — rolling volatility, cumulative returns, and drawdowns — then trained a Logistic Regression model to estimate the probability of market stress. Found that risk spikes days before price drops, showing markets can look calm while becoming fragile.
Machine Learning
Trained a Random Forest classifier to predict whether a house was built before or after 1980 using 48 features from a 22,900-row dataset. Achieved 92.3% accuracy, with living area, number of bathrooms, and stories as the top predictors.
Powered by the Healthcare Adverse Events Pipeline — ask anything about drug safety reports stored in Snowflake. The AI writes the SQL, queries the database live, and explains the results in plain English.