Projects

Here are some projects I have made. Mainly, they are the ones I have deployed to the web.

Analysis of Nutrition and Food Security in Rural Malawi (Python)

With a team of 5 other Master's studentds, I analyzed features most important for predicting food insecurity primarily using Gini Impurity of Random Forests and p-values of logistic regression models during this semester-long project. Additionally, I wrote an interactive analytical dashboard that allowed users to instantaneously compare hundreds of features to one another.

Democratic vs. Republican Covid Policies (Python)

For a class in school, I and two other students analyzed how the differing COVID-19 policy prescriptions affected public health and economic conditions.

Planet Pinky Lamb (Javascript Minigames)

Over the 2019 summer, I taught myself Javascript, HTML, and CSS using online resources such as Khan Academy, freeCodeCamp and W3Schools. I enjoy programming minigames that can incorporate my personal artwork. I built my own website that is hosted by Github so that others could access them more easily.

Personal Expenses Dashboard (Python)

Using streamlit and plotly, I created a dashboard to analyze my personal expenses. It allows you to subset the data by category and date. It also allows you to set labels on specific dates to help determine why personal expenses went up or down. For fun, I even included clustering to help categorize expenses.

Catan Optimization (Python)

This is a work in progress, "for fun" project. The goal is to write an algorithm to determine the best starting position and locations to settle in Catan based on finding the most interconeected node. In the future, I want to play with the reawrd functions to allow the user to weight how much they want roads vs settlements vs cities to contriute to the reward function.

Education

SEPTEMBER 2023 – MAY 2024

Cornell University | Master of Professional Studies in Applied Statistics

  • Relevant Coursework: Natural Language Processing, Reinforcement Learning, Monte Carlo Simulation
  • SEPTEMBER 2020 – MAY 2023

    Cornell University | B.A. in Statistical Sciences

  • Relevant Coursework: Data Mining & Machine Learning, Linear Algebra, Multivariable Calculus, Object-Oriented Programming and Data Structures, Quantitative Genomics, Probability Models & Inference
  • Experience

    MAY 2023 – August 2023

    Data Science Intern @ Alife Health

  • Wrote logistic regression model and implemented interactive dashboard to predict clinical pregnancy rates per transfer from only known information a potential infertility patient would have using roughly 100,000 frozen embryo transfers (FET) cycles from the Society for Assisted Reproductive Technology (SART) database.
  • Wrote interactive dashboard to analyze Anti-Mullerian Hormone (AMH) distributions among different populations with the goal of establishing a gold standard for different sub-populations
  • MAY 2021 – March 2023

    Data Analyst Intern @ Medtech For Solutions

  • Medtech For Solutions is a company that manages IVF labs across the United States. I had the incredible opportunity to analyze pre-implantation genetic testing embryology data of 20,000 embryos that were created within that last three years.
  • I mainly wrote Python functions to standardize and conduct statistical analysis on preimplantation genetic testing data to improve clinical outcomes and for quality control. I also used PyAutoGUI and Pytesseract to automate quarterly reports and data entry from EMR data.
  • Hobbies

    I like to play piano, hike, cross country ski, fold origami and make video montages.

    2021-2022 Montage

    2020-2021 Montage