Honors Data Science & Economics @ George Washington University
BS Data Science & Economics | George Washington University Honors |
As an intern with the Data Engineering Team, I collaborated with Data Analysts, Software Engineers, and Data Scientists to build and maintain numerous data models using dbt, Snowflake, and Looker. I optimized SQL queries for datasets with +100 million rows, achieving high performance and cost reduction. Additionally, I designed and implemented scalable big data pipelines using Airflow, improving the efficiency of data flow and processing.
During my internship at the Federal Reserve Board with the BOPA team, I had the opportunity to hone my technical skills while contributing to impactful financial decision-making processes. I developed various machine learning models using Python and R, such as a predictive model for the division’s budget. I also created and maintained comprehensive Tableau and Power BI dashboards, providing managers/officers with clear insights that led to cost-saving opportunities. This experience was invaluable in solidifying my interest in data and demonstrating the real-world impact of analytical solutions.
As a research assistant, I have experience constructing and analyzing large datasets to study complex topics. In this role, I created a panel dataset covering over 3,000 counties and 22 years to investigate the impact of unionization on voting access measures in the United States. I used quasi-experimental methods and regression analysis (using R and STATA) to study this relationship and conducted thorough research on the topic to validate and support my data analysis. This role interested me as it allowed me to develop my data skills, which I will use in my career.
I served for an AmeriCorps Program called Beyond School Mission St. Louis. I helped a group of 13 sixth graders prepare for High School and beyond with social/emotional learning, enrichment activities, community service, and individualized academics. I was also the volunteer specialist of my school, helping recruit and train 40 volunteers to work with our students. I learned a lot while working for Beyond School, specifically skills of volunteer management, tutoring, and database management. The entire experience was really precious as it broadened my views and taught me important life skills, which I know I will use in the future.
This project is a content-based book recommendation system that uses large language models to suggest titles based on user-inputted descriptions and emotional tone, rather than user history. Built with Python, LangChain, Hugging Face, and Gradio, it integrates zero-shot genre classification, sentiment analysis, and vector similarity search to deliver nuanced recommendations through an interactive web interface.
Read more about this project here
This project investigates the impact of Donald Trump’s Twitter activity on short term foreign exchange market volatility during his presidency, employing high-frequency intraday data and utilizing the Anthropic Claude Sonnet 3.5 model for tweet classification. The study analyzes over 56,000 tweets, categorizing them into trade policy, international relations, economic policy, and security topics while examining their effects on major currency pairs, including the Turkish Lira, Canadian Dollar, and Mexican Peso. Through Ordinary Least Squares regression analysis and sentiment interaction terms, the research reveals significant correlations between Trump’s tweets and currency volatility, demonstrating how social media communications from world leaders can create distinct information shocks in global financial markets.
Read more about this project here
This project analyzed the 2017 World Happiness Report in R, examining the factors influencing happiness scores and variations across regions. The study utilized correlation matrices, box plots, regression analysis, and feature importance tests to identify that life expectancy, economic factors, and freedom were the most impactful variables on happiness scores, with regional variations observed in their correlations.
Read more about this project here
Through logistic regression modeling in Stata, I analyzed 2018 National Health Interview Survey data to quantify the relationship between food insecurity and asthma prevalence in the US. By constructing control variables for income, family size, and regional factors, I identified a statistically significant positive correlation between food access limitations and asthma risk. I verified my findings through hypothesis testing and robustness checks, addressing potential endogeneity concerns to validate the observed relationship. This econometric approach revealed important socioeconomic mediators of the food insecurity-asthma connection, contributing meaningful insights into health disparities while strengthening my capabilities in statistical inference and complex data analysis.
Read more about this project here
Connect with me on LinkedIn