Analytical & Data Projects

A collection of technical projects spanning machine learning, econometric analysis, and data science.

Email Spam Classification System

Developed a text-based machine learning classifier achieving 90%+ accuracy while balancing spam detection performance with false-positive risk and ethical considerations.

Python, Machine Learning, NLP
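The trade-off between catching spam and avoiding false positives can be illustrated by sweeping a decision threshold over classifier scores. This is a minimal sketch, not the project's code: it assumes the classifier outputs a spam probability per email, the helper `rates_at_threshold` is hypothetical, and the scores and labels are made-up illustration values.

```python
def rates_at_threshold(scores, labels, threshold):
    """Return (recall, false_positive_rate) when flagging score >= threshold.

    labels: 1 = spam, 0 = legitimate email.
    """
    tp = fp = fn = tn = 0
    for score, label in zip(scores, labels):
        flagged = score >= threshold
        if flagged and label == 1:
            tp += 1  # spam correctly flagged
        elif flagged and label == 0:
            fp += 1  # legitimate email wrongly flagged
        elif not flagged and label == 1:
            fn += 1  # spam that slipped through
        else:
            tn += 1  # legitimate email correctly passed
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return recall, fpr


# Made-up scores and labels, for illustration only.
scores = [0.95, 0.80, 0.65, 0.40, 0.30, 0.10]
labels = [1, 1, 0, 1, 0, 0]

for t in (0.3, 0.5, 0.7):
    recall, fpr = rates_at_threshold(scores, labels, t)
    print(f"threshold={t:.1f} recall={recall:.2f} false_positive_rate={fpr:.2f}")
```

On this toy data, raising the threshold lowers the false-positive rate at the cost of missing some spam, which is the balance the project description refers to.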
NBA Salary Determinants

Used econometric regression to analyze how player performance and experience translated into NBA wages, finding that scoring and tenure were the primary drivers of compensation.

Econometrics, OLS Regression, R
Cook County Housing Price Prediction

Built and evaluated a machine learning model on 500k+ housing records to predict property prices and assess fairness impacts across socioeconomic neighborhoods.

Python, Scikit-learn, ML
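One simple way to check whether a price model treats neighborhoods evenly is to compare its error separately for each group. The sketch below is an assumption about how such a check could look, not the project's actual method: the helper `rmse_by_group`, the group names, and the prices are all hypothetical.

```python
from collections import defaultdict
from math import sqrt


def rmse_by_group(records):
    """records: iterable of (group, actual_price, predicted_price).

    Returns a dict mapping each group to its root-mean-squared error.
    """
    squared_errors = defaultdict(list)
    for group, actual, predicted in records:
        squared_errors[group].append((actual - predicted) ** 2)
    return {g: sqrt(sum(errs) / len(errs)) for g, errs in squared_errors.items()}


# Hypothetical prices in thousands of dollars, for illustration only.
records = [
    ("lower_income", 100, 130),
    ("lower_income", 120, 90),
    ("higher_income", 500, 510),
    ("higher_income", 450, 440),
]

for group, rmse in rmse_by_group(records).items():
    print(group, round(rmse, 1))
```

A large gap between groups, especially relative to typical prices in each group, would suggest the model's errors fall unevenly across neighborhoods.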
Climate Change & Civil Conflict

Constructed panel data models to study how temperature and precipitation shocks influence civil conflict risk across countries and time.

Python, R, Panel Data
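The description does not say which panel specification was used. As one common possibility, the sketch below shows a country fixed-effects (within) estimator with a single regressor: each country's own averages are subtracted before fitting a slope. The function `within_slope` and the data are hypothetical, and time effects are omitted for brevity.

```python
from collections import defaultdict


def within_slope(rows):
    """rows: list of (unit, x, y). Returns the entity fixed-effects slope.

    Each unit's mean x and mean y are removed, then one slope is fit
    to the demeaned data pooled across units.
    """
    by_unit = defaultdict(list)
    for unit, x, y in rows:
        by_unit[unit].append((x, y))

    numerator = denominator = 0.0
    for observations in by_unit.values():
        mean_x = sum(x for x, _ in observations) / len(observations)
        mean_y = sum(y for _, y in observations) / len(observations)
        for x, y in observations:
            numerator += (x - mean_x) * (y - mean_y)
            denominator += (x - mean_x) ** 2
    return numerator / denominator


# Synthetic panel: two units with different baselines but the same slope of 2.
rows = [
    ("A", 0, 10), ("A", 1, 12), ("A", 2, 14),
    ("B", 0, 1), ("B", 1, 3), ("B", 2, 5),
]
print(within_slope(rows))
```

Demeaning removes each unit's fixed baseline, so the estimate reflects only how the outcome moves with the regressor inside each unit over time.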
Minimum Wage Policy & Foreign Investment

Applied panel econometrics to assess how minimum wage policy and labor availability affect foreign direct investment in emerging manufacturing economies.

Econometrics, Panel Data, Policy Analysis
Economic Impact Analysis: Mariel Boatlift

Replicated Nobel Prize–winning causal research to evaluate the labor market effects of immigration using survey data and econometric analysis.

Python, Causal Analysis, Labor Economics
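Comparisons of this kind are often framed as a difference-in-differences: the change in an outcome for the affected labor market minus the change for comparison markets. Whether the replication used exactly this estimator is not stated, so the sketch below is illustrative only; `diff_in_diff` is a hypothetical helper and the numbers are placeholders, not survey results.

```python
from statistics import mean


def diff_in_diff(treated_before, treated_after, control_before, control_after):
    """Change in the treated group's mean minus change in the control group's mean."""
    treated_change = mean(treated_after) - mean(treated_before)
    control_change = mean(control_after) - mean(control_before)
    return treated_change - control_change


# Placeholder outcome values (for example, unemployment rates in percent).
treated_before = [5.0, 6.0]
treated_after = [7.0, 8.0]
control_before = [5.0, 5.0]
control_after = [6.0, 6.0]

estimate = diff_in_diff(treated_before, treated_after, control_before, control_after)
print(estimate)
```

The estimate is only interpretable as a causal effect if the two groups would have followed parallel trends without the event, which is the key assumption behind this design.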
Regional GDP Analysis

Analyzed and visualized county-level GDP trends to compare regional economic recovery patterns across the United States over two decades.

Python, Geospatial, Time Series
Econometric Analysis of MLB Salary Disparities

Investigated how race and city demographics influence MLB player salaries beyond performance metrics.

R, Regression, Bias Analysis
Ants: Data-Driven Tower Defense Game

Built a tower defense game using object-oriented programming, with custom classes, inheritance, and pathfinding algorithms.

Python, OOP, Algorithms
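The description does not name the pathfinding algorithm used. As a stand-in, the sketch below uses breadth-first search, a standard choice for shortest paths on an unweighted grid. The grid encoding ('#' for blocked cells) and the function `shortest_path_length` are assumptions for illustration.

```python
from collections import deque


def shortest_path_length(grid, start, goal):
    """Breadth-first search on a grid of strings; '#' marks a blocked cell.

    Returns the number of steps from start to goal, or None if unreachable.
    Moves are up, down, left, and right.
    """
    rows, cols = len(grid), len(grid[0])
    queue = deque([(start, 0)])
    seen = {start}
    while queue:
        (r, c), steps = queue.popleft()
        if (r, c) == goal:
            return steps
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] != "#" and (nr, nc) not in seen:
                seen.add((nr, nc))
                queue.append(((nr, nc), steps + 1))
    return None


# Hypothetical 3x3 map with a wall in the right-hand column.
grid = [
    "..#",
    "..#",
    "...",
]
print(shortest_path_length(grid, (0, 0), (2, 2)))
```

Because breadth-first search explores cells in order of distance from the start, the first time it reaches the goal is guaranteed to be along a shortest route.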
CATS: Typing Autocorrect System

Built typing software that measures speed and accuracy and autocorrects input using recursive string comparison and dynamic programming.

Python, Algorithms, Recursion
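Recursive string comparison combined with dynamic programming can be sketched as a memoized edit distance, with autocorrect choosing the closest known word. This is an illustrative version under assumed names (`edit_distance`, `autocorrect`, and the `limit` parameter are hypothetical), not the project's implementation.

```python
from functools import lru_cache


def edit_distance(a, b):
    """Minimum number of insertions, deletions, and substitutions turning a into b."""

    @lru_cache(maxsize=None)  # memoization turns the recursion into dynamic programming
    def dist(i, j):
        if i == len(a):
            return len(b) - j  # insert the rest of b
        if j == len(b):
            return len(a) - i  # delete the rest of a
        if a[i] == b[j]:
            return dist(i + 1, j + 1)
        return 1 + min(
            dist(i + 1, j),      # delete a[i]
            dist(i, j + 1),      # insert b[j]
            dist(i + 1, j + 1),  # substitute a[i] with b[j]
        )

    return dist(0, 0)


def autocorrect(typed, words, limit):
    """Return the closest word within `limit` edits, else the typed word unchanged."""
    if typed in words:
        return typed
    best = min(words, key=lambda w: edit_distance(typed, w))
    return best if edit_distance(typed, best) <= limit else typed


print(autocorrect("speling", ["spelling", "typing", "speed"], limit=2))
```

Without the cache, the recursion revisits the same suffix pairs many times; caching each `(i, j)` result bounds the work by the product of the two string lengths.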
Classifying Movies (K-Nearest Neighbors)

Built a K-NN classifier that predicts movie genres from keyword frequencies in scripts, using Euclidean distance between frequency vectors.

Python, K-NN, Classification
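A K-NN prediction with Euclidean distance comes down to three steps: measure the distance from the query to every labeled example, keep the k closest, and take a majority vote. The sketch below assumes each script is represented by a short vector of keyword frequencies; the function `knn_predict`, the genres, and the numbers are made up for illustration.

```python
from collections import Counter
from math import dist  # Euclidean distance between two points (Python 3.8+)


def knn_predict(train, query, k):
    """train: list of (feature_vector, label). Returns the majority label among
    the k training examples closest to `query`."""
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Made-up frequencies of two hypothetical keywords per script.
train = [
    ((0.9, 0.1), "romance"),
    ((0.8, 0.2), "romance"),
    ((0.7, 0.1), "romance"),
    ((0.1, 0.9), "thriller"),
    ((0.2, 0.8), "thriller"),
    ((0.1, 0.7), "thriller"),
]

print(knn_predict(train, (0.75, 0.15), k=3))
```

Using an odd k with two classes avoids tied votes; with more genres, a tie-breaking rule would be needed.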
Climate Change: Temperatures and Precipitation

Analyzed temperature and precipitation data from 210 U.S. cities using hypothesis tests, including A/B tests.

Python, Statistics, Hypothesis Testing
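An A/B test can be run as a permutation test: shuffle the group labels many times and see how often the shuffled difference in means is at least as large as the observed one. The sketch below is one way to do this, under assumptions: `permutation_p_value` is a hypothetical helper, and the two samples are placeholder values rather than measurements from the project's data.

```python
import random


def permutation_p_value(group_a, group_b, trials=10_000, seed=0):
    """Two-sided permutation test on the absolute difference in means.

    Returns the fraction of label shuffles whose difference is at least
    as large as the observed difference.
    """
    def mean(values):
        return sum(values) / len(values)

    rng = random.Random(seed)  # fixed seed so the result is reproducible
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    at_least_as_extreme = 0
    for _ in range(trials):
        rng.shuffle(pooled)
        shuffled_a = pooled[:len(group_a)]
        shuffled_b = pooled[len(group_a):]
        if abs(mean(shuffled_a) - mean(shuffled_b)) >= observed:
            at_least_as_extreme += 1
    return at_least_as_extreme / trials


# Placeholder samples, for illustration only.
earlier_period = [10, 11, 12, 13, 14]
later_period = [20, 21, 22, 23, 24]
print(permutation_p_value(earlier_period, later_period))
```

A small p-value means a difference this large would rarely arise from random labeling alone, which is evidence against the null hypothesis that the two groups come from the same distribution.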
Predicting Baseball Wins

Developed regression models that predict MLB team wins from runs and other performance metrics, using data since 2000.

R, Regression, Sports Analytics
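A one-variable least-squares fit shows the basic idea behind predicting wins from runs. The sketch is written in Python although the project itself lists R, and it rests on assumptions: run differential as the single predictor is a guess at one plausible specification, `fit_line` is a hypothetical helper, and the data are synthetic rather than real team seasons.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Synthetic seasons: run differential and wins on an exactly linear relationship.
run_differential = [-100, -50, 0, 50, 100]
wins = [71, 76, 81, 86, 91]

intercept, slope = fit_line(run_differential, wins)
predicted_wins = intercept + slope * 30  # prediction for a +30 run differential
print(round(intercept, 2), round(slope, 3), round(predicted_wins, 1))
```

With real data the points would scatter around the line, and additional performance metrics would enter as further regressors in a multiple regression.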
World Population and Poverty

Examined global population growth and poverty trends by modeling and visualizing data from 145 countries.

Python, Data Viz, Global Development