Projects
Ship Identification
Keeping track of ships can be a tricky process, particularly when unregistered vessels are conducting clandestine operations. Aerial monitoring can identify unregistered/illicit maritime traffic such as piracy, illegal fishing, drug trafficking, and illegal cargo movement. A Convolutional Neural Network is used to detect the presence of ships in a given image, and a U-Net network is used to generate bounding boxes for the images which contain ships, allowing users to automatically identify ships from aerial ocean photography. This program allows ports to automatically monitor traffic, coast guards to detect untracked vessels, and marina operators to track use.
Covid AQI
The COVID-19 shutdowns of March, 2020, represented an unprecedented reduction in the amount of vehicular traffic in American cities, with many cities seeing a drop in Air Quality Index values (lower AQI is indicative of cleaner air). Using 5 years of previous AQI data, I created a time series model to predict a “normal” 2020 for Los Angeles, allowing us to see just how strong the impact of the shutdown was. While the AQI levels did eventually return to normal levels, this model shows just how strong of an impact vehicular traffic has on air quality.
Reddit NLP
I created a custom function to download tens of thousands of posts from the subreddits r/books and r/booksuggestions, allowing me to do Natural Language Processing on the posts. Despite the highly similar subject matter, I was able to develop a classification model which operated with great accuracy, demonstrating that just because the subject matter is similar, the method of communication can be specific. How people talk about a subject is just as important a predictor as is the subject matter itself.
Skills
Python • Data Science • Pandas • SQL • SKLearn • NumPy • Matplotlib Seaborn • Git • agile • JavaScript • JIRA • Confluence • Product Management • Data Visualization • Exploratory Data Analysis • Regression Models • Classification Models • Pipelines • Feature Engineering • Hypothesis Testing • Data Cleaning • Webscraping • Big Data • Natural Language Processing • Machine Learning • Artificial Intelligence • Neural Networks • Clustering Recommender Systems • Unsupervised Learning • Time Series Forecasting • Autocorrelation • Spark