Juliana
 
IMG_0287.jpeg

Juliana Antoninus

I am a data scientist, archaeologist, and writer who enjoys uncovering the stories hiding in the data. A lifelong learner, I'm driven by a desire to know and understand the mysteries before me. I'm interested in how people use products in the real world, and want to help companies best meet the expectations of their users through data-driven design.

 
IMG_0985.jpg

Projects

Ship Identification

Keeping track of ships can be a tricky process, particularly when unregistered vessels are conducting clandestine operations. Aerial monitoring can identify unregistered/illicit maritime traffic such as piracy, illegal fishing, drug trafficking, and illegal cargo movement. A Convolutional Neural Network is used to detect the presence of ships in a given image, and a U-Net network is used to generate bounding boxes for the images which contain ships, allowing users to automatically identify ships from aerial ocean photography. This program allows ports to automatically monitor traffic, coast guards to detect untracked vessels, and marina operators to track use.

View on GitHub


Covid AQI

The COVID-19 shutdowns of March, 2020, represented an unprecedented reduction in the amount of vehicular traffic in American cities, with many cities seeing a drop in Air Quality Index values (lower AQI is indicative of cleaner air). Using 5 years of previous AQI data, I created a time series model to predict a “normal” 2020 for Los Angeles, allowing us to see just how strong the impact of the shutdown was. While the AQI levels did eventually return to normal levels, this model shows just how strong of an impact vehicular traffic has on air quality.

View on GitHub


Reddit NLP

I created a custom function to download tens of thousands of posts from the subreddits r/books and r/booksuggestions, allowing me to do Natural Language Processing on the posts. Despite the highly similar subject matter, I was able to develop a classification model which operated with great accuracy, demonstrating that just because the subject matter is similar, the method of communication can be specific. How people talk about a subject is just as important a predictor as is the subject matter itself.

Skills

Python • Data Science • Pandas • SQL • SKLearn • NumPy • Matplotlib Seaborn • Git • agile • JavaScript • JIRA • Confluence • Product Management • Data Visualization • Exploratory Data Analysis • Regression Models • Classification Models • Pipelines • Feature Engineering • Hypothesis Testing • Data Cleaning • Webscraping • Big Data • Natural Language Processing • Machine Learning • Artificial Intelligence • Neural Networks • Clustering Recommender Systems • Unsupervised Learning • Time Series Forecasting • Autocorrelation • Spark

IMG_2908.JPG

 Contact