Data Science and ML Projects

Data Science and ML Projects

Over time, I have attempted many different machine learning projects to utilize as many different architectures as I can. Generally these projects are all deep learning based, and so some examples include image recognition and NLP.

This page mirrors what is currently present in my main data science repository (also linked above). These follow a framework for training that is adapted from the this repository by ashleve. A lot of credit goes to this person for providing me the starter template. Below you will find the projects that have been started and built using this template. These experiments are also capable of outputting metrics to Weights and Biases. Finally, most of these projects are derived from the Kaggle competition counterparts.

Histopathologic Cancer Detection

This was the “intro” project I did in order to familiarize myself with the template and how it works. It is derived from the corresponding Kaggle competition from 5 years ago. It is a basic convolutional neural net that attempts to detect cancer in images.

Oxford Pet detection

The [Oxford-IIIT Pet Dataset] is a collection of 37 categories with about 200 images per class. It consists of both object detection (predicting bounding boxes around objects) and segmentation (predicting each pixel individually). I used this dataset to primarily practice building models for image segmentation.

Google Research - Detecting Contrails in the Atmosphere

This is a more involved project which involved predicting the exact presence of contrail in the atmosphere given a time-series of infrared images. This is a segmentation task, where a prediction is made for each individual pixel in the image.

Photo by Conny Schneider on Unsplash