Ryan Clark

Data Science at HHMI | Boston Childresn's Hospital | Harvard Medical School.

View Projects View Resume

About Me

I recently graduated from Boston College with a Bachelor of Science in Computer Science and a minor in Finance. I currently work as a Data Scientist in the Hur Lab at Howard Hughes Medical Institute (Boston Children’s Hospital, Harvard Medical School), where I develop and apply computational and statistical methods to large-scale data. My work focuses on modeling complex patterns in high-dimensional datasets, extracting actionable insights from noisy signals, and building scalable tools for real-world data analysis. With a strong foundation in computer science and experience working with massive experimental datasets, I bring a data-first approach to solving the most difficult challenges.

Ryan Clark Portrait

Projects

FoxP3 Visualization

FoxP3 Dimer Discovery

Customized ML pipelines revealed novel head-to-head FoxP3 binding motifs, reshaping our understanding of transcription factor interactions.

Read More →
SignalFrame Visualization

SignalFrame Library

Engineered a lightning-fast Python package for genomic signal processing. Outperforms industry giants, now powering open-source pipelines.

Read More →
MarketMotif Visualization

MarketMotif AI

Leveraged motif-mining algorithms on financial data to detect volatility regimes, enabling proactive risk management strategies.

Read More →
Music Genre NLP Project

Genre & Sentiment Analysis of Song Lyrics

Used NLTK, scikit-learn, and DistilBERT to classify 150,000 songs by genre and analyze lyrical sentiment. Combined models to uncover patterns between genre and emotional tone.

Read More →
Leaf Health Classifier

Tomato Leaf Disease Detection with CNNs

Developed clustering and CNN-based models to classify PlantVillage tomato leaf images as healthy or diseased. Focused on minimizing false negatives to aid early disease detection.

Read More →
Dementia Detection with CNNs

fMRI Dementia Detection with Deep Learning

Developed a multi-model CNN pipeline (ResNet50, VGG16, Inception-V3, AlexNet) to classify dementia stages using fMRI scans. Achieved 93% accuracy with ResNet50 and visualized important brain regions using Grad-CAM.

Read More →

Skills & Technologies

Python
AWS
SQL
ML & AI
R
Tableau
Git
Pandas & NumPy
TensorFlow & PyTorch
Linux

Get In Touch