About Me
Data Analyst at Schonfeld Strategic Advisors, working across investment and business teams to build data infrastructure and portfolio analytics that inform firm-level decision-making. I hold a B.S. in Computer Science with a minor in Finance from Boston College, and I bring experience in scaling data pipelines for massive datasets, applying machine learning to extract predictive insight, and deploying cloud-based analytics infrastructure.
Projects
Large-Scale Data Engineering & Pattern Mining
First co-author: designed a leakage-safe, terabyte-scale ETL + pattern-mining pipeline (1B+ rows) to surface stable motifs and interaction rules; hardened via cross-dataset checks and FDR-controlled statistics.
Read More →
MarketMotif AI
Adapted genomics-style motif mining to markets to flag volatility regimes early; time-aware CV and threshold calibration reduced false positives under a fixed alert budget for actionable risk signals.
Read More →
SignalFrame Library
Built a Python toolkit for fast interval-signal analytics (BigWig/BED): vectorized merges/intersections and windowed stats turn hours-long feature jobs into minutes with predictable latency.
Read More →
Genre & Sentiment Analysis of Song Lyrics
Used NLTK, scikit-learn, and DistilBERT to classify 150,000 songs by genre and analyze lyrical sentiment. Combined models to uncover patterns between genre and emotional tone.
Read More →
Tomato Leaf Disease Detection with CNNs
Developed clustering and CNN-based models to classify PlantVillage tomato leaf images as healthy or diseased. Focused on minimizing false negatives to aid early disease detection.
Read More →
fMRI Dementia Detection with Deep Learning
Developed a multi-model CNN pipeline (ResNet50, VGG16, Inception-V3, AlexNet) to classify dementia stages using fMRI scans. Achieved 93% accuracy with ResNet50 and visualized important brain regions using Grad-CAM.
Read More →