Where I moved data around for real
Work that sits somewhere between data engineering, applied ML, and product automation.
Future Tilt
- Collaborated with a Data Consultant to build ETL pipelines with Airbyte, BigQuery, and AWS Lambda, capturing customer behavior and demographic data across 50+ ecommerce clients and 20M+ daily queries.
- Developed an AI email template platform with React, FastAPI, Docker, Google OAuth, and Klaviyo template logic, transforming Google Sheets campaign calendars into editable emails with Google Drive asset sync.
- Built customer segmentation and churn monitoring workflows with SQL and Python across Shopify and Klaviyo data to support more targeted marketing strategies.
Future Tilt
Actionable data, high impact campaigns, and measurable ROI for DTC brands.
Superlinked (Series A)
- Developed a paid launch demo for Superlinked's SIE engine during early access, working with Valentin Marek and Eric Taylor to showcase explainable wine recommendations.
- Built a RAG wine recommender that used Vivino data, small inference models, OCR, and text embeddings to surface similar wines with clearer reasoning.
- Shipped a React UI and containerized Python monorepo in Docker, keeping OCR and embedding modules cleanly separated for documentation users.
USF MAGIC Lab
- Built parallelized ETL pipelines in Go across 5 virtual machines, using DuckDB analytical querying to generate entity aware NLP datasets from 20,000+ news articles.
- Developed and evaluated an explainable aspect based sentiment analysis framework using REBEL, PyTorch, and graph based NLP methods, improving sentiment classification performance by 20% over VADER.
- Used graph based relationship extraction to connect articles, entities, sentiment, and evidence into datasets researchers could inspect.
Alaris Security (Pre seed)
- Designed orchestration workflows with Prefect and Airflow to normalize CrowdStrike, Elastic, and Microsoft Defender telemetry while powering an agent driven cybersecurity analysis platform that reduced investigation time from 10 hours to approximately 1 hour.
- Built customer facing compliance tooling with Next.js, React, and React PDF, generating NIS2 reports that linked agent generated findings from millions of raw SOC2 logs back to source telemetry.
- Established reusable tRPC service patterns across a Next.js monorepo, consolidating 50+ frontend database calls into typed backend procedures.
Future Tilt
- Built a Lambda campaign orchestration service that syncs Google Sheets planning calendars with Klaviyo campaigns and Trello production tasks, cutting setup time by 50%.
- Worked across Google Sheets, Klaviyo, Trello, and AWS Lambda to turn campaign planning data into production ready automation.
Candle Stories
- Supported documentary shoots, equipment handling, and on set logistics. Less data pipeline, more real world pipeline.
USF Strategic Enrollment Management
- Analyzed 500,000+ student records from SLATE, turning raw SQL exports into datasets, dashboards, and PCA models for admissions strategy.
- Identified regional enrollment patterns, including strong correlations between Florida applicants and SEO driven recruitment channels, informing admissions marketing strategy.
- Automated recurring web and reporting updates with Python and Jinja2 so the data work did not become manual copy paste theater.
iD Tech Camps (Stanford)
- Taught project based Python and machine learning lessons to high school students at Stanford, covering neural networks, NumPy, Pandas, and Keras.
UC Merced to SATAL
- Analyzed thousands of Qualtrics survey responses and focus group notes from 500+ students to identify drivers of engagement and academic performance.
- Used Pandas and OpenAI assisted categorization to turn open ended feedback into structured themes faculty could act on.
- Presented research on methodology at the Fresno State Exemplary Practices in Higher Education Conference.
Acme Builders Incorporated
- Built internal data systems in Python with NumPy and Pandas to clean, organize, and standardize records across departments.
- Updated, organized, and archived company documents to support payroll cycles, budgeting, and reliable business data management.
- Used OCR workflows to reduce manual document sorting and make scanned account records easier to organize.
Projects
These are mostly passion projects that I made with friends.
showing everything
All projects, no bucket applied.
nextsteamgame.com
I built this because most similarity recommenders only let users pick a game, then hand back results without explaining what actually matched. If I like Persona 5 for the jazz fusion OST and modern Tokyo setting, I want to search from those reasons, not just a broad RPG label. The goal is to help people find new games, give niche games a better highlight, and show which features drove each recommendation.
Superlinked Wine Recommender
A wine recommender developed with the Superlinked team during early access to their SIE engine. It uses document processing, vector embeddings, and small model inference to explain why a result appears, whether the match came from fizz, cherry notes, body, acidity, or other wine attributes.
Maldemic Simulator
We built Maldemic to help close the gap between researchers and the public. Disease models can feel locked behind papers and equations, so we turned SIR dynamics and Markov chain mobility into a 3D globe people can watch, question, and reason about. Python computes the stochastic population transitions, then Godot makes the spread visible for public education.
Next Chapter
A hackathon project built to make retirement questions feel less foggy. Users can ask things like "Can I retire in the Philippines?" or "How much should I start saving?" and the system answers with retrieved context and visible data instead of pretending a prompt is a financial plan.
USF Search Engine Crawler
A search pipeline built around data movement more than page scraping. 300 extract workers download and parse pages while 300 database workers batch writes into SQLite, with queues keeping the system fed without falling over when the crawler moves faster than the database wants it to.
Antidote Intelligence
An open source ML security project that treats training data as the place where model risk often starts. The system uses a multi agent analysis pipeline to inspect dataset content, generate hypotheses, and surface examples worth investigating before bad data becomes expensive behavior.
Dreamville
A gamified Canvas LMS tracker that pulls assignments into a game loop, then scores urgency from completion patterns and difficulty signals. The useful part is turning school workflow data into a next action system students can act on without another dashboard yelling at them.
Hyper Rosen
A hackathon built Godot experiment in systems that can keep expanding. Swirled Perlin noise places planets, wave function collapse handles city placement, and procedural rules create enemies and asteroids, making the project feel like a small galaxy generated from reusable data rules.
Cake Walk
A fast game jam pitch: make a tiny character readable, charming, and playable in a single day. We built and demoed Cake Walk at GDC Festival of Gaming with Keriya Son on 3D, Angie Peczon on art, Eric Taylor on shaders, and Ilce Perez on music.
Old Man Climbs
A small vertical climber built over a weekend for a UC Merced game jam in 2022. It is here less as a technical flex and more as the first shipped artifact: a reminder that finishing a small loop teaches more than endlessly planning a bigger one.
Quick Autocorrect
A small community plugin for reducing friction while writing in Obsidian. It catches repeated misspellings, applies quick corrections, and keeps a personal dictionary for words Obsidian should stop fighting you on: a tiny version of the same pattern I like, cleaning a messy text stream into something easier to use.
NutriFinder
A small dietary search project with a practical pitch: pull in messy menu and nutrition information, normalize it enough to filter, and give people a cleaner way to decide what they can eat.
Spiral Visualizer
A compact teaching visualization for spiral growth using queued directions. The pitch is simple: when a system changes step by step, showing the state often teaches faster than another paragraph of explanation.