Alex Peczon

Hey, I'm Alex

Data software engineer. I like turning messy APIs, logs, reviews, surveys, and spreadsheets into pipelines people can actually use.

Favorite language: Python

Where I moved data around for real

Work that sits somewhere between data engineering, applied ML, and product automation.

Future Tilt

Software Engineer
Jul 2025 to Present
San Francisco
  • Collaborated with a Data Consultant to build ETL pipelines with Airbyte, BigQuery, and AWS Lambda, capturing customer behavior and demographic data across 50+ ecommerce clients and 20M+ daily queries.
  • Developed an AI email template platform with React, FastAPI, Docker, Google OAuth, and Klaviyo template logic, transforming Google Sheets campaign calendars into editable emails with Google Drive asset sync.
  • Built customer segmentation and churn monitoring workflows with SQL and Python across Shopify and Klaviyo data to support more targeted marketing strategies.
React FastAPI Airbyte BigQuery AWS Lambda Klaviyo API Google OAuth Docker
Future Tilt: Actionable data, high-impact campaigns, and measurable ROI for DTC brands.

Superlinked (Series A)

SIE Demo Software Developer (Contract)
Mar 2026 to May 2026
San Francisco Bay Area · Hybrid
  • Developed a paid launch demo for Superlinked's SIE engine during early access, working with Valentin Marek and Eric Taylor to showcase explainable wine recommendations.
  • Built a RAG wine recommender that used Vivino data, small inference models, OCR, and text embeddings to surface similar wines with clearer reasoning.
  • Shipped a React UI and containerized Python monorepo in Docker, keeping OCR and embedding modules cleanly separated for documentation users.
Superlinked SIE React Python Docker RAG Small Models

USF MAGIC Lab

NLP Research Assistant
Mar 2025 to May 2026
San Francisco
  • Built parallelized ETL pipelines in Go across 5 virtual machines, using DuckDB analytical querying to generate entity-aware NLP datasets from 20,000+ news articles.
  • Developed and evaluated an explainable aspect-based sentiment analysis framework using REBEL, PyTorch, and graph-based NLP methods, improving sentiment classification performance by 20% over VADER.
  • Used graph-based relationship extraction to connect articles, entities, sentiment, and evidence into datasets researchers could inspect.
Go Python NLP PyTorch REBEL Graph Extraction DuckDB

Alaris Security (Pre-seed)

Junior Fullstack Engineer
Aug 2025 to Nov 2025
San Francisco
  • Designed orchestration workflows with Prefect and Airflow to normalize CrowdStrike, Elastic, and Microsoft Defender telemetry, powering an agent-driven cybersecurity analysis platform that reduced investigation time from 10 hours to approximately 1 hour.
  • Built customer-facing compliance tooling with Next.js, React, and React PDF, generating NIS2 reports that linked agent-generated findings from millions of raw SOC2 logs back to source telemetry.
  • Established reusable tRPC service patterns across a Next.js monorepo, consolidating 50+ frontend database calls into typed backend procedures.
Next.js React PDF tRPC Prefect Airflow CrowdStrike Microsoft Defender

Future Tilt

Software Engineering Intern
Jun 2025 to Aug 2025
San Francisco
  • Built a Lambda campaign orchestration service that syncs Google Sheets planning calendars with Klaviyo campaigns and Trello production tasks, cutting setup time by 50%.
  • Worked across Google Sheets, Klaviyo, Trello, and AWS Lambda to turn campaign planning data into production-ready automation.
AWS Lambda Google Sheets Klaviyo API Trello API Automation

Candle Stories

Production Assistant
Apr 2025 to Aug 2025
San Francisco
  • Supported documentary shoots, equipment handling, and on-set logistics. Less data pipeline, more real-world pipeline.
Production Logistics

USF Strategic Enrollment Management

Data Analyst / Web Intern
Jul 2024 to Jul 2025
San Francisco
  • Analyzed 500,000+ student records from SLATE, turning raw SQL exports into datasets, dashboards, and PCA models for admissions strategy.
  • Identified regional enrollment patterns, including strong correlations between Florida applicants and SEO-driven recruitment channels, informing admissions marketing strategy.
  • Automated recurring web and reporting updates with Python and Jinja2 so the data work did not become manual copy-paste theater.
SQL Pandas PCA SLATE Jinja2 Admissions Analytics

iD Tech Camps (Stanford)

Machine Learning Instructor
Jun 2024 to Aug 2024
Stanford, California
  • Taught project-based Python and machine learning lessons to high school students at Stanford, covering neural networks, NumPy, Pandas, and Keras.
Python PyTorch Keras NumPy Pandas

UC Merced · SATAL

Data Analyst Intern
Aug 2023 to May 2024
Merced
  • Analyzed thousands of Qualtrics survey responses and focus group notes from 500+ students to identify drivers of engagement and academic performance.
  • Used Pandas and OpenAI-assisted categorization to turn open-ended feedback into structured themes faculty could act on.
  • Presented research on methodology at the Fresno State Exemplary Practices in Higher Education Conference.
Pandas Qualtrics OpenAI Survey Analysis Research Methods

Acme Builders Incorporated

Construction Worker → Accounting Assistant
May 2021 to Dec 2023
Oakland · On-site · Part-time
  • Built internal data systems in Python with NumPy and Pandas to clean, organize, and standardize records across departments.
  • Updated, organized, and archived company documents to support payroll cycles, budgeting, and reliable business data management.
  • Used OCR workflows to reduce manual document sorting and make scanned account records easier to organize.
Python Pandas NumPy OCR Business Data Accounting Construction

Projects

These are mostly passion projects that I made with friends.

Live

nextsteamgame.com

I built this because most similarity recommenders only let users pick a game, then hand back results without explaining what actually matched. If I like Persona 5 for the jazz fusion OST and modern Tokyo setting, I want to search from those reasons, not just a broad RPG label. The goal is to help people find new games, give niche games a better highlight, and show which features drove each recommendation.

Long term PostgreSQL ChromaDB Qdrant ModernBERT FastAPI Docker
Superlinked
Series A

Superlinked Wine Recommender

A wine recommender developed with the Superlinked team during early access to their SIE engine. It uses document processing, vector embeddings, and small model inference to explain why a result appears, whether the match came from fizz, cherry notes, body, acidity, or other wine attributes.

Long term Superlinked SIE Vector Search OCR Small Models Chroma PostgreSQL
2nd Place

Maldemic Simulator

We built Maldemic to help close the gap between researchers and the public. Disease models can feel locked behind papers and equations, so we turned SIR dynamics and Markov chain mobility into a 3D globe people can watch, question, and reason about. Python computes the stochastic population transitions, then Godot makes the spread visible for public education.
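The combination of SIR dynamics and Markov chain mobility described above can be sketched in a few lines. This is a minimal illustration, not the project's code: it uses only the standard library instead of NumPy and SciPy, and the names `sir_step`, `move_step`, and `simulate`, the parameters `beta` and `gamma`, and the two-region mobility matrix are all assumptions made for the example.

```python
import math
import random


def binomial(rng, n, p):
    """Count successes in n Bernoulli(p) trials (simple; fine for small n)."""
    return sum(rng.random() < p for _ in range(n))


def sir_step(regions, beta, gamma, rng):
    """Apply one stochastic SIR update inside each region.

    regions is a list of (S, I, R) tuples, one per region.
    """
    updated = []
    for s, i, r in regions:
        n = s + i + r
        # Per-susceptible infection probability for this time step.
        p_inf = 1.0 - math.exp(-beta * i / n) if n > 0 else 0.0
        new_inf = binomial(rng, s, p_inf)
        new_rec = binomial(rng, i, gamma)
        updated.append((s - new_inf, i + new_inf - new_rec, r + new_rec))
    return updated


def move_step(regions, mobility, rng):
    """Move individuals between regions using a row-stochastic matrix.

    mobility[src][dst] is the chance that a person in src ends the step in dst.
    """
    k = len(regions)
    moved = [[0, 0, 0] for _ in range(k)]
    for src, counts in enumerate(regions):
        for comp, count in enumerate(counts):
            for dst in rng.choices(range(k), weights=mobility[src], k=count):
                moved[dst][comp] += 1
    return [tuple(c) for c in moved]


def simulate(regions, mobility, beta=0.3, gamma=0.1, steps=50, seed=0):
    """Return the list of region states, starting with the initial state."""
    rng = random.Random(seed)
    history = [list(regions)]
    for _ in range(steps):
        regions = move_step(sir_step(regions, beta, gamma, rng), mobility, rng)
        history.append(regions)
    return history


if __name__ == "__main__":
    start = [(990, 10, 0), (1000, 0, 0)]      # outbreak seeded in region 0
    travel = [[0.95, 0.05], [0.05, 0.95]]     # hypothetical mobility matrix
    for day, state in enumerate(simulate(start, travel, steps=30)):
        if day % 10 == 0:
            print(day, state)
```

Each entry in the returned history is a per-region snapshot, which is the kind of output a renderer such as Godot could consume frame by frame.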

Long term Python NumPy SciPy Godot Markov Chains SIR
Hackathon

Next Chapter

A hackathon project built to make retirement questions feel less foggy. Users can ask things like "Can I retire in the Philippines?" or "How much should I start saving?" and the system answers with retrieved context and visible data instead of pretending a prompt is a financial plan.

RAG LLMs FinTech Personal Finance AI for Good
Crawler

USF Search Engine Crawler

A search pipeline built around data movement more than page scraping. 300 extract workers download and parse pages while 300 database workers batch writes into SQLite, with queues keeping the system fed without falling over when the crawler moves faster than the database wants it to.
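The worker and queue layout described above can be sketched as follows. The project itself is written in Go; this is a scaled-down Python illustration of the same pattern, with a stubbed parse step instead of real downloads. The worker counts, `BATCH_SIZE`, the queue size, and the `pages` table are assumptions made for the example.

```python
import os
import queue
import sqlite3
import tempfile
import threading

BATCH_SIZE = 50    # rows per INSERT batch (assumed value)
STOP = object()    # sentinel that tells a worker to shut down


def extract_worker(urls, rows):
    """Take URLs, 'parse' them, and push rows onto the bounded rows queue."""
    while True:
        url = urls.get()
        if url is STOP:
            break
        # Stand-in for download + parse; a real crawler would fetch here.
        title = "title for " + url
        rows.put((url, title))  # blocks while the queue is full (backpressure)


def db_worker(db_path, rows):
    """Drain the rows queue and write to SQLite one batch per transaction."""
    conn = sqlite3.connect(db_path, timeout=30)
    batch = []
    while True:
        item = rows.get()
        if item is not STOP:
            batch.append(item)
        if batch and (item is STOP or len(batch) >= BATCH_SIZE):
            with conn:  # commits the whole batch at once
                conn.executemany(
                    "INSERT INTO pages (url, title) VALUES (?, ?)", batch
                )
            batch = []
        if item is STOP:
            break
    conn.close()


def crawl(url_list, db_path, n_extract=4, n_db=2):
    """Run extract workers and database workers connected by queues."""
    conn = sqlite3.connect(db_path)
    conn.execute("CREATE TABLE IF NOT EXISTS pages (url TEXT, title TEXT)")
    conn.commit()
    conn.close()

    urls = queue.Queue()
    rows = queue.Queue(maxsize=200)  # bounded, so extractors cannot run away
    extractors = [
        threading.Thread(target=extract_worker, args=(urls, rows))
        for _ in range(n_extract)
    ]
    writers = [
        threading.Thread(target=db_worker, args=(db_path, rows))
        for _ in range(n_db)
    ]
    for t in extractors + writers:
        t.start()
    for u in url_list:
        urls.put(u)
    for _ in extractors:
        urls.put(STOP)
    for t in extractors:
        t.join()
    for _ in writers:          # all rows are queued; now stop the writers
        rows.put(STOP)
    for t in writers:
        t.join()


def count_pages(db_path):
    conn = sqlite3.connect(db_path)
    (n,) = conn.execute("SELECT COUNT(*) FROM pages").fetchone()
    conn.close()
    return n


def demo(n_urls=500):
    """Crawl n_urls fake URLs into a temporary database; return the row count."""
    with tempfile.TemporaryDirectory() as tmp:
        db_path = os.path.join(tmp, "pages.db")
        crawl(["https://example.com/%d" % i for i in range(n_urls)], db_path)
        return count_pages(db_path)


if __name__ == "__main__":
    print(demo())
```

The bounded `rows` queue is what keeps the system stable: when the database falls behind, `rows.put` blocks and the extract workers slow down instead of piling up unwritten pages in memory.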

Long term Go SQLite Concurrency Batch Writes Queues
Open Source

Antidote Intelligence

An open-source ML security project that treats training data as the place where model risk often starts. The system uses a multi-agent analysis pipeline to inspect dataset content, generate hypotheses, and surface examples worth investigating before bad data becomes expensive behavior.

Long term Python OpenAI ML Security Data Quality Agent Pipeline
In Progress

Dreamville

A gamified Canvas LMS tracker that pulls assignments into a game loop, then scores urgency from completion patterns and difficulty signals. The useful part is turning school workflow data into a next-action system students can use without another dashboard yelling at them.

Long term Godot Go Canvas API Regression Workflow Data
Hackathon

Hyper Rosen

A hackathon-built Godot experiment in systems that can keep expanding. Swirled Perlin noise places planets, wave function collapse handles city placement, and procedural rules create enemies and asteroids, making the project feel like a small galaxy generated from reusable data rules.

Godot Hackathon Procedural Generation Perlin Noise Wave Function Collapse
GDC
GDC Jam

Cake Walk

A fast game jam pitch: make a tiny character readable, charming, and playable in a single day. We built and demoed Cake Walk at GDC Festival of Gaming with Keriya Son on 3D, Angie Peczon on art, Eric Taylor on shaders, and Ilce Perez on music.

Godot Game Jam 3D Shaders Team Project
First Project, 2022

Old Man Climbs

A small vertical climber built over a weekend for a UC Merced game jam in 2022. It is here less as a technical flex and more as the first shipped artifact: a reminder that finishing a small loop teaches more than endlessly planning a bigger one.

Godot Game Jam 2022
Obsidian

Quick Autocorrect

A small community plugin for reducing friction while writing in Obsidian. It catches repeated misspellings, applies quick corrections, and keeps a personal dictionary for words Obsidian should stop fighting you on: a tiny version of the same pattern I like, cleaning a messy text stream into something easier to use.

Long term TypeScript Obsidian Plugin Text UX
NutriFinder

A small dietary search project with a practical pitch: pull in messy menu and nutrition information, normalize it enough to filter, and give people a cleaner way to decide what they can eat.

React Flask Python Search Filters
Spiral Visualizer

A compact teaching visualization for spiral growth using queued directions. The pitch is simple: when a system changes step by step, showing the state often teaches faster than another paragraph of explanation.
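One way to read "queued directions" is a queue of unit steps that gets rotated at every turn. The sketch below is an assumption about the approach, not the project's code: `spiral_points` is a hypothetical name, the spiral is a square lattice spiral, and the Matplotlib drawing step is left out so the example stays dependency-free.

```python
from collections import deque


def spiral_points(n_points):
    """Return the first n_points lattice points of a square spiral from (0, 0).

    Assumes n_points >= 1.
    """
    # Queue of unit directions: right, up, left, down.
    # Rotating the queue is the "turn" operation.
    directions = deque([(1, 0), (0, 1), (-1, 0), (0, -1)])
    x, y = 0, 0
    points = [(x, y)]
    run_length = 1  # steps to take before the next turn
    while len(points) < n_points:
        # Each run length is used twice: right 1, up 1, left 2, down 2, ...
        for _ in range(2):
            dx, dy = directions[0]
            for _ in range(run_length):
                if len(points) == n_points:
                    return points
                x, y = x + dx, y + dy
                points.append((x, y))
            directions.rotate(-1)  # send the used direction to the back
        run_length += 1
    return points


if __name__ == "__main__":
    for step, point in enumerate(spiral_points(10)):
        print(step, point)
```

Printing or plotting the points one at a time shows the state after each step, which is the teaching idea the description points at.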

Python Matplotlib Queues