- Architecting a Temporal-based workflow engine and an AI provider control plane (cost ledger, rate limiting, circuit breakers, webhook replay) powering a multi-domain platform.
- Established a domain-driven monorepo (portable platform core + business domains) with P0–P5 smoke gates and full observability (Grafana / Loki / Alloy).
- Building a skills-based LLM generation system (18+ domain skills — brand enrichment, brief/script generation, anti-fraud) with guardrails and quality evals.
Summary
AI-native product engineer who ships full systems end-to-end — backend, frontend, and the LLM-agent layer — by orchestrating AI coding agents at team-level throughput. 15 years building products: co-founder of App in the Air (7M users, Apple App of the Year) and a Top-Rated freelancer with $300K+ across 61 projects (~100% success). Recent focus: production agentic systems, multimodal LLM pipelines, and AI interpretability.
Selected Work
- Built an end-to-end LLM-agent support system (Vercel AI SDK, OODA tool-calling loop) across Telegram / Email / Web — intent classification, field extraction, automated ticketing; delivered in < 2 months (3 prototypes → production).
- Engineered multimodal intake (vision + Whisper STT) and context-aware memory over PostgreSQL/pgvector + Redis.
- Implemented a self-improving eval loop (45+ real cases → LLM-driven prompt refinement → regression gating in CI) and production infra (Docker, Langfuse, retry queues, graceful shutdown).
- Designed and prototyped an auditable corporate-memory system — immutable raw layer + re-creatable structured layer, giving end-to-end provenance from any AI output back to its source.
- Built LLM knowledge-graph extraction (entities / relations / facts) with pluggable per-domain lenses and type guards over PostgreSQL/pgvector, served via MCP — handed off as a prototype for further development.
- Architected an event-driven backend (Fastify, Temporal.io, 4 microservices) that scrapes, analyzes, and scores short-form video (TikTok / Reels / Shorts).
- Built multimodal video analysis on Gemini structured outputs (hook, pacing, CTA, niche, sentiment) and a 7-stage Python ML pipeline (correlation, clustering, anomaly detection → content directives).
- Built production-grade infra: PostgreSQL/Drizzle (20+ tables), Redis Streams, full observability (Prometheus / Grafana / Jaeger), billing & auth.
- Shipped a live, monetized Telegram AI product — dream interpretation plus an evolving per-user personality profile (Telegram Stars billing; 38% D30 retention among paying users).
- Built a multi-stage LLM pipeline — agentic mode routing, multi-provider image generation, per-tier cost guards, structured outputs (~$0.02/dream).
- Executed a zero-downtime, idempotent migration of 3,485 users / 6,340 records with cold-backup safety.
- 200+ experiments on extracting exact, verifiable logic from trained networks (lossless NN → formula → Verilog), vs. approximate methods (SHAP/LIME).
- Notable results: parity-128 in 40 parameters; a GPT-2 reasoning-coprocessor adapter lifting 8-bit arithmetic from 0%→97% at 0.08% of parameters; 131× FHE inference speedup.
- Rigorous methodology (multi-seed, Bonferroni correction, 3 verification rounds) with a documented falsified-hypotheses log.
- Built a solo full-stack real-time multiplayer game (Nuxt/Vue, Socket.io, Redis/BullMQ) with RU/EN localization and Telegram integration.
- Created a modular design system (Storybook, 26 components) and an Apify + OpenAI dictionary pipeline across a 10-module monorepo.
Experience
- $300K+ earned across 61 projects / 4,446 hours, ~100% success; clients primarily in the US, Canada, and Europe.
- 30+ MVPs and proofs-of-concept; deep involvement in product decisions and technical advising.
- Co-founded a travel-assistant app: 7M users, Apple App of the Year, Editors' Choice, preinstalled across Apple Stores worldwide.
- Interactive installations and real-time visual systems (TouchDesigner, Ventuz, GLSL, Arduino).