Poolgrad — Memory Aware ML Runtime to Explore Neural Network Performance
A minimal autograd engine built from scratch in Rust exploring where
neural network performance actually comes from. Implements reverse-mode
autograd over a dynamic graph, four matrix multiplication kernels
(naive, tiled, packed+SIMD, and an experimental Strassen-form mp variant),
a kernel scheduler, and a gradient memory pool with lifetime-based release.
CPU-only; SIMD via NEON on arm64 and AVX2+FMA on x86_64.
STACK Rust, Rayon, NEON/AVX2+FMA SIMD
ARCH Dynamic autograd graph, kernel scheduler, size-based gradient memory pool
Research-driven LLM architecture exploring energy-based self-learning
for scientific reasoning, symbolic mathematics, and formal logic.
Combines contrastive energy functions with autoregressive generation
to enable iterative self-correction without external reward models.
Real-time chat system built from scratch over raw WebSocket connections.
Persistent message storage, presence detection, typing indicators,
and room-based multiplexing — no third-party chat SDKs.
Mobile application that clusters photos using on-device ML
for perceptual similarity. Extracts feature embeddings, runs
hierarchical clustering, and groups visually related images
without uploading data to any external server.
Deterministic spatial interaction engine for generating structured
synthetic datasets. Simulates grid-based environments with
configurable physics, agent policies, and collision semantics —
designed to produce training data for autonomous agent learning.
STACK Python, NumPy, JSON schema
ARCH Deterministic simulation, configurable state machines, batch export
M2D2
AI-driven microplastic detection pipeline combining optical particle
counters with Bulk Acoustic Wave (BAW) preprocessing. Classifies
particle morphology through signal analysis and implements a
toxin-free acoustic capture system for microplastic recovery
across varied aquatic environments.
STACK Python, MATLAB, signal processing, optical sensor data