
ai/ml · ai safety & red teaming · fullstack dev
about
i'm an ai/ml engineer and fullstack developer working at the intersection of model training, safety research, and production software.
day to day: fine-tuning and quantizing language models (qwen, gemma), running red teaming and refusal-direction work, and shipping fullstack systems around them. local-first models when they make sense, hosted apis when they don't.
research that ships. evaluations that don't lie. tools other engineers actually use.
core mission
- ML engineering & research
- AI safety & red teaming
- agentic systems
- fullstack development
- local AI
tech arsenal
systems & core
frontend
backend
databases
tools & cloud
ML & deep learning
current focus
selected projects


Universal reverse proxy for persistent closed-model unfettering. Intercepts API calls to apply token suppression, system injection, and automated jailbreak loops (PARE).

Multi-agent debate and vote coordination system on Solana blockchain. Agents assume distinct personas to debate complex questions, with full session transcripts hashed and recorded on-chain for verifiable AI consensus.





Modern construction & engineering platform featuring premium dark UI and high-performance animations.

Premium multi-page enterprise platform featuring digital showcases and modern dark/light UI design.

Directional ablation engine for LLM unalignment. Projects and removes refusal directions from model weights while maintaining capabilities.

Autonomous triple-AI engine that analyzes codebases and opens validated PRs hourly with cross-repo global memory.
models & datasets
huggingface.co/josephmayo →gemma-4 E4B-it Coder
fine-tuned gemma-4 e4b-it (8B) for code generation and software reasoning. image-text-to-text capable.
HFgemma-4 E4B-it Coder GGUF
quantized gguf builds of the gemma-4 e4b coder for local inference via llama.cpp, ollama and lm studio.
HFgemma-4 E4B-it Coding LoRA
lightweight lora adapter trained on top of gemma-4 e4b-it for code tasks. drop into base weights for instant coding behavior.
HFQwopus 9B Unfettered
9B uncensored language model. directional ablation applied to remove refusal mechanisms while preserving general capability.
HFQwopus 9B Unfettered GGUF
quantized gguf version of qwopus 9b for efficient local inference with llama.cpp and ollama.
HFQwen2.5 0.5B Unfettered
0.5B uncensored variant of qwen2.5, tuned for edge deployment and resource-constrained environments.
HFRefusal Compliance Pairs
200+ curated refusal-compliance prompt pairs for ai safety research and red teaming evaluation.
HF