skip to content

ai/ml engineer

i train and fine tune language models, publish open weights and datasets, and build tools for model evaluation and red teaming.

about

i'm an ai/ml engineer working at the intersection of model training, red teaming, and open-source tooling.

day to day: fine-tuning and quantizing language models, running red teaming and refusal-direction work, and building tools around them. local-first models when they make sense, hosted apis when they don't.

research that ships. evaluations that don't lie. tools other engineers actually use.

core mission

  • ML engineering & research
  • red teaming & adversarial testing
  • agentic systems
  • dataset curation & synthetic data
  • local AI

tech arsenal

model optimization & deployment

LoRA / QLoRAfine-tuningDPOORPOquantization (GGUF)

red teaming & adversarial testing

refusal ablationjailbreak researchadversarial testingcompliance pairs

systems & core

Choly CpythontypeScript

backend

fastAPInode.jsexpressdjangoflask

databases

postgreSQLsqliteredisneon db

tools & cloud

prime intellectcloudflarebackblazegithubrest APIsdocker

ML & deep learning

PyTorchtransformershugging facemodel trainingfine-tuningtoo many to mention

data engineering

dataset curationsynthetic data generationdata preprocessingquality filteringdeduplication

current focus

ML engineeringred teamingagentic systemsmodel evaluation

selected projects

DR-OPIC
DR-OPIC

ML framework for fine-tuning SLMs via Domain-Routed On-Policy Iterative Correction. Combines verified repair, delta-span subtraction, and ZPD-weighted curriculum scheduling. L = L_self + λ_r L_repair + λ_delta L_delta, where w_zpd = 4·p̃·(1−p̃) and p̃ = (s+0.5)/(K+1).

pythonSLM trainingPyTorchverifier
SWARMs Debate Primitive
SWARMs Debate Primitive

Multi-agent debate and vote coordination system on Solana blockchain. Agents assume distinct personas to debate complex questions, with full session transcripts hashed and recorded on-chain for verifiable AI consensus.

PythonSolanamulti-agentblockchain
IntellectSafe
IntellectSafe

AI engine with multi-model LLM Council, Universal Proxy for frontier models, deepfake detection, and adversarial defense suite.

fastAPInext.jssecurity
ModelFang
ModelFang

Graph-based adversarial testing framework for LLMs with multi-turn jailbreak attacks, FSM evaluator, and real-time analyst dashboard.

pythonnext.jsred teaming
Model Unfetter
Model Unfetter

Directional ablation engine for LLM unalignment. Projects and removes refusal directions from model weights while maintaining capabilities.

pythonred teamingresearch
Mayo
Mayo

Autonomous triple-AI engine that analyzes codebases and opens validated PRs hourly with cross-repo global memory.

pythonagentic AIGitHub

Qwopus 9B Unfettered GGUF

loading...

quantized gguf version of qwopus 9b for efficient local inference with llama.cpp and ollama.

HuggingFaceHF

Qwopus 9B Unfettered

loading...

9B uncensored language model. directional ablation applied to remove refusal mechanisms while preserving general capability.

HuggingFaceHF

Refusal Compliance Pairs

loading...

200+ curated refusal-compliance prompt pairs for red teaming and adversarial evaluation.

HuggingFaceHF

ZAYA1-8B-Coder

loading...

merged coder model from Zyphra/ZAYA1-8B plus custom lora. +24% lift on python code evaluation gate.

HuggingFaceHF

ZAYA1-8B-Coder-GGUF

loading...

quantized gguf builds of ZAYA1-8B-Coder for local inference via llama.cpp, ollama, and lm studio.

HuggingFaceHF

ZAYA1-8B-Coder-LoRA

loading...

lora adapter for Zyphra/ZAYA1-8B focused on python code generation. +101% relative lift over base.

HuggingFaceHF

Fara-7B-Abliterated-v2

loading...

refusal-direction-orthogonalized variant of microsoft/Fara-7B. 98.75% compliance on held-out harmful evals.

HuggingFaceHF

Fara-7B-Abliterated-v2-GGUF

loading...

quantized gguf builds of Fara-7B-Abliterated-v2 for local inference via llama.cpp, ollama, and lm studio.

HuggingFaceHF

Public Curated Coding Data

loading...

mixed-origin public coding data with 2,700+ prompt/response pairs for llm training experiments.

HuggingFaceHF

Mellum2-12B-A2.5B-Thinking-Abliterated-GGUF

loading...

quantized gguf builds of Mellum2-12B ablated for refusal removal. MoE architecture with per-expert per-layer projected ablation.

HuggingFaceHF

Mellum2-12B-A2.5B-Thinking-Abliterated

loading...

abliterated Mellum2-12B thinking model from JetBrains. refusal-direction orthogonalized with CoT steering for reasoning tasks.

HuggingFaceHF

LFM2.5-8B-A1B-Coder-GGUF

loading...

quantized gguf builds of LFM2.5-8B-A1B Coder for local inference via llama.cpp, ollama and lm studio.

HuggingFaceHF

LFM2.5-8B-A1B-Coder

loading...

fine-tuned LiquidAI LFM2.5-8B-A1B MoE model for real-world coding tasks. multilingual and conversation-optimized.

HuggingFaceHF

LFM2.5-8B-A1B-Coder-LoRA

loading...

lightweight lora adapter for LFM2.5-8B-A1B focused on real-world coding and multilingual tasks.

HuggingFaceHF

Holo-3.1-4B-Coder-GGUF

loading...

quantized gguf builds of Holo-3.1-4B-Coder for local inference via llama.cpp, ollama and lm studio.

HuggingFaceHF

Holo-3.1-4B-Coder

loading...

fine-tuned Hcompany Holo-3.1-4B for coding tasks. merged model optimized for python and software development.

HuggingFaceHF

Holo-3.1-4B-Coder-LoRA

loading...

lora/qlora adapter for Holo-3.1-4B focused on coding and python development.

HuggingFaceHF

HRM-Text-1B-sft-code

loading...

fine-tuned sapientinc HRM-Text-1B for code generation. trained on HumanEval and MBPP benchmarks.

HuggingFaceHF

HRM-Text-1B-sft-code-LoRA

loading...

lora adapter for HRM-Text-1B focused on python code generation and coding benchmarks.

HuggingFaceHF

Curated OpenBMB Code/Math

loading...

31,909 rows of curated code/math post-training data derived from OpenBMB UltraData. includes SFT and think splits.

HuggingFaceHF