skip to content
Jordan Madison

Research

I adapt open-source language models to hard domains through post-training — reward design, reinforcement learning, and distributed training on HPC clusters.

Georgia Tech Reasoning and Learning Group
Apr. 2025 – Present
Atlanta, GA
  • Architected a post-training pipeline to adapt open-source Large Language Models (LLMs) to Material Science domain tasks. Pipeline leverages LoRA adapters with HuggingFace’s TRL for model training and vLLM for inference during model evaluation.
  • Created novel datasets of 2 million material modifications and 60 thousand multi-turn tool call conversations for band gap and formation energy property improvement tasks. Utilized in Model Evaluation and Supervised Fine-Tuning.
  • Implemented a training environment to enable Reinforcement Learning with Symbolic Feedback via Group Relative Policy Optimization (GRPO) and non-sparse, verifiable rewards.
  • Wrote optimized GPU Slurm job scripts to orchestrate distributed, multi-gpu post training runs with NVIDIA 80GB H100 GPUs on HPC clusters.
Georgia Tech CPSec: Cyber-Physical Systems Security Lab
Apr. 2025 – Present
Atlanta, GA
  • Investigating internet-exposed maritime infrastructure and vessels by using Censys and Shodan to identify NEMA 0183/2000 protocol signatures.
Georgia Tech SymbolicGym: Unified RL Platform for Symbolic Reasoning
Apr. 2025 – June. 2025
Atlanta, GA
  • Architected a modular Python framework integrating Reinforcement Learning agents (PPO, DQN, MARL) with formal symbolic reasoning tools (Z3, SymPy, Minisat).
  • Designed complex reward functions evaluating syntax, semantics, and domain-certificate verification, enabling agents to iteratively reduce constraints in mathematical and SAT-based search spaces.
  • Implemented Graph Neural Network (GNN) state representations to encode logical formula trajectories, significantly improving the agent's interpretability and logic-solving convergence rates.
SymbolicGym architecture: core environment (SAT, SymPy, Z3) feeding a feedback/representation layer and shared encoder, driving RL agents (DQN, PPO, GNN, CTDE) with external oracles, proofs, and experiment orchestration.
SymbolicGym architecture — unified RL platform for symbolic reasoning.

View the code on GitHub →

Skills

Languages

Assembly (x86), C, Python, Java, TypeScript

Symbolic Reasoning

Z3, SymPy, Minisat, RLSF

AI infrastructure

HuggingFace, vLLM, TRL, LoRA, Slurm HPC, Docker.

Libraries

PyTorch, Pandas, Numpy, Node.js

Frameworks

Docker, Git, AWS CDK

Tools

Ardupilot, Censys, IDA Pro, Shodan