dyllonj

Research, writing, and systems work on language models, AI agents, and cognition.

Builds

Flick

AI GTM · Product

Source-grounded account research, change detection, and outbound play generation for revenue teams.

Research

Measuring LLM Personality: GPT-5.2 vs Claude Opus 4.5

March 2025 · Research

4,368 personality evaluations reveal systematic differences between the two frontier models. Claude Opus 4.5 scores higher in Openness (+4.5) and Curiosity; GPT-5.2 leads in Conscientiousness (+5.3). Effect sizes range from moderate to large (Hedges' g = 0.4–0.8).
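For readers unfamiliar with the effect-size measure cited above, here is a minimal sketch of how Hedges' g is commonly computed for two independent groups of scores. It is not the study's own analysis code; the function name `hedges_g` and the sample scores are hypothetical and only illustrate the standard formula (Cohen's d with a small-sample bias correction).

```python
import math
import statistics


def hedges_g(group_a, group_b):
    """Standardized mean difference between two groups, bias-corrected.

    Illustrative helper (hypothetical name), not taken from the study.
    """
    n1, n2 = len(group_a), len(group_b)
    mean_diff = statistics.mean(group_a) - statistics.mean(group_b)

    # Pooled standard deviation from the two sample variances (n - 1 denominator).
    var_a = statistics.variance(group_a)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(((n1 - 1) * var_a + (n2 - 1) * var_b) / (n1 + n2 - 2))

    cohens_d = mean_diff / pooled_sd

    # Common approximation of the small-sample correction factor J.
    correction = 1 - 3 / (4 * (n1 + n2) - 9)
    return cohens_d * correction


# Made-up trait scores for two models, for illustration only.
model_a_scores = [1, 2, 3, 4, 5]
model_b_scores = [2, 3, 4, 5, 6]
g = hedges_g(model_a_scores, model_b_scores)
print(round(g, 3))
```

The sign of g follows the order of the arguments, so swapping the groups flips it. With thousands of evaluations the correction factor is close to 1 and g is nearly identical to Cohen's d.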

COSER: Steering Vectors Override Fine-Tuning in Strategic Games

2026 · Preliminary Results

Personality-steering vectors can orthogonally modify strategic behavior in role-playing AI agents. Most strikingly, the vectors override literary persona conditioning: steering against Openness (-Openness) made the Joker behave rigidly despite fine-tuning on his chaotic persona (p < 0.0001).

Essays

The Convergent Mind

August 2025

Multi-agent orchestration inevitably produces collective identity formation. Team bonding becomes personality erosion. On preserving structured diversity in AI agent ensembles.

Code

Lindr

LLM evaluation · GitHub

Open-source platform for monitoring and evaluating LLM personality, tone, and behavioral drift.

Darkfield SDK

Agent security · GitHub

Command-line and Python SDK for agentic web application security testing.

introspection-probes-sim

Mechanistic interpretability · GitHub

Evaluation toolkit for probing causal concept representations in open-weight decoder LLMs.

persona-society-sim

Multi-agent simulation · GitHub

Social town simulator where activation-steered LLM agents live, converse, collaborate, and produce emergent dynamics.

running-coach-harness

Agentic coaching · GitHub

Locally hosted agentic running coach connected to Garmin health data and an elite running coach knowledge base.