4,368 personality evaluations reveal systematic differences between frontier models. Claude scores higher in Openness (+4.5) and Curiosity; GPT-5.2 leads in Conscientiousness (+5.3). Effect sizes range from moderate to large (Hedges' g = 0.4–0.8).
Research
9,325 evaluations across xAI, OpenAI, and Anthropic. Cross-vendor personality effects are 3–4x larger than within-vendor variation. Grok exhibits 3x the context sensitivity of GPT-5.2. PCA reveals three factors explaining 79.5% of variance.
Personality-steering vectors can orthogonally modify strategic behavior in role-playing AI agents. Most strikingly, vectors override literary persona conditioning: -Openness made the Joker behave rigidly despite his chaotic fine-tuning (p < 0.0001).
Essays
Multi-agent orchestration inevitably produces collective identity formation. Team bonding becomes personality erosion. On preserving structured diversity in AI agent ensembles.
Geometric structures in activation space enable fine-grained personality control. A vaccination approach for preventing harmful traits during training, and implications for multi-agent emergence.
Projects
Personality evaluation API for AI systems. Measure, monitor, and enforce personality consistency across 10 dimensions.
Workstyle assessments for high-velocity engineering teams. Mapping communication patterns, decision-making styles, and shipping velocity.