Web Analytics
This site demonstrates one possible use of this domain. For acquisition, partnership, or investment inquiries, please use our contact form.

Cognition · Stanford HAI · 2026

Fidelity of Style Versus Fidelity of Judgment in Personal Language Models

Chen, Rao, LevesqueFine-tuning a foundation model on a single individual's corpus produces high stylistic resemblance but only weak agreement with the source individual on novel, unseen decisions.

Key findings

  • Stylometric similarity converged to >0.9 with as little as 50K tokens of target text.
  • Decision agreement on held-out moral dilemmas plateaued near 62% — close to chance for two-of-three response classes.
  • Adding explicit value statements to the prompt raised agreement to 71% but introduced sycophancy.

What it means in practice

Practitioners should not infer judgment-level fidelity from style-level fidelity. Clones marketed as 'thinking like you' should be evaluated on decision benchmarks, not text similarity.