project/paperbench/data/papers/mechanistic-understanding/config.yaml (
2
lines of code) (
raw
):
id: mechanistic-understanding title: "A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity"