Todo Fix bugs in direction extraction and rerun experiments Scale up to 70B model Create lib for steering in pure Pytorch Benchmark with Span(h,d) Apply on other behaviours Sentiment Linear Representations of Sentiment in Large Language Models Reasoning Understanding Reasoning in Thinking Language Models via Steering Vectors Harmfulness Understanding Reasoning in Thinking Language Models via Steering Vectors Programming Refusal with Conditional Activation Steering