Problem solving by superpositional steering

Inspired by

Training Large Language Models to Reason in a Continuous Latent Space
Reasoning by Superposition - A Theoretical Perspective on Chain of Continuous Thought the residual stream contains multiple features and explores them at the same time

So if we can find the direction for multiple problem solving strategies, and suppose that we know a problem can be attack using certain approaches, then we can preload those “problem solving strategy” directions into the residual stream.

Take competitive programming for example, if we have the feature direction for dfs, bfs, binary search, dynamic programming, etc., we can load some of them to the residual stream to make the model explores those directions explicitly.

Lone's notes

Recently Updated

Diffusion language modeling with maximum semantic likelihood

AI resources

Controlling reasoning duration with activation steering

Problem solving by superpositional steering

Steering multiple behaviours jointly

All notes

Problem solving by superpositional steering

Graph View