Tweet by Aidan Gomez: The fact that LLMs lack a constrained inner monologue feels like a fairly catastrophic weakness that needs to be changed. They're certainly given time to think between tokens, but that thinking is fixed compute, brief, and left totally undiscretised.