Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> The reasoning part is not diferente from the part that goes in answer.

Exactly. And this instruction isn't telling it to skip the reasoning. That part is unaffected. The instruction is only for the user-visible output.

By the time the reasoning models get to writing the output you see, they've already decided what they are going to say. The answer is based on whatever it decided while reasoning. It doesn't matter whether you tell it to put the answer first or the explanation first. It already knows both by the time it starts outputting either.

You're basically hoping that adding more CoT in the output after reasoning will improve the answer quality. It won't. It's already done way more CoT while reasoning, and its answer is already decided by then.



Im sorry. You are thinking in terms of one time interactions. I’m thinking about the next step in the interaction.

To understand my point, think about a prompt to tell the model “here is a very difficult code problem, answer in a single word.”

It thinks a lot and answer. You send the next prompt. At this moment, you are completely in out of distribution territory.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: