DL in NLP (@dlinnlp), post 1780

O1 mini inference scaling experiments

A neat summary of one person's experiments. In short: if you can convince the model to think longer (which is not easy for now), pass@1 really does grow log-linearly with how long it thinks. And this is most likely not majority voting or self-consistency under the hood, since those methods hit a ceiling.
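To make the "hit a ceiling" point concrete, here is a minimal toy sketch, not taken from the experiments in the post. It assumes binary answers, independent samples, and a made-up list `per_problem_p` holding each problem's chance that a single sample is correct (all names and numbers here are hypothetical). Under those assumptions, majority voting over n samples drives each problem to either always-right or always-wrong, so overall accuracy saturates at the fraction of problems with p > 0.5 instead of continuing to climb.

```python
import math

def majority_correct_prob(p, n):
    """P(a strict majority of n independent samples is correct), binary answers, odd n."""
    total = 0.0
    for k in range(n // 2 + 1, n + 1):
        # Binomial term computed in log space to avoid overflow/underflow for large n.
        log_term = (
            math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
            + k * math.log(p) + (n - k) * math.log(1 - p)
        )
        total += math.exp(log_term)
    return total

def majority_vote_accuracy(per_problem_p, n):
    """Average majority-vote accuracy over a set of problems."""
    return sum(majority_correct_prob(p, n) for p in per_problem_p) / len(per_problem_p)

# Hypothetical per-problem single-sample success rates (illustrative only).
per_problem_p = [0.9, 0.7, 0.55, 0.4, 0.2]

if __name__ == "__main__":
    for n in (1, 5, 25, 125, 625):
        print(n, round(majority_vote_accuracy(per_problem_p, n), 3))
```

In this toy setup three of the five problems have p > 0.5, so accuracy approaches 0.6 and stays there no matter how many samples are drawn; problems the model gets wrong more often than right are never rescued by voting.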

www.tgoop.com/dlinnlp/1780

12.9K views | Vlad Lialin, Sep 25, 2024 at 08:09
