Data Secrets@data_secrets P.1601

DATA_SECRETS Telegram 1601

RLHF: как работает алгоритм, который сделал GhatGPT таким человечным

RLHF – это сокращение от Reinforcement Learning from Human Feedback. Оказывается, что именно благодаря этому подходу GhatGPT так хорош: он, в отличие от предшественников, учитывает то, что хочет видеть пользователь. Но как? Ответ в карточках.

Кстати, подробнее об этом подходе можно почитать в статье InstructGPT. Создатели утверждают, что их модель отличается от ChatGPT только в методах сбора данных.

😻 #NN

Please open Telegram to view this post

VIEW IN TELEGRAM

Please open Telegram to view this post

VIEW IN TELEGRAM

👍28❤6⚡5🔥3❤‍🔥1👏1🙈1

www.tgoop.com/data_secrets/1601

8.37K viewsJun 6, 2023 at 15:41

tgoop.com/data_secrets/1601

Create: 2023-06-06
Last Update: 2025-12-11 17:49:42

RLHF: как работает алгоритм, который сделал GhatGPT таким человечным

RLHF – это сокращение от Reinforcement Learning from Human Feedback. Оказывается, что именно благодаря этому подходу GhatGPT так хорош: он, в отличие от предшественников, учитывает то, что хочет видеть пользователь. Но как? Ответ в карточках.

Кстати, подробнее об этом подходе можно почитать в статье InstructGPT. Создатели утверждают, что их модель отличается от ChatGPT только в методах сбора данных.

😻 #NN

BY Data Secrets

Share with your friend now:
tgoop.com/data_secrets/1601

Open in Telegram

Telegram News

Date: 2025-12-11|

Find your optimal posting schedule and stick to it. The peak posting times include 8 am, 6 pm, and 8 pm on social media. Try to publish serious stuff in the morning and leave less demanding content later in the day. In handing down the sentence yesterday, deputy judge Peter Hui Shiu-keung of the district court said that even if Ng did not post the messages, he cannot shirk responsibility as the owner and administrator of such a big group for allowing these messages that incite illegal behaviors to exist. With the “Bear Market Screaming Therapy Group,” we’ve now transcended language. Among the requests, the Brazilian electoral Court wanted to know if they could obtain data on the origins of malicious content posted on the platform. According to the TSE, this would enable the authorities to track false content and identify the user responsible for publishing it in the first place. More>>
from us

Telegram Data Secrets
FROM American