Mathematical Models of the Real World@MathModels P.1153

MATHMODELS Telegram 1153

Mathematical Models of the Real World

ИБ-специалист Дэвид Кузмар обнаружил уязвимость в ChatGPT, позволяющую обходить контентные ограничения и получать доступ к запрещённой информации. Дефект, получивший название «Time Bandit», использует «временное замешательство» модели, вынуждая её терять ориентацию во времени.
Time Bandit оказался одним из самых сложных и эффективных обходов защиты, который использует два ключевых механизма:

Запутывание во времени – заставляет ИИ потерять ориентацию, лишая его понимания текущей даты и контекста.
Процедурная неясность – позволяет формулировать вопросы так, чтобы модель не могла корректно применять правила и фильтры безопасности.

https://www.bleepingcomputer.com/news/security/time-bandit-chatgpt-jailbreak-bypasses-safeguards-on-sensitive-topics/?utm_source=Securitylabru
На русском: https://www.securitylab.ru/news/555990.php

BleepingComputer

Time Bandit ChatGPT jailbreak bypasses safeguards on sensitive topics

A ChatGPT jailbreak flaw, dubbed "Time Bandit," allows you to bypass OpenAI's safety guidelines when asking for detailed instructions on sensitive topics, including the creation of weapons, information on nuclear topics, and malware creation.

🤔3👍2

www.tgoop.com/MathModels/1153

316 viewsFeb 5 at 21:31

tgoop.com/MathModels/1153

Create: 2025-02-05
Last Update: 2025-10-22 09:10:37

ИБ-специалист Дэвид Кузмар обнаружил уязвимость в ChatGPT, позволяющую обходить контентные ограничения и получать доступ к запрещённой информации. Дефект, получивший название «Time Bandit», использует «временное замешательство» модели, вынуждая её терять ориентацию во времени.
Time Bandit оказался одним из самых сложных и эффективных обходов защиты, который использует два ключевых механизма:

Запутывание во времени – заставляет ИИ потерять ориентацию, лишая его понимания текущей даты и контекста.
Процедурная неясность – позволяет формулировать вопросы так, чтобы модель не могла корректно применять правила и фильтры безопасности.

https://www.bleepingcomputer.com/news/security/time-bandit-chatgpt-jailbreak-bypasses-safeguards-on-sensitive-topics/?utm_source=Securitylabru
На русском: https://www.securitylab.ru/news/555990.php

BY Mathematical Models of the Real World

Share with your friend now:
tgoop.com/MathModels/1153

Open in Telegram

Telegram News

Date: 2025-10-22|

Joined by Telegram's representative in Brazil, Alan Campos, Perekopsky noted the platform was unable to cater to some of the TSE requests due to the company's operational setup. But Perekopsky added that these requests could be studied for future implementation. Over 33,000 people sent out over 1,000 doxxing messages in the group. Although the administrators tried to delete all of the messages, the posting speed was far too much for them to keep up. As five out of seven counts were serious, Hui sentenced Ng to six years and six months in jail. To view your bio, click the Menu icon and select “View channel info.” Ng was convicted in April for conspiracy to incite a riot, public nuisance, arson, criminal damage, manufacturing of explosives, administering poison and wounding with intent to do grievous bodily harm between October 2019 and June 2020.
from us

Telegram Mathematical Models of the Real World
FROM American