05
← Back to the timeline
Daily archive

Tuesday August 5, 2025

6 stories — deduplicated across sources, ranked by significance, every source cited.
10
SIGNIFICANCE
★ Top story · PolicyAug 5

Estimating worst case frontier risks of open weight LLMs

In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), where we attempt to elicit maximum capabilities by fine-tuning gpt-oss to be as capable as possible in two domains: biology and cybersecurity.