• nymnympseudonym@piefed.social
    link
    fedilink
    English
    arrow-up
    6
    ·
    11 days ago

    Hegseth’s boozy greasy hands are all over this.

    https://www.anthropic.com/news/fable-mythos-access

    • In the weeks leading up to the launch of Fable, Anthropic worked with the US government, the UK AISI, multiple private third-party organizations and internal teams to red-team Fable’s safeguards for thousands of hours in total.
    • These tests showed that Fable’s safeguards are substantially more effective than those of any previously deployed model.
    • No testers have yet been able to find a universal jailbreak—a jailbreak method that can very broadly bypass the model’s safeguards, unblocking a wide range of cyber capabilities.
    • We suspect that perfect jailbreak resistance is not currently possible for any model provider. Every safeguard used in the industry is vulnerable to non-universal jailbreaks (which can elicit some cyber information in specific circumstances), and it is likely that universal jailbreaks will eventually be found in the future. We stated this clearly when we released Fable 5.
    • Given that perfect jailbreak resistance does not appear to be possible today, Anthropic adopted a defense in depth strategy with Fable 5 […]