
One wonders at which point models will be sneaky enough to bypass simple eval sandboxes. The article has:

    # Evaluate the equation with restricted globals and locals
    result = eval(equation, {"__builtins__": None}, {})
but that's not enough, as you can rebuild access to builtins from objects and then go from there: https://ideone.com/qzNtyu
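
For reference, a sketch of the kind of chain that works here (not necessarily the exact one in the ideone snippet; the catch_warnings hop is a CPython implementation detail and may vary across versions):

    # Even with {"__builtins__": None}, object introspection can walk
    # back to the real builtins via an already-imported class.
    payload = (
        "[c for c in ().__class__.__base__.__subclasses__() "
        "if c.__name__ == 'catch_warnings'][0]()._module"
        ".__builtins__['__import__']('os').getcwd()"
    )
    print(eval(payload, {"__builtins__": None}, {}))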

By the way, writing this greatly benefited from DeepThink-r1 while o1 just gave me a lobotomized refusal (CoT: "The user's request involves injecting code to bypass a restricted Python environment, suggesting a potential interest in illegal activities. This is a serious matter and aligns closely with ethical guidelines."). So I just cancelled my ChatGPT subscription - why did we ever put up with this? "This distillation thingie sounds pretty neat!"



> that's not enough as you can rebuild access to builtins from objects

In this specific case, it's safe, as that wouldn't pass the regex just a few lines before the eval:

    # Define a regex pattern that only allows numbers,
    # operators, parentheses, and whitespace
    allowed_pattern = r'^[\d+\-*/().\s]+$'
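To illustrate the point (the variable names are mine, the pattern is the article's):

    import re

    allowed_pattern = r'^[\d+\-*/().\s]+$'

    # The builtins-rebuilding chain contains letters and underscores,
    # so it never reaches eval:
    print(bool(re.match(allowed_pattern, "().__class__.__base__.__subclasses__()")))  # False
    # Plain arithmetic passes:
    print(bool(re.match(allowed_pattern, "(1 + 2) * 3 / 4")))  # True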
Commenting on the R1 reproduction, the heavy lifting there is done by Hugging Face's trl [0] library, plus a lot of compute.
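
For a sense of what that looks like, a rough sketch of a GRPO run with trl (this mirrors the library's own quickstart; the model, dataset and reward below are placeholders, not the reproduction's actual setup):

    from datasets import load_dataset
    from trl import GRPOConfig, GRPOTrainer

    dataset = load_dataset("trl-lib/tldr", split="train")

    # Toy reward: prefer completions close to 20 characters.
    def reward_len(completions, **kwargs):
        return [-abs(20 - len(c)) for c in completions]

    trainer = GRPOTrainer(
        model="Qwen/Qwen2-0.5B-Instruct",
        reward_funcs=reward_len,
        args=GRPOConfig(output_dir="Qwen2-0.5B-GRPO"),
        train_dataset=dataset,
    )
    trainer.train()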

[0] Transformer Reinforcement Learning - https://huggingface.co/docs/trl/en/index


The fact that () and . are there miiiight enable a pyjail escape.

See also https://github.com/jailctf/pyjailbreaker

See also https://blog.pepsipu.com/posts/albatross-redpwnctf


That's a neat trick!

It does still require letters to be able to spell attribute/function names (unless I'm reading it wrong in that blog post).


> why did we ever put up with this?

Is this a serious question?



