They team specifically "use AI agents built from GPT-4o and Claude 3.5 Sonnet". The question here is "how did they manage to do so" not "what else can do it with less effort".
As those two are run by companies actively trying to prevent their tools being used nefariously, this is also what it looks like to announce they found an unpatched bug in an LLM's alignment. (Something LessWrong, where this was published, would care about much more than Hacker News).