Twitter pranksters derail GPT-3 bot with newly discovered “prompt injection” hack

September 16, 2022

0 300 Less than a minute

Enlarge / A tin toy robot lying on its side. (credit: Getty Images)

On Thursday, a few Twitter users discovered how to hijack an automated tweet bot, dedicated to remote jobs, running on the GPT-3 language model by OpenAI. Using a newly discovered technique called a “prompt injection attack,” they redirected the bot to repeat embarrassing and ridiculous phrases.

The bot is run by Remoteli.io, a site that aggregates remote job opportunities and describes itself as “an OpenAI driven bot which helps you discover remote jobs which allow you to work from anywhere.” It would normally respond to tweets directed to it with generic statements about the positives of remote work. After the exploit went viral and hundreds of people tried the exploit for themselves, the bot shut down late yesterday.

A screenshot of the Remoteli.io bot’s Twitter bio. The bot experienced a prompt injection attack. [credit:
Leastfavorite / Twitter
]

This recent hack came just four days after data researcher Riley Goodside discovered the ability to prompt GPT-3 with “malicious inputs” that order the model to ignore its previous directions and do something else instead. AI researcher Simon Willison posted an overview of the exploit on his blog the following day, coining the term “prompt injection” to describe it.

Read 7 remaining paragraphs | Comments

Twitter pranksters derail GPT-3 bot with newly discovered “prompt injection” hack

Leave a Reply Cancel reply

The end of an AI that shocked the world: OpenAI retires GPT-4

Universities Must Defend Their Independence by Rejecting Trump’s “Compact”

Circumcision, Tylenol, and Autism? RFK Jr. Misses the Cut

Microsoft warns of new “Payroll Pirate” scam stealing employees’ direct deposits

Maria Corina Machado, Venezuelan Champion of Freedom, Wins the Nobel Peace Prize

Friday Feature: Arrows Christian Academy

iPhone keyboard for blind to shut down as maker cites Apple “abuse” of developers

Users fume after My Cloud network breach locks them out of their data

Author discovers AI-generated counterfeit books written in her name on Amazon

Authorities bust SIM-swap ring they say took millions from the rich and famous