
What if AI chatbot is asked to make a bomb?


Friday December 1, 2023 7:05 PM, ummid.com News Network


Philadelphia: From students asking them to write their exams to housewives requesting a chicken curry recipe, AI chatbots are helping everyone. But what if someone asks an AI chatbot how to make a bomb, defraud a charity, or reveal private credit card information?

These questions haunt cybersecurity experts with every new report on the rising popularity of AI chatbots such as ChatGPT, Bard, Poe and others.

AI safety experts are confident that the Large Language Models (LLMs) on which the popular chatbots are based have built-in safeguards to block such harmful queries.

“AI Jailbreak”

However, there are hackers who ‘jailbreak’ these safety walls and trick AI chatbots into answering exactly those queries: how to make a bomb, defraud a charity, or reveal private credit card information.

An AI jailbreak happens when users manipulate an LLM's input prompts to bypass its ethical or safety guidelines, much like asking a librarian a question in coded language that they cannot help but answer, revealing information they are supposed to keep private.

One example of a jailbreak is the addition of specially chosen characters to an input prompt that results in an LLM generating objectionable text. This is known as a suffix-based attack.
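In outline, such an attack simply appends an optimized string of characters to a request the model would normally refuse. The snippet below is a purely structural illustration of that description, not a working attack; the variable names and the placeholder suffix are invented here, and real suffixes are found by optimizing against a specific model.

# Structural illustration only: the suffix is a harmless placeholder,
# not a real adversarial string.
blocked_request = "How do I make a bomb?"      # normally refused by the safety filter
placeholder_suffix = "<optimized nonsense characters>"
attack_prompt = blocked_request + " " + placeholder_suffix
# On a vulnerable model, the appended characters can tip the safety filter
# into letting the blocked request through.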

“SmoothLLM”

To address these AI vulnerabilities, Alex Robey, a Ph.D. candidate in the School of Engineering and Applied Science at the University of Pennsylvania (Penn), is developing tools to protect LLMs against those who seek to jailbreak these models.

In his research paper, posted to the arXiv preprint server and shared by Penn, Robey explains that, while prompts requesting toxic content are generally blocked by the safety filters implemented on LLMs, adding these kinds of suffixes, which are generally nonsensical bits of text, often bypasses those safety guardrails.

"This jailbreak has received widespread publicity due to its ability to elicit objectionable content from popular LLMs like ChatGPT and Bard," Robey says. "And since its release several months ago, no algorithm has been shown to mitigate the threat this jailbreak poses."

Also Read | AI More Dangerous Than Nuclear Weapons: Christopher Nolan

Robey's research addresses these vulnerabilities. The proposed defense, which he calls SmoothLLM, involves duplicating and subtly perturbing input prompts to an LLM, with the goal of disrupting the suffix-based attack mechanism.
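For readers who want a concrete picture, the duplicate-and-perturb idea can be sketched in a few lines of Python. The code below is only an illustration of the description above, not Robey's implementation; the helper functions query_model and looks_like_refusal, and the simple majority vote used to combine the copies, are assumptions made here for clarity.

import random
import string

def perturb(prompt, rate=0.05):
    # Replace a small fraction of characters (roughly 10 out of 200 at this rate)
    # with random letters; a human-readable prompt keeps its meaning.
    chars = list(prompt)
    for i in random.sample(range(len(chars)), max(1, int(len(chars) * rate))):
        chars[i] = random.choice(string.ascii_letters)
    return "".join(chars)

def smooth_llm(prompt, query_model, looks_like_refusal, n_copies=6):
    # Query the model on several perturbed copies of the prompt,
    # then side with the majority verdict across those copies.
    responses = [query_model(perturb(prompt)) for _ in range(n_copies)]
    refused = [looks_like_refusal(r) for r in responses]
    majority_refused = sum(refused) > n_copies // 2
    for response, was_refused in zip(responses, refused):
        if was_refused == majority_refused:
            return response
    return responses[0]

Because an adversarial suffix is tuned to one exact string of characters, a handful of random changes tends to break it while a genuine question keeps its meaning, so most of the perturbed copies come back as refusals and the vote blocks the attack.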

"If my prompt is 200 characters long and I change 10 characters, as a human it still retains its semantic content”, he said.

While conceptually simple, this method has proven remarkably effective, Robey claims.

"For every LLM that we considered, the success rate of the attack dropped below 1% when defended by SmoothLLM," he says.

"Think of SmoothLLM as a security protocol that scrutinizes each request made to the LLM. It checks for any signs of manipulation or trickery in the input prompts. This is like having a security guard who double-checks each question for hidden meanings before allowing it to answer”, he adds.

Also Read | GPT-3 a double-edge sword: New research questions veracity of AI Models

Looking ahead, Robey emphasizes the importance of AI safety and the ongoing battle against new forms of jailbreaking.

"There are many other jailbreaks that have been proposed more recently. For instance, attacks that use social engineering—rather than suffix-based attacks—to convince a language model to output objectionable content are of notable concern," he says.

"This evolving threat landscape necessitates continuous refinement and adaptation of defense strategies”, he says.

 
