Askhat - stock.adobe.com

Tip

How data poisoning attacks work

Generative AI brings business opportunities to the enterprise but also security risks. Learn about an evolving attack vector called data poisoning and how it works.

Rob Shapland

By

Rob Shapland

Published: 13 Mar 2024

The ongoing use of AI and machine learning -- combined with the explosion in interest in generative AI tools, such as ChatGPT -- has led to inevitable questions about new cybersecurity risks they pose in the enterprise.

AI algorithms are trained on data sets, which might be incredibly large or could be relatively small if the AI tool is designed for a specific purpose, such as a narrow business use case. If an AI tool's data set is altered or corrupted in some way, the tool's output could be inaccurate and possibly even discriminatory or inappropriate.

In some cases, it might be possible for an attacker to poison the data set to introduce a backdoor or other vulnerability into the AI tool. Imagine, for example, that an AI model is trained to recognize suspicious emails or unusual behavior on a corporate network. A successful data poisoning attack could enable phishing or ransomware activity to go undetected and bypass email and spam filters.

How data poisoning attacks work

To launch a data poisoning attack, a threat actor needs access to the underlying data. Approaches vary depending on whether the data set is private or public.

Data poisoning attack on a private data set

In the case of a small, privately held data set used to train a specific AI tool, the attacker could be a malicious insider or a hacker who has gained unauthorized access.

In some cases, it might be possible for an attacker to poison the data set to introduce a backdoor or other vulnerability into the AI tool.

Such an actor might choose to poison only a small subset of data in what's known as a targeted attack. In this situation, the tool functions correctly the majority of the time, and the compromise flies under the radar of the software's owners.

Should a user prompt call upon the model to reference the corrupted data, however, the tool suddenly goes haywire and responds in a way that is completely different from what operators expected or intended. Depending on the industry and use case -- finance or healthcare, for example -- the implications could be costly and even life-threatening.

Data poisoning attack on a public data set

If the data used to train the AI tool is publicly available data, the poisoning would likely need to happen through a coordinated, multiparty effort.

A tool known as Nightshade, for example, enables artists to insert changes -- mostly invisible to the human eye but not to generative AI tools, such as Midjourney and Dall-E -- into their art, with the aim of confusing AI that uses it as training data without permission.

The changes that Nightshade makes can manipulate the AI into generating incorrect images -- for example, a house instead of a car -- effectively poisoning the tools' data sets and potentially undermining users' trust.

Nightshade operators' stated goal is to increase the cost of training AI on unlicensed data. In turn, AI operators might ultimately decide to configure their tools to avoid scraping content without permission.

How to prevent data poisoning attacks

Protecting against data poisoning requires a multilayered approach. For tools that do not use massive volumes of data -- those that meet narrow enterprise use cases, for example -- it is easier to ensure the integrity of the data set the tool is trained on and guarantee it comes only from trusted sources.

That said, it is possible to sanitize data from public sources, pre-processing it to ensure no deliberate errors have been introduced into the data set.

AI developers can also implement a procedural check that ensures any output meets certain standards, such as appropriateness and nondiscrimination, regardless of the data set or the user prompt.

Rob Shapland is an ethical hacker specializing in cloud security, social engineering and delivering cybersecurity training to companies worldwide.

Dig Deeper on Threats and vulnerabilities

Part of: Top LLM threats and how to defend against them

Up Next

Explore mitigation strategies for 10 LLM vulnerabilities

As large language models enter more enterprise environments, it's essential for organizations to understand the associated security risks and how to mitigate them.

4 types of prompt injection attacks and how they work

Compromised LLMs can expose sensitive corporate data and put organizations' reputations at risk. Learn about four types of prompt injection attacks and how they work.

How data poisoning attacks work

Generative AI brings business opportunities to the enterprise but also security risks. Learn about an evolving attack vector called data poisoning and how it works.

ChatGPT plugin flaws introduce enterprise security risks

Insecure plugin design -- one of the top 10 LLM vulnerabilities, according to OWASP -- opens enterprises to attacks. Explore ChatGPT plugin security risks and how to mitigate them.

How to identify and prevent insecure output handling

Sanitation, validation and zero trust are essential ways to reduce the threat posed by large language models generating outputs that could cause harm to downstream systems and users.

Search Networking

Private 5G for utilities: Benefits, use cases and deployment
Utilities increasingly choose private over public 5G for its superior control, flexibility and security, enabling applications ...
Breaking down Palo Alto Networks' $3.35B Chronosphere deal
Palo Alto Networks acquired observability platform Chronosphere for $3.35 billion. The deal aims to enable AI-driven autonomous ...
5G for public safety: Improved networks for first responders
The drones, surveillance systems and monitoring devices favored by public safety agencies aren't feasible without 5G's high ...

Search CIO

Four ways CIOs should help improve CX strategy
CIOs must take an active role in driving CX initiatives by getting closer to and better understanding customers, improving ...
A day in the life of a strategy-driven CIO
CIO Stephen Franchetti spends his days balancing IT operations with strategic planning. It's about putting out fires while laying...
Why enterprises shouldn't accept "good enough" AI ROI
Many companies see only modest AI gains while far greater value sits untouched. The real gap isn't technology, it's strategy.

Search Enterprise Desktop

How IT admins can check BIOS or UEFI versions in Windows 11
Firmware, such as BIOS or UEFI, plays a crucial role in how securely a Windows device starts and operates. Organizations need to ...
Microsoft opens Copilot agent building to office rank and file
The battle for desktop agent mindshare heats up. Microsoft is the latest to arm everyday office workers with tools to make their ...
Set up MFA in Microsoft 365 to safeguard data
Learn how to set up multifactor authentication in Microsoft 365 to enhance security, prevent unauthorized access and protect ...

Search Cloud Computing

The big three grab two-thirds of $107B cloud market in Q3
Cloud dominance intensifies as AWS, Microsoft and Google capture 63% of the $107B market. AWS leads at 29%, despite erosion, ...
Custom Amazon CloudWatch metrics: When default isn't enough
Transform your AWS monitoring beyond basic CPU and network stats. Discover how CloudWatch custom metrics unlock ...
Move from reactive to predictive cloud management with AI
Discover how AI transforms cloud management from reactive firefighting to predictive optimization. Learn executive strategies for...

ComputerWeekly.com

Dutch voters grasp digital urgency better than their politicians
A grassroots campaign has propelled digitally competent candidates into the Dutch parliament, despite party leaders placing them ...
Protecting the defenders: Addressing cyber's burnout crisis
The Computer Weekly Security Think Tank considers the burdens and responsibilities that accompany the role of chief information ...
Meta announces completion of core 2Africa cable
IT behemoth reveals final phase of subsea cable connecting Africa and the world

Close