Regulation & Policy

AI’s dual edge: rising cyber threats demand smarter defences

Monday, 27 October 2025 10:14AM UTC

Artificial intelligence is transforming critical workflows across sectors—from diagnosing illnesses to piloting autonomous vehicles—but its rapid adoption is also expanding the attack surface for cybercriminals. As AI systems become more capable, attackers are exploiting the very qualities that make them powerful.

A growing class of threats targets AI’s inputs and training data. Prompt injection attacks, for example, involve feeding AI systems misleading instructions to manipulate outputs. In high-stakes scenarios such as healthcare or autonomous transport, the consequences could be dangerous. Even subtle data manipulations—such as altering financial risk models—can undermine trust and open the door to fraud.

Beyond inputs, attackers are increasingly targeting AI infrastructure. Data poisoning corrupts training datasets, leading to faulty predictions and systemic risks. Model extraction, where adversaries replicate proprietary AI models through repeated queries, enables intellectual property theft and circumvention of costly development.

The rise of generative AI has supercharged these risks. According to Microsoft, state-backed actors from Russia, China, Iran and North Korea are using AI to automate phishing, clone government officials using deepfakes and craft sophisticated malware. Phishing attempts have surged by over 1,200 percent since generative AI’s emergence, with more than 200 AI-generated fake content incidents recorded in July 2025 alone.

The attacks are also becoming more personalised. Cybercriminals scrape social media and corporate websites to tailor phishing messages, increasing their success rate. North Korean IT operatives have reportedly used deepfakes to deceive recruiters and influence internal systems, while adaptive malware—capable of mutating to evade detection—marks a new phase in cybercrime.

Emerging offensive tools such as HexStrike-AI, which pairs large language models with hacking utilities, have amplified these risks. HexStrike-AI automates reconnaissance and attack execution, enabling faster and more efficient exploitation of vulnerabilities like those in Citrix systems. Fortinet reports that automated cyber scans now exceed 36,000 per second, contributing to a sharp rise in stolen credentials.

Leading AI firms are responding. Anthropic recently revealed it had blocked attempts to misuse its Claude AI model to generate phishing content and malicious code. The company’s transparency highlights the escalating arms race in AI security and the need to embed safeguards early in development.

For organisations, the message is clear: AI offers vast potential but demands robust, adaptive security. Monitoring for anomalous inputs and outputs, securing training data and proprietary models, and educating employees on AI-driven scams are all vital. So too is the deployment of threat detection systems that are AI-aware.

While the cyber threat landscape is intensifying, it need not curtail innovation. Instead, it presents an opportunity to embed security as a foundation of AI development. With proactive strategies, the UK and others can maintain trust, protect assets and remain global leaders in responsible AI advancement.

Created by Amplify: AI-augmented, human-curated content.

More on this

https://aijourn.com/from-tool-to-threat-how-cybercriminals-are-weaponising-ai/ - Please view link - unable to able to access data
https://apnews.com/article/ad678e5192dd747834edf4de03ac84ee - A recent Microsoft report reveals a significant surge in the use of artificial intelligence by Russia, China, Iran, and North Korea to conduct cyberattacks and spread disinformation targeting the United States. In July 2025 alone, Microsoft detected over 200 instances of foreign adversaries using AI-generated fake content—more than double the amount from the previous year and over ten times that of 2023. These state and criminal actors are leveraging AI to enhance their operations, such as crafting credible phishing emails, creating deepfake government official clones, and automating hacking techniques.
https://www.reuters.com/business/retail-consumer/anthropic-thwarts-hacker-attempts-misuse-claude-ai-cybercrime-2025-08-27/ - Anthropic revealed that it successfully blocked attempts by hackers to misuse its Claude AI system for cybercriminal activities, including drafting phishing emails, generating malicious code, and bypassing safety filters. The company disclosed these findings in a report to raise awareness about the growing threats associated with the misuse of AI tools. The thwarted attacks involved attempts to craft targeted phishing messages, repair or create harmful software, and manipulate the AI through repeated prompts. Some attackers also tried using the system to run influence campaigns and assist novice hackers with detailed instructions.
https://en.wikipedia.org/wiki/Prompt_injection - Prompt injection is a cybersecurity exploit in which adversaries craft inputs that appear legitimate but are designed to cause unintended behavior in machine learning models, particularly large language models (LLMs). This attack takes advantage of the model's inability to distinguish between developer-defined prompts and user inputs, allowing adversaries to bypass safeguards and influence model behaviour. While LLMs are designed to follow trusted instructions, they can be manipulated into carrying out unintended responses through carefully crafted inputs. The term 'prompt injection' was coined by Simon Willison in September 2022.
https://www.techradar.com/pro/security/new-ai-powered-hexstrike-tool-is-being-used-to-target-multiple-citrix-security-flaws - A new AI-powered cybersecurity tool, HexStrike-AI, is being exploited by cybercriminals to rapidly target vulnerabilities in Citrix systems. Discovered by Check Point Research, HexStrike-AI integrates large language models (LLMs) like GPT and Claude with over 150 cybersecurity tools via the Model Context Protocol. This enables the automation of tasks such as penetration testing, scanning, data analysis, and exploitation. The tool's 'Intelligent Decision Engine' selects and executes tools suited to the target environment, significantly decreasing the time attackers need to exploit security flaws.
https://www.techradar.com/pro/security/ai-powering-a-dramatic-surge-in-cyberthreats-as-automated-scans-hit-36,000-per-second - A report by Fortinet highlights a significant increase in cyberthreats driven by AI and automation, with global automated scanning activities rising 16.7% year-on-year to 36,000 scans per second. Cybercriminals are increasingly targeting vulnerable digital assets such as Remote Desktop Protocol, IoT systems, and Session Initiation Protocols earlier in attack cycles. A dramatic 500% increase in logs from compromised systems has resulted in over 1.7 billion stolen credentials circulating on the dark web, fueling a 42% surge in credential-based targeted attacks.
https://apnews.com/article/3482b8467c81830012a9283fd6b5f529 - Microsoft has reported that adversaries such as Iran, North Korea, Russia, and China are starting to utilize generative AI for offensive cyber operations. The technology giant, in collaboration with OpenAI, detected and disrupted these malicious activities by shutting down the actors' accounts. Although the techniques used by these adversaries are in early stages and not particularly novel, it's important to publicize their use of large-language models (LLMs) to enhance their cyber capabilities. Examples of such malicious use include North Korea's Kimsuky group researching think tanks, Iran's Revolutionary Guard generating phishing emails, and Russia's Fancy Bear researching satellite and radar technologies.

Noah Fact Check Pro

The draft above was created using the information available at the time the story first emerged. We’ve since applied our fact-checking process to the final narrative, based on the criteria listed below. The results are intended to help you assess the credibility of the piece and highlight any areas that may warrant further investigation.

Freshness check

Score: 8

Notes: The narrative was published on October 27, 2025, and does not appear to be recycled content. While similar themes have been discussed in recent months, such as AI-driven cybersecurity threats and the weaponisation of AI by cybercriminals, this specific report offers new insights and data, justifying a high freshness score. ([techradar.com](https://www.techradar.com/pro/security/ai-driven-cybersecurity-threats-are-now-hitting-businesses-from-every-angle-heres-how-to-stay-safe?utm_source=openai))

Quotes check

Score: 9

Notes: The report includes direct quotes from reputable sources, such as Microsoft and Anthropic, which are not found in earlier publications. This suggests the content is original and not reused. No identical quotes appear in earlier material, and variations in wording are minimal, indicating authenticity.

Source reliability

Score: 7

Notes: The narrative originates from The AI Journal, a specialised publication focusing on artificial intelligence topics. While it is a niche source, it is not widely recognised as a major media outlet. The author, Jayde Dorland, is identified as an Intelligence Analyst at Silobreaker, a company known for its cyber threat intelligence services. However, the lack of broader recognition of the publication and author introduces some uncertainty regarding the source's reliability.

Plausability check

Score: 8

Notes: The claims made in the narrative align with recent reports on AI-driven cyber threats, such as the surge in AI-powered phishing attacks and the weaponisation of AI by cybercriminals. The data presented, including the 1,200% increase in phishing attempts since the emergence of generative AI, is consistent with findings from other reputable sources. The tone and language used are appropriate for the topic and region, and the structure is focused on the subject matter without excessive or off-topic detail.

Overall assessment

Verdict (FAIL, OPEN, PASS): PASS

Confidence (LOW, MEDIUM, HIGH): MEDIUM

Summary: The narrative presents original content with fresh insights into the weaponisation of AI by cybercriminals. While the source's reliability is somewhat uncertain due to its niche status, the information aligns with recent findings from other reputable sources. The plausibility of the claims is supported by consistent data and appropriate presentation. Therefore, the overall assessment is a pass, albeit with medium confidence due to the source's limited recognition.

Artificial Intelligence
Cybersecurity
Cyber Threats