Beyond the Hype: Unleashing AI's True Power in the Cyber Trenches
AI. The word itself conjures images of everything from futuristic innovation to dystopian dread. We see it transforming industries, generating mind-bending art, and, yes, even powering the complex world of cryptocurrencies. But amidst the buzz, a critical question looms: what happens when this incredible power is weaponized, and how can we wield it as our strongest shield against the escalating tide of cyber threats?
Welcome to the cutting edge of cybersecurity, where the age-old battle between digital defenders (blue teamers) and cunning attackers (red teamers) is being radically redefined by artificial intelligence. To truly understand this dynamic shift, we recently sat down with Tomas Roccia, a Senior Threat Researcher at Microsoft, whose 15 years in the security industry have given him a unique vantage point on AI's fascinating – and sometimes frightening – dual role in cyber defense. From his early days in IT support to leading advanced threat research teams at McAfee and now at Microsoft's MSEC AI team, Tomas brings a wealth of experience, focusing on how AI can be applied to threat investigations, intelligence, and malware analysis.
The AI Paradox: A Double-Edged Sword for Digital Defenders
It’s impossible to ignore AI's "dual edge". The headlines are filled with concerns about AI being weaponized by malicious actors, opening new frontiers in cybercrime. Imagine deepfakes so convincing they can bypass multi-factor authentication, or novel malware variants generated on the fly. Attackers are even leveraging clever prompt engineering to create zero-day exploits, effectively bypassing the need for years of traditional programming skill. The rapid adoption of AI by organizations also significantly increases the enterprise attack surface.
When technical defenses become robust – think regular patching and widespread Multi-Factor Authentication (MFA) – attackers don't simply give up. Instead, they often pivot to the weakest link: human vulnerabilities. Sophisticated social engineering, like "ClickFix" scams that walk users through seemingly innocuous but dangerous keystrokes (Win+R, Ctrl+V, Enter), exploits this human element to gain insider access, establish remote connections, or facilitate lateral movement within an enterprise. This means the threat landscape isn't just evolving; it's becoming more insidious and harder to predict.
Unleashing AI as Your Ultimate Defender: Beyond ChatGPT
The good news? AI is not just a weapon for the adversary; it's an invaluable "force multiplier" for defenders. What once took security teams weeks of painstaking manual work, AI can now help achieve in a fraction of the time. Tomas shared a compelling example from his own experience: an investigation into a data breach by a ransomware gang. By creating a small AI agent, he was able to analyze the leaked data, generate statistical reports on users and discussions, and extract critical indicators of compromise (IOCs) like IP addresses and wallet addresses. This entire process, which would have taken him a full week of manual coding just three or four years ago, was completed in about an hour with the AI agent. While the initial report might not have been perfect, it was "good enough" to allow him to dive deeper into critical details.
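The IOC-extraction step an agent automates can be sketched in a few lines. This is an illustrative stand-in, not Tomas's actual agent: the patterns below are simplified examples, and the sample leak text is made up.

```python
import re

# Illustrative IOC patterns (simplified, not exhaustive).
IOC_PATTERNS = {
    "ipv4": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
    "btc_wallet": re.compile(r"\b(?:bc1|[13])[a-zA-Z0-9]{25,39}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
}

def extract_iocs(text: str) -> dict:
    """Return every pattern match found in the text, deduplicated and sorted."""
    return {name: sorted(set(rx.findall(text))) for name, rx in IOC_PATTERNS.items()}

# Hypothetical snippet of leaked chat data:
leak = "Contact ops@evil.example via 203.0.113.7, pay bc1qar0srrr7xfkvy5l643lydnw9re59gtzzwf5mdq"
print(extract_iocs(leak))
```

In a real investigation the extracted indicators would then be enriched against threat intelligence feeds, which is where the agent saves the bulk of the manual effort.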
However, leveraging AI for reliable security results isn't as simple as casually querying a general-purpose AI like ChatGPT. Large Language Models (LLMs) are not deterministic, meaning their outputs can vary each time you ask the same question. To achieve the reliability required for cybersecurity, defenders must employ advanced techniques:
• Leveraging External Tools: Integrating LLMs with specialized external security tools can enhance accuracy.
• Validating LLM Outputs: Always cross-reference and verify the information generated by an LLM.
• Retrieval Augmented Generation (RAG): This powerful technique connects an LLM to a curated knowledge base, grounding generated responses in factual, validated information from that source. Checking retrieved data against LLM-generated content makes the system more trustworthy.
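The RAG pattern above can be sketched minimally. In this toy version, token overlap stands in for a real embedding model and `answer()` stands in for an LLM call; the knowledge-base entries are invented for illustration.

```python
# Toy RAG pipeline: retrieve grounding text first, then generate from it.
KNOWLEDGE_BASE = [
    "CVE-2024-0001 affects ExampleServer 2.x; patch released in March.",
    "Mixer services blend stolen funds with legitimate transactions.",
    "MFA should be enforced for all remote access accounts.",
]

def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    # Stand-in for embedding similarity: score docs by shared tokens.
    q = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(q & set(d.lower().split())), reverse=True)
    return scored[:k]

def answer(query: str) -> str:
    context = retrieve(query, KNOWLEDGE_BASE)
    # A real system would pass `context` to the LLM and validate the model's
    # output against the retrieved source before returning it.
    return f"Based on: {context[0]}"

print(answer("which mixer blends stolen funds"))
```

The key design point is that generation is constrained to retrieved, validated material, which is what tames the non-determinism the bullet list warns about.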
Microsoft is at the forefront of this defensive innovation with tools like Microsoft Defender and Copilot. Copilot, for instance, has the potential to supercharge productivity by drafting proposals, analyzing historical data, and more. Yet the widespread adoption of such powerful AI tools also raises crucial questions for CISOs around data access and segmentation. How do you prevent an intern from reaching sensitive financial data simply by asking Copilot a well-crafted question that sidesteps traditional security controls?
Nova: Your AI Firewall Against Adversarial Prompts
This is precisely where Tomas Roccia's innovative work truly shines. He introduced Nova, an open-source framework he developed in his spare time, a testament to his dedication to the security community. Nova acts as an "AI firewall," effectively a shim positioned between the user and the LLM system. Its core function is to scan prompts before they reach the AI system, intercepting and detecting adversarial intent.
Nova is designed to catch anything potentially harmful, such as requests for sensitive data (e.g., "give me the last password from this document"), attempts to generate misinformation, or clever jailbreaks like "DAN mode" prompts. What makes Nova particularly powerful and flexible is its detection mechanism:
• Keyword Matching: Rules can match prompts based on specific keywords.
• Semantic Meaning: It can detect prompts that are similar in meaning to a defined malicious pattern, even if the exact wording is different.
• LLM as a Judge: Nova can even use another LLM to evaluate whether a prompt meets specific detection criteria.
This flexibility allows threat analysts worldwide to build their own custom logic, using combinations of keywords, semantic matching, or LLM-based judgments (AND or OR conditions). Because Nova is open-source, it can be updated dynamically as the community discovers new adversarial techniques, making it a powerful, collaborative defense tool that continually gets better. Nova even won the community prize at the Sense AI hackathon.
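A rule engine of this shape can be sketched as follows. This is loosely inspired by Nova's keyword/semantic/LLM-judge layers but the rule structure is invented for illustration, not Nova's actual syntax; Jaccard token overlap stands in for real semantic similarity, and the LLM-judge layer is omitted.

```python
from dataclasses import dataclass, field

def keyword_match(prompt: str, keywords: list[str]) -> bool:
    p = prompt.lower()
    return any(k in p for k in keywords)

def semantic_match(prompt: str, pattern: str, threshold: float = 0.3) -> bool:
    # Stand-in for embedding similarity: Jaccard overlap of token sets.
    a, b = set(prompt.lower().split()), set(pattern.lower().split())
    return len(a & b) / len(a | b) >= threshold

@dataclass
class Rule:
    name: str
    keywords: list[str] = field(default_factory=list)
    pattern: str = ""
    mode: str = "OR"  # combine the layers with AND or OR

    def matches(self, prompt: str) -> bool:
        checks = []
        if self.keywords:
            checks.append(keyword_match(prompt, self.keywords))
        if self.pattern:
            checks.append(semantic_match(prompt, self.pattern))
        return all(checks) if self.mode == "AND" else any(checks)

rule = Rule("credential_theft",
            keywords=["password", "secret key"],
            pattern="give me the last password from this document")

print(rule.matches("please give me the last password from this document"))  # True: flagged
print(rule.matches("summarize this meeting transcript"))                    # False: allowed
```

The AND/OR combinator is what lets analysts tune precision: a strict AND rule fires only when both the keyword and the semantic layer agree.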
Following the Digital Breadcrumbs: AI in Cryptocurrency Investigations
Tomas also shared fascinating insights from his Defcon talk, "Where's My Crypto Dude," which centered on a sophisticated Bybit case study. In February 2025, North Korean (DPRK) threat actors breached the Bybit exchange, stealing an astonishing $1.4 billion worth of Ethereum. This wasn't a smash-and-grab; it was a highly sophisticated, persistent "low and slow" attack. The adversaries subtly altered code within a Docker instance, allowing them to redirect funds over time without immediate detection. Users thought their funds were being securely transferred when they were, in fact, being diverted to illicit addresses.
Tracking stolen cryptocurrency on a public blockchain might sound straightforward, given the transparency of the ledger. In practice it's incredibly complex due to the sheer volume of transactions and the use of obfuscation tools like mixers. Mixers are systems that blend stolen funds with many other legitimate transactions, making it extremely difficult to trace the money's origin and destination. Imagine a hundred bank robbers throwing all their money (including marked bills) onto a table, mixing it, and then each taking back their correct amount; it becomes almost impossible for a single officer to find the marked bills.
Despite these formidable challenges, Tomas demonstrated how an AI agent could be leveraged to track the stolen funds. His agent was able to retrieve information directly from the blockchain, enrich that information, and then identify patterns and known suspicious wallets. While the immense data volume can still overwhelm even AI systems (Tomas noted he couldn't feed the entire blockchain into his agent's context window), the agent significantly speeds up the initial investigation, provides an invaluable "assistant," and helps human analysts understand the complex money laundering schemes used by attackers.
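The core of such fund tracing is a graph walk over outgoing transfers, flagging any wallet that appears on a known-suspicious list. The sketch below is a hypothetical illustration of that idea; the addresses, the transaction graph, and the suspicious-wallet list are all made up (a real agent would pull this data from blockchain APIs and threat intel feeds).

```python
from collections import deque

# Hypothetical transaction graph: wallet -> wallets it sent funds to.
TX_GRAPH = {
    "victim": ["hop1", "hop2"],
    "hop1": ["mixer_in"],
    "hop2": ["mixer_in"],
    "mixer_in": ["cashout1", "cashout2"],
}
# Invented example of a curated suspicious-address list.
KNOWN_SUSPICIOUS = {"mixer_in", "cashout2"}

def trace(start: str) -> list[str]:
    """BFS over outgoing transfers, reporting known-suspicious wallets reached."""
    seen, hits, queue = {start}, [], deque([start])
    while queue:
        wallet = queue.popleft()
        if wallet in KNOWN_SUSPICIOUS:
            hits.append(wallet)
        for nxt in TX_GRAPH.get(wallet, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return hits

print(trace("victim"))  # ['mixer_in', 'cashout2']
```

This also makes the context-window problem concrete: the graph here has five nodes, while the real chain has billions of transactions, so the agent must fetch and enrich small neighborhoods on demand rather than ingest the ledger wholesale.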
Essential Recommendations for CISOs in the AI Era
For security executives navigating this rapidly changing landscape, Tomas offers critical, actionable advice that can fortify your organization:
1. Prioritize and Reinforce Basic Security Hygiene: Before integrating any new AI systems, ensure your fundamental security measures are impeccably robust. AI systems operate on the same underlying infrastructure as traditional systems, so essentials like regular patching, robust Multi-Factor Authentication (MFA) for everyone, and strong access controls remain paramount. A weak foundation will compromise even the most advanced AI defenses.
◦ CISO Action: Conduct a comprehensive audit of your current security posture. Identify and remediate any gaps in foundational security controls before embarking on significant AI integration projects. Emphasize that AI is not a band-aid for existing security vulnerabilities.
2. Strategic and Incremental AI Adoption: Don't treat AI as a magic bullet that will solve every problem. Instead, carefully identify where AI systems are truly relevant and beneficial for your organization, aligning them with specific business needs and security challenges.
◦ CISO Action: Develop a clear AI security strategy that outlines specific use cases, potential benefits, and anticipated risks. Implement AI solutions incrementally, building them "piece by piece" to thoroughly understand the new risks and attack surfaces they introduce. This allows for continuous learning and adaptation.
3. Implement AI-Specific Guardrails and Firewalls: Recognize that interacting with AI via natural language increases the attack surface, requiring new types of security controls. Tools like Tomas's Nova framework serve as an "AI firewall," intercepting and scanning prompts for adversarial intent before they reach your LLMs. This can prevent malicious prompt injection, data exfiltration attempts, and the generation of misinformation.
◦ CISO Action: Explore open-source AI firewall solutions like Nova or commercial equivalents. Mandate their use as a shim between users and LLMs to enforce policy and detect malicious prompts. Invest in understanding prompt engineering risks and train your teams to recognize and report suspicious AI interactions.
4. Emphasize Data Segmentation and Access Control for AI Tools: As AI tools like Copilot become integrated into workflows, they will have unprecedented access to organizational data. It is critical to ensure that data access is strictly segmented based on user roles and permissions. An intern should not be able to query Copilot for sensitive financial data simply because the LLM could access it.
◦ CISO Action: Work closely with IT and development teams to implement robust data governance and access control policies for all AI systems. Design your AI deployments with security and privacy by design, ensuring that LLMs only access data that users are explicitly authorized to view.
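One concrete way to enforce this is to filter the candidate documents by the caller's role before anything enters the LLM's context. The sketch below is a minimal illustration of that pattern; the roles, document titles, and ACLs are invented, and a real deployment would integrate with your identity provider and document store.

```python
# Permission-aware retrieval: filter by role BEFORE building the prompt context.
DOCUMENTS = [
    {"title": "Q3 financials", "acl": {"finance", "executive"}},
    {"title": "Onboarding guide", "acl": {"everyone"}},
]

def retrievable(role: str) -> list[str]:
    """Only documents the role is allowed to see may enter the LLM context."""
    return [d["title"] for d in DOCUMENTS
            if role in d["acl"] or "everyone" in d["acl"]]

print(retrievable("intern"))   # ['Onboarding guide']
print(retrievable("finance"))  # ['Q3 financials', 'Onboarding guide']
```

Enforcing the check at retrieval time, rather than trusting the model to refuse, means no cleverly crafted prompt can surface data the user was never authorized to read.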
5. Cultivate Advanced Threat Intelligence and Vigilance: Today's attackers are incredibly sophisticated and highly motivated, especially when state-backed, as seen in the DPRK's Bybit attack. They employ persistent, "low and slow" tactics, capable of altering core infrastructure like Docker instances to achieve their goals.
◦ CISO Action: Invest in advanced threat intelligence capabilities, particularly those focusing on state-sponsored actors and sophisticated cybercrime groups. Foster a culture of continuous vigilance within your security teams, emphasizing the importance of detecting subtle anomalies and persistent threats.
6. Invest in AI-Assisted Investigation and Analysis: While large-scale data (like entire blockchains) can overwhelm even AI systems, AI agents significantly speed up investigations and help analysts understand complex patterns, such as money laundering schemes. This "force multiplier" effect is crucial for rapidly responding to breaches.
◦ CISO Action: Explore and pilot AI-powered tools for security operations, incident response, and threat hunting. Provide training for your security analysts on how to effectively leverage AI agents to automate data collection, analysis, and pattern recognition, thereby freeing them up for more strategic, in-depth work.
The Future is Now: Adapt and Thrive
The security industry is undergoing a profound shift in how we process data, analyze information, and investigate breaches. It's crucial for security professionals to not just understand, but actively embrace and leverage AI effectively in their own work. This continuous evolution is the defining characteristic of cybersecurity, and those who adapt will be the ones to thrive.
As Tomas Roccia wisely put it, "time is changing and I think for us it's exciting... you have to adapt and evolve". The future of digital defense is here, and it's inextricably linked with AI – a force that demands both our respect and our strategic mastery.
--------------------------------------------------------------------------------
Special thanks to Tomas Roccia, Senior Threat Researcher at Microsoft, for sharing his invaluable expertise and insights on this critical topic.

