Skip to main navigation Skip to search Skip to main content

You Can't Steal Nothing: Mitigating Prompt Leakages in LLMs via System Vectors

  • Bochuan Cao
  • , Changjiang Li
  • , Yuanpu Cao
  • , Yameng Ge
  • , Ting Wang
  • , Jinghui Chen
  • Pennsylvania State University
  • Palo Alto Networks, Inc.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Large language models (LLMs) have been widely adopted across various applications, leveraging customized system prompts for diverse tasks. Facing potential system prompt leakage risks, model developers have implemented strategies to prevent leakage, primarily by disabling LLMs from repeating their context when encountering known attack patterns. However, it remains vulnerable to new and unforeseen prompt-leaking techniques. In this paper, we first introduce a simple yet effective prompt leaking attack to reveal such risks. Our attack is capable of extracting system prompts from various LLM-based application, even from SOTA LLM models such as GPT-4o or Claude 3.5 Sonnet. Our findings further inspire us to search for a fundamental solution to the problems by having no system prompt in the context. To this end, we propose SysVec, a novel method that encodes system prompts as internal representation vectors rather than raw text. By doing so, SysVec minimizes the risk of unauthorized disclosure while preserving the LLM's core language capabilities. Remarkably, this approach not only enhances security but also improves the model's general instruction-following abilities. Experimental results demonstrate that SysVec effectively mitigates prompt leakage attacks, preserves the LLM's functional integrity, and helps alleviate the forgetting issue in long-context scenarios.

Original languageEnglish
Title of host publicationCCS 2025 - Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security
PublisherAssociation for Computing Machinery, Inc
Pages4423-4437
Number of pages15
ISBN (Electronic)9798400715259
DOIs
StatePublished - Nov 22 2025
Event32nd ACM SIGSAC Conference on Computer and Communications Security, CCS 2025 - Taipei, Taiwan, Province of China
Duration: Oct 13 2025Oct 17 2025

Publication series

NameCCS 2025 - Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security

Conference

Conference32nd ACM SIGSAC Conference on Computer and Communications Security, CCS 2025
Country/TerritoryTaiwan, Province of China
CityTaipei
Period10/13/2510/17/25

Keywords

  • AI Security
  • Large Language Models
  • Prompt Leaking Attack

Fingerprint

Dive into the research topics of 'You Can't Steal Nothing: Mitigating Prompt Leakages in LLMs via System Vectors'. Together they form a unique fingerprint.

Cite this