OpenAI has introduced Lockdown Mode to ChatGPT to prevent data theft resulting from prompt injection attacks.
OpenAI has started to implement a new security feature called Lockdown Mode for ChatGPT, aimed at preventing data theft through prompt injection attacks. This setting disables live web browsing, agent mode, deep research, image retrieval, Canvas networking, and file downloads. It is accessible to all logged-in users, including those on Free, Go, Plus, Pro, and self-serve ChatGPT Business plans.
Prompt injection continues to be a significant challenge for all large language models. This type of attack involves embedding harmful instructions within the content processed by the model, such as webpages or uploaded files. If the model executes those instructions, it could unintentionally send sensitive information to an attacker’s server.
While Lockdown Mode is effective at severing the pathways that attackers use to exfiltrate data, it does not prevent injections entirely. Malicious payloads embedded in cached web pages or uploaded PDFs can still affect the model’s behavior. The mode ensures that no network requests to external servers can be made, as live browsing is restricted, and image retrieval is disabled, cutting off pixel-based data channels.
OpenAI acknowledges that Lockdown Mode significantly mitigates the risk of data loss through prompt injection, but it cannot guarantee complete protection. Risks may still exist through enabled apps, unforeseen combinations of capabilities, or new techniques that may emerge.
However, the implementation of Lockdown Mode comes with considerable trade-offs. When enabled, ChatGPT loses many of its useful features, such as live browsing, which becomes limited to cached content, and the entire agent mode is removed. Deep research is also unavailable, making it clear that, as OpenAI notes, this mode is “not intended for everyone.”
This feature is introduced amid growing concerns around prompt injection attacks on AI agents. Security researchers have demonstrated vulnerabilities in agents from companies like Anthropic, Google, and Microsoft through their GitHub Actions integrations. Although all three offered bug bounties, they did not issue public advisories. The core issue lies in the fundamental challenge that large language models struggle to differentiate between data and instructions.
Lockdown Mode cannot be activated simultaneously with Developer Mode; turning one on disables the other. Additionally, OpenAI has introduced a separate session management tool that allows users to review active ChatGPT sessions and log out of specific devices if they suspect unauthorized access.
This feature reflects a practical acknowledgment by OpenAI that while it has not resolved the prompt injection issue, it is providing users with a way to minimize risks by sacrificing certain functionalities. For individuals working with sensitive data in ChatGPT, this trade-off may be justified. However, for others, the expanding ecosystem and increasing vulnerabilities heighten the risk.
Other articles
OpenAI has introduced Lockdown Mode to ChatGPT to prevent data theft resulting from prompt injection attacks.
ChatGPT's new Lockdown Mode turns off live browsing, agent mode, and deep research to prevent attackers from extracting data via prompt injection.
