OpenAI has released Lockdown Mode for ChatGPT, a security configuration that restricts the AI’s external communication capabilities to prevent the most common class of prompt-injection data exfiltration attack. When enabled, ChatGPT will not load external URLs provided in user prompts, will not render images from arbitrary remote sources, and will not make calls to third-party plugins or custom GPT actions that retrieve external content.
The feature is available for ChatGPT Enterprise and ChatGPT Edu accounts, enforceable organisation-wide by administrators through the admin console. Individual Plus and Team users can enable it per-session from the settings panel.
The Threat Lockdown Mode Addresses
Prompt-injection attacks against AI assistants exploit a fundamental characteristic of large language model systems: the model cannot reliably distinguish between instructions from the authorised user and instructions embedded in content it processes. When ChatGPT reads a document, browses a webpage, or processes an image, adversarial instructions embedded in that content can redirect the model’s behaviour.
The most damaging class of these attacks combines prompt injection with an exfiltration channel. An attacker who controls a webpage or document that a user will ask ChatGPT to process can embed instructions that cause ChatGPT to:
- Extract sensitive information from the conversation (email content, document text, API keys the user has shared)
- Construct a URL encoding that exfiltrated data
- Request that URL as an image or resource load — sending the data to an attacker-controlled server
This attack class has been demonstrated repeatedly in research settings and has been reported in real-world incidents involving users who asked ChatGPT to summarise emails containing embedded malicious content, or to analyse documents from unknown sources.
Lockdown Mode prevents the exfiltration step: by disabling ChatGPT’s ability to load external URLs or make external API calls, the attack chain is broken even if the prompt injection instruction is successfully embedded in content.
What Lockdown Mode Restricts
Disabled in Lockdown Mode:
- External URL loading from user prompts
- Image rendering from arbitrary remote URLs (including tracking pixels and exfiltration beacons)
- Third-party plugin API calls
- Custom GPT action calls that retrieve external content
- Code interpreter file exfiltration via network calls
Remains available in Lockdown Mode:
- Text generation and analysis
- Code generation (no execution)
- File analysis (PDF, documents) for text content
- ChatGPT’s built-in capabilities that do not require external network calls
- Browsing capability remains available but restricted to a curated set of approved domains (configurable by enterprise admins)
Enterprise Configuration
For ChatGPT Enterprise organisations, Lockdown Mode is enforced through the admin console under Settings → Security → Content and Connectivity Controls. The configuration allows:
- Global Lockdown Mode: All users in the organisation run in Lockdown Mode by default; individual users cannot override
- Group-level policies: Apply Lockdown Mode to specific user groups (e.g., finance, legal, HR) that handle sensitive documents while allowing broader access for other groups
- Approved browsing domains: Whitelist specific domains that ChatGPT may load in Lockdown Mode (e.g., internal documentation sites, specific trusted research sources)
Enterprise DLP integration: ChatGPT Enterprise’s DLP connectors (available for Microsoft Purview and Forcepoint) can be configured to flag conversations in which users share content classified as sensitive. Lockdown Mode reduces the risk that sensitive content shared with ChatGPT can be exfiltrated via prompt injection; DLP policies provide visibility into what content is being shared in the first place.
Limitations and Residual Risk
Lockdown Mode addresses the external exfiltration vector but does not prevent:
- In-context data leakage: If a user is deceived into asking ChatGPT to summarise a malicious document, the output of that summarisation may include attacker-crafted content that misleads the user
- Plugin architecture exceptions: Custom GPTs with pre-approved actions configured by the enterprise administrator can still make external calls — the security of those integrations depends on the action’s implementation
- User-driven exfiltration: Lockdown Mode does not prevent a user from voluntarily copying and pasting ChatGPT output to external systems
The feature represents a meaningful reduction in the AI prompt-injection attack surface, not a complete elimination. Enterprise security programmes should combine Lockdown Mode with user awareness training on AI risks and DLP policies that cover AI platforms as a data handling channel.
Share this article