-
May 26, 2026
-
May 26, 2026

Claude Cowork is Anthropic’s enterprise-focused entry into the new wave of autonomous AI agents for end-users popularized by tools like OpenClaw and Hermes. Unlike assistants that just generate answers or summarize, Cowork is designed to operate. For example, the Computer Use feature allows it to interact with browsers, desktop apps, files, and enterprise tools directly on behalf of the user.
That changes the security conversation.
The question is no longer just "Can employees paste sensitive data into AI?" but "What happens when AI gains continuous operational access to enterprise environments?"
Anthropic clearly designed Cowork with security in mind: VM sandboxing, permission prompts, network controls, and enterprise administration capabilities provide a strong foundation. But these protections still require proper configuration and governance, and various Cowork capabilities introduce different levels of risk and visibility challenges.
Anthropic couldn’t have phrase it better themselves:
“Desktop extensions (MCPs) and plugins expand what Claude can do, but each one introduces new ways for attacks to reach Claude.”
Organizations that understand where the real risks are, and which features require additional controls, will be able to adopt agentic AI safely and move faster than their competitors.
We strongly recommend continuously reviewing Anthropic’s enterprise controls and applying clear governance guidelines before and after broad rollout.
Unlike developer-focused agents like Claude Code or Codex, Cowork is built for a much broader set of enterprise use cases and less technical users, which significantly expands both its capabilities and its potential risk surface.
Instead of focusing mainly on developer workflows, Cowork can interact with browsers, control desktop applications, work across files and documents, use enterprise tools and MCP integrations, and even operate the computer directly through features like Computer Use and Browser Use.
Several unique Cowork capabilities materially change the enterprise risk model:
Computer use
Computer use is arguably the most powerful and highest-risk capability. Instead of interacting through APIs alone, Cowork can directly navigate the computer by taking screenshots, clicking buttons, typing, and interacting with applications visually. This introduces several important implications:
Unlike other Cowork tools that require explicit permission prompts, computer use can operate more fluidly once enabled. For enterprises, this means visibility and governance become far more important than simple application allowlists.
Browser use
As part of its computer use capabilities, Cowork can leverage the user’s authenticated browser sessions to interact with SaaS platforms and enterprise applications. From a security perspective, this is important because the AI inherits the trust and permissions already granted to the user’s browser session. In practice, the AI may gain indirect access to:
The risk is not necessarily credential theft. The risk is operational misuse or unintended actions executed through an already-authenticated session, leading to potential data exposure, or interactions that might damage business reputation and outcomes.
Microsoft 365 apps add-ins
Cowork supports integrations with desktop applications like Excel, PowerPoint and Outlook. This is one of the clearest examples of how agentic AI differs from traditional copilots. Cowork can move context between applications as part of completing a task. For example:
While extremely useful, this also weakens traditional assumptions around data separation between applications.
MCPs, connectors and Skills
MCPs and plugins significantly expand what Cowork can do by connecting it to external systems and enterprise tooling. The challenge is that every integration becomes part of the AI trust boundary.
For example, integrations connected to GitHub, Jira, Slack, internal APIs, databases, or cloud infrastructure effectively extend the AI’s operational reach into those systems. This creates a new category of attack surface where prompt injections, malicious content, or unintended instructions can propagate through connected tools.
In addition, unofficial MCPs and community-provided skills can introduce supply chain risks, especially when organizations allow employees to install or connect unverified extensions without governance or review.
Mobile “Dispatch” (Beta) / “Remote-control”
Anthropic’s Dispatch capability allows users to interact with Cowork remotely from their phones while Cowork operates on the desktop machine.
Operationally, this is powerful. From a security standpoint, it means the mobile device effectively becomes a remote control for enterprise desktop resources, including:
This shifts some of the security focus away from the endpoint itself and toward identity security and mobile device trust.
Access to host resources
Cowork can interact with mounted files and folders to complete tasks. Instead of simply summarizing uploaded documents, it can analyze project directories, manipulate files, and execute workflows using local resources.
The good news is that this risk is largely mitigated by Anthropic’s sandbox architecture. Cowork operates inside an isolated VM environment that separates the agent from the host filesystem and only grants access to explicitly mounted project files and directories. This significantly reduces exposure to sensitive host data, limits persistence opportunities, and constrains lateral movement across the endpoint.
Capabilities per subscription tier and Claude agent
Some of the above capabilities are only available in specific Cowork tiers, while others are also supported in developer-focused tools like Claude Code with a different “flavor” like remote-control in claude code and “Dispatch” in Cowork , which are often overlooked by security teams. Additionally, the capabilities and features depend on their phased rollout - some were available to teams only in beta, and became generally available a few months later to all tiers. Our recommendation is to review the admin settings and make sure you enable/disable the capabilities based on your need and risk appetite.
To Anthropic’s credit, Cowork includes several security mechanisms designed to reduce enterprise risk. These controls meaningfully reduce many traditional risks associated with autonomous software, but it is important to understand both where they help and where their protections stop.
VM sandbox isolation
One of the strongest security controls in Cowork is its VM sandbox architecture. Instead of operating directly on the host operating system, Cowork performs actions inside an isolated virtualized environment with access limited to explicitly mounted project files and directories. This significantly reduces exposure to the full host filesystem, isolates credentials outside mounted paths, and lowers the risk of persistence or lateral movement across the endpoint. In practice, Cowork behaves less like unrestricted malware and more like a constrained workload operating inside a controlled environment, which is an important distinction for enterprise security teams.
Network egress controls
Anthropic also provides configurable network egress controls that can restrict outbound access from the sandboxed environment, helping enterprises limit accessible destinations, reduce exfiltration paths, and apply corporate networking policies.
However, organizations should avoid assuming these controls fully contain the agent. Certain Cowork capabilities can still interact with external systems through browser-based actions, MCP integrations, connected applications, and other approved tools, extending the AI’s operational reach beyond what traditional egress controls alone can govern.
AI guardrails and permission controls
Anthropic also built safeguards directly into Claude’s behavior and harness layer, including model training against malicious instructions, classifiers for suspicious content, and permission prompts for sensitive actions like file deletion or application access. These controls provide an important security baseline and add useful friction to high-risk operations.
However, they should not be treated as a complete security solution. Prompt injection is ultimately a cat-and-mouse game, with attackers constantly finding new ways to bypass protections and manipulate agent behavior. Enterprises should assume some attacks will eventually succeed and build additional governance, monitoring, and access controls accordingly.
Example enterprise workflow
Consider a common Cowork workflow: an employee asks Claude to prepare a quarterly business presentation. Cowork may retrieve data from Excel, access authenticated SaaS dashboards through Browser Use, collect Jira metrics through MCP integrations, generate charts, and automatically update PowerPoint slides.
From a productivity perspective, this is extremely powerful. From a security perspective, it means the AI agent may operate across multiple enterprise systems, browser sessions, applications, and data sources within a single workflow.
The important nuance is that Cowork’s security model focuses primarily on containment and risk reduction, not on complete prevention. Features like sandboxing and permission controls significantly reduce traditional threats such as host compromise, credential theft, persistence, and lateral movement.
But agentic AI introduces a different category of risk that these protections do not fully address, including:
Anthropic itself acknowledges part of this challenge:
“Desktop extensions (MCPs) and plugins expand what Claude can do, but each one introduces new ways for attacks to reach Claude.” Source
There are also several governance areas that would require extraneous organizational processes today:
As a result, the security challenge shifts from preventing malware-like compromise to governing AI operations. Enterprises need visibility into what the AI can access, what actions it can perform, how those actions are monitored, and whether employees understand the operational implications of using autonomous agents. That is the core security transition introduced by agentic AI. Now let’s flesh out the best practices for implementing it.
Anthropic provides several enterprise controls that organizations should configure before broad Cowork rollout. According to Anthropic’s enterprise administration guidance, security teams can:
Anthropic also provides governance around plugins, desktop extensions, connectors, and observability features, helping organizations reduce unnecessary AI access and operational blast radius.
Capabilities such as Computer Use, Browser Use, Dispatch, and MCP integrations should receive additional review before rollout because they significantly expand the AI agent’s operational reach inside enterprise environments.
Enterprise controls are only effective if employees actually use the managed enterprise tenant. Without tenant restrictions, employees can still use personal Claude tenants to access corporate data outside organizational governance and monitoring.
Anthropic supports network-level tenant restrictions, allowing organizations to limit Claude access to approved enterprise tenants only. Security teams should strongly consider enforcing these controls on managed corporate networks and devices.
Traditional endpoint visibility may not fully capture Cowork activity because many operations occur inside the sandboxed environment. Anthropic supports exporting Cowork telemetry through OTEL integrations into SIEM or observability platforms.
Organizations should monitor:
If organizations choose to enable higher-risk capabilities such as Computer Use and the Chrome integration, they should create dedicated detections for AI-driven browser automation, unusual cross-application workflows, high-risk MCP usage, mass file operations, and repeated permission approvals, and integrate them into existing SIEM, DLP, identity, and endpoint monitoring pipelines.
Because many agentic AI risks stem from legitimate but overprivileged access, organizations should also apply least privilege principles to Browser Use, MCPs, plugins, and connected enterprise systems, while defining clear governance around cross-application data movement and sensitive workflows.
While Cowork has broader enterprise adoption and more operational integrations, developer-focused agents like Claude Code and Codex are often overlooked by security teams.
These tools typically operate directly inside developer environments, source repositories, terminals, and cloud workflows, sometimes with fewer isolation controls than Cowork’s sandboxed model. In practice, they may have access to production credentials, infrastructure tooling, proprietary source code, and sensitive development workflows.
Organizations should apply the same governance principles across all agentic AI tools, not just enterprise productivity-focused assistants like Cowork, and developer workstations especially deserve a deeper approach because the potential blast radius is larger.
Claude Cowork represents a major shift in enterprise AI. Organizations are no longer deploying tools that simply answer questions - they are deploying systems that can take actions across browsers, applications, enterprise tools, and operational workflows.
Anthropic provides a good security foundation through sandboxing, RBAC, permission controls, plugin governance, and observability integrations. But built-in protections alone are neither sufficient on their own, nor guaranteed to work when not configured and maintained properly and continuously. Enterprises still need governance around what AI agents can access, which capabilities are enabled, which integrations are approved, and how autonomous actions are monitored.
Traditional assistants generate content. Cowork takes actions - driving browsers, controlling desktop apps, and using enterprise tools on the employee's behalf. The question shifts from "what data goes in?" to "what can this agent do once it's inside our environment?"
Computer Use. Unlike most other features, it can operate fluidly once enabled, see anything on screen, and work across applications with no native AI integration. Existing approval boundaries built around APIs and SSO scopes can quietly weaken as a result.
Not credential theft - misuse of trust the user has already established. The agent inherits whatever the browser session can reach: SaaS platforms, dashboards, cloud consoles. A prompt injection or misread instruction can trigger real actions with no obvious authentication anomaly.
It substantially reduces traditional endpoint risks like persistence, lateral movement, and broad filesystem access. But sandboxing addresses containment, not agentic risks like cross-application data movement, browser session misuse, or connector abuse. Those need separate governance.
Every integration extends the agent's trust boundary into the connected system. Prompt injections in content from those tools can propagate back through the agent. Unofficial MCPs and community Skills add supply-chain risk, especially when employees can install them without review.
It shifts part of the perimeter from the desktop endpoint to the mobile device and identity layer. The phone becomes a remote control for browser sessions, files, plugins, and MCP tools on the desktop. Mobile device trust and identity controls now matter as much as endpoint protection.
Anthropic covers containment - sandboxing, egress controls, permission prompts, model-level guardrails. They don't fully solve data exposure, overprivileged workflows, or trust delegation. There's also no native workflow for Skill review, connector policies are org-wide rather than per-team, and sharing controls lack granularity. Treat the built-ins as a baseline and layer your own visibility and approval processes on top.