What AI Agents Can Do Now: From Errands to Real Work

A realistic operations desk where an AI agent coordinates research notes, code changes, customer tickets, calendar actions, and analytics dashboards across several screens, clean modern office light, no readable text

The easiest way to misunderstand AI agents is to imagine one general robot clerk doing everything. That is not where the best uses are.

The useful agents are narrower. They live inside a workflow. They know which tools they can touch. They have a definition of done. They can show their work. They can stop at a gate when money, customer promises, production systems, legal language, or private data are involved.

Within those limits, agents can already do a surprising amount.

Research that ends with a usable answer

A normal search session leaves you with tabs. An agent can leave you with a brief.

It can search the web, read several sources, compare dates, summarize disagreements, pull out citations, and turn the result into a memo. The key improvement is not that the model has memorized everything. It is that the agent can go fetch current material, keep track of the question, and organize the answer around a decision.

This is useful for market scans, vendor comparisons, policy research, technical surveys, travel planning, and competitive monitoring. The work still needs source checking, especially when the stakes are high. But the first pass can move from hours to minutes.

Coding with a working memory

Software development is one of the clearest agent use cases because the work naturally has a loop. Inspect files. Understand the pattern. Make a change. Run tests. Read the failure. Fix the mistake. Explain the diff.

That loop maps directly to agent design. OpenAI’s 2026 Agents SDK update emphasized controlled workspaces, file inspection, command execution, code edits, and sandboxing. Microsoft highlighted GitHub Copilot’s move toward asynchronous coding agents in 2025. The pattern is simple: give the agent a repository, instructions, tools, and guardrails, then let it handle bounded engineering tasks.

The strongest agents are not replacing senior engineers. They are absorbing the time between intention and patch: finding the right files, making mechanical edits, drafting tests, and doing the first debug pass.

Customer operations

Customer support agents can read a ticket, search a knowledge base, check account status, draft a reply, and suggest the next action. Sales agents can qualify leads, enrich account notes, schedule follow-up, and prepare outreach. Service agents can triage complaints, pull policy details, and escalate cases that need a person.

Salesforce’s Agentforce pitch is built around this idea: agents that can analyze data, make decisions, and take action across customer workflows. The important part is not the brand name. It is the workflow shape. Customer operations contain many repeated decisions, lots of internal context, and clear handoffs.

Good deployments usually begin with low-risk actions: draft the reply, classify the issue, recommend a next step. Only later do they move toward autonomous action.

Office work between systems

A lot of work is not hard because each step is difficult. It is hard because the steps are scattered.

Find the document. Compare the spreadsheet. Update the CRM. Draft the email. Create the ticket. Ask finance for approval. Put the note in the project system. Remind the owner next week.

Agents are well suited to this glue work. Google framed Agentspace around enterprise search, knowledge, and action across silos. Microsoft reported broad use of Copilot Studio for agents and automations. McKinsey’s 2025 survey found many organizations experimenting with agents, but also found that most companies were still not scaling AI across the enterprise. That tension is the story of 2026: the technology is useful enough to test widely, but the operating model is still catching up.

Personal assistants with boundaries

Personal agents can plan a trip, compare products, draft a note, manage a reading list, prepare a meeting brief, or watch for changes in a topic. The safer tasks are informational. The risk rises when the agent can spend money, send messages, change accounts, delete data, or represent you to another person.

The right future for personal agents is not total delegation. It is staged authority.

An agent might first gather options. Then it might draft a plan. Then it might ask before booking. Over time, for low-risk recurring tasks, it may earn more freedom. “Buy the same dog food when the price drops below X” is different from “negotiate my lease.”

Computer use

Computer use is the bridge between neat APIs and messy reality.

Many important systems do not have a clean integration. They have a website, a form, a spreadsheet, or an old internal tool. Anthropic’s computer use capability showed how an agent could interact with a desktop environment by using screenshots, mouse movement, clicks, and typing. That is powerful because it lets agents operate software built for humans.

It is also fragile. Screens change. Buttons move. Popups appear. The agent may misread a UI. Computer use is best when paired with tight scopes, confirmation steps, logs, and fallback paths.

What agents should not do alone

Agents should not be given silent control over high-stakes work just because they can complete low-stakes work. They need gates around:

Payments and purchases
Legal commitments
Medical advice
Employment decisions
Security changes
Deleting or exposing data
Messages sent under a person’s name

The better question is not “Can the agent do it?” The better question is “What happens if it is wrong, and who catches that before harm occurs?”

Jump to another site

Culture

Create

Future

On this page

We found the best deals just for you

Research that ends with a usable answer

Coding with a working memory

Customer operations

Office work between systems

Personal assistants with boundaries

Computer use

What agents should not do alone

Sources

Turn agent lessons into a better review setup

JJ Ben-Joseph

On this page

We found the best deals just for you

Research that ends with a usable answer

Coding with a working memory

Customer operations

Office work between systems

Personal assistants with boundaries

Computer use

What agents should not do alone

Sources

Turn agent lessons into a better review setup

JJ Ben-Joseph

Related guidebooks

What AI Agents Are: The Moment Software Started Taking Initiative

AI Agents at Work: The New Shape of the Office

AI Agent Memory and Context: What to Remember, What to Forget