AI for legal research: where it helps, where it gets lawyers sanctioned

AI for legal research can locate case law, statutes, and secondary authority with linked citations and draft a first pass in minutes, but even purpose-built legal tools still hallucinate, and courts have fined attorneys for filing cases that do not exist. This guide covers the real use cases, the architecture of a defensible legal agent, the documented accuracy risks, and why attorney review is a duty you cannot hand to software.

By Kanika Mathur, Head of Service Delivery

Reviewed by Resourcifi engineeringPublished May 15, 2026Updated May 15, 202611 min read

Key takeaways

The short version

Adoption is real and accelerating. Per Thomson Reuters, the share of legal organizations actively using generative AI rose from 14% in 2024 to 26% in 2025, and 78% of legal professionals expect it to become central to their work within five years.
The accuracy problem is documented rather than hypothetical. A peer-reviewed Stanford RegLab study found that leading commercial legal research tools still hallucinate roughly 17% to 34% of the time, so a tool being purpose-built for law does not make it safe.
Courts are sanctioning lawyers for it. In Mata v. Avianca (2023) attorneys were fined $5,000 for a brief citing six cases that did not exist, and the problem scaled into a 2025 wave of sanctions across federal courts.
Verification is a lawyer’s duty rather than the software’s job. ABA Formal Opinion 512 (2024) places non-delegable verification, candor, confidentiality, and supervision duties on the attorney; an agent can draft and cite, only a licensed lawyer can take responsibility for a filing.
A defensible build is retrieval-grounded and human-in-the-loop: connected to the firm’s documents and primary law, citing every assertion to a source, filtering privileged data at the boundary, and routing a fully cited draft to an attorney for sign-off.

Where legal AI adoption stands

Legal adoption of generative AI is accelerating fast: per the Thomson Reuters 2025 Generative AI in Professional Services Report, the share of legal organizations actively using generative AI rose from 14% in 2024 to 26% in 2025, and 78% of legal professionals expect it to become central to their work within five years.¹ The demand is coming from clients too, with 59% of corporate legal clients wanting their outside firms to use the technology.

Usage is becoming routine, well past the experimental stage. Nearly 70% of law-firm respondents who use generative AI report using it at least weekly, with the top reported tasks being document review at 77%, legal research at 74%, and document summarization at 74%.¹ Sentiment has flipped with it: in 2024 hesitancy led at 35%, while in 2025 the leading feelings were hopefulness at 28% and excitement at 27%.

The gap that matters for any build is readiness, and enthusiasm has run well ahead of it. Only 41% of organizations have established a governance policy, only 40% provide staff training, and only 20% measure ROI, while 71% of corporate clients do not even know whether their outside counsel uses the technology.¹ That readiness gap, more than raw adoption, is what a careful project has to close. This page supports the broader AI agents guide; here the focus is the legal angle and its specific duties of care.

Legal generative AI adoption and readiness

Active adoption nearly doubled in a year, but governance, training, and ROI measurement lag well behind. The readiness gap is the real work.

Data behind this chart
Metric (legal organizations)	Share
Actively using generative AI (2024)	14%
Actively using generative AI (2025)	26%
Have a generative AI governance policy	41%
Provide staff training	40%
Measure ROI	20%

Source: Thomson Reuters, 2025 Generative AI in Professional Services Report, via LawNext coverage (2025). Figures are reported survey results.

Use cases that actually fit legal work

The use cases that fit are agentic workflows where the agent plans, retrieves, drafts, and cites, then routes the result to a lawyer: contract review and redlining, first-draft drafting, legal research, due diligence, e-discovery, matter intake and triage, and compliance monitoring. The pattern that matters is plan, retrieve, draft, cite, and route for human review, which is a step beyond a one-shot chatbot prompt.

The recurring patterns map cleanly to where firms already spend time. The table below lays them out; each is delivered through our AI agent development work, grounded for the legal sector in our legal engineering practice. Read every row with the same caveat, that the agent proposes and a lawyer disposes.

AI agent use cases in legal work

Seven agentic patterns, framed as plan, retrieve, draft, cite, then route to a lawyer. None of them ends with the agent; each ends with attorney review.

AI agent use cases by legal function
Function	What the agent does	Human gate
Contract review and redlining	Reads contracts, flags missing or risky clauses, and compares against a firm playbook.	Lawyer accepts or rejects each redline.
Drafting	Generates first-draft memos, briefs, clauses, and routine correspondence in the firm’s templates.	Attorney verifies every assertion and citation.
Legal research	Locates case law, statutes, and secondary authority with linked citations.	Counsel confirms each authority exists and applies.
Due diligence	Reads data-room documents to surface liabilities, change-of-control, and assignment clauses.	Deal team reviews flagged items.
E-discovery	Scans large document sets for relevance and privilege and builds chronologies.	Review attorney validates privilege calls.
Intake and triage	Classifies matters, routes to the right team, and summarizes client submissions.	Lawyer takes the matter before advice is given.

Source: Thomson Reuters top use cases for legal (2025) for the prevalence of review, research, and summarization; remaining patterns are representative delivery patterns. Productivity claims are not independently audited.

How to build a defensible legal AI agent

You build a defensible legal AI agent by connecting it to the firm’s document systems and trusted legal databases, grounding it with retrieval-augmented generation over primary law and the firm’s own precedent, requiring an auditable citation on every assertion, and wrapping it in privilege filters, citation checks, audit logs, and a mandatory attorney sign-off. It is an orchestrated, retrieval-grounded, human-in-the-loop system, a world away from a chatbot pointed at the open web.

Work through it as a layered architecture.

System integrations. Connect to the document management system, contract repositories or contract lifecycle management, matter-management systems, and trusted legal databases, so the agent works from vetted institutional sources instead of the open internet.
RAG over authoritative corpora. Ground generation in primary law, meaning case law, statutes, and regulations, plus the firm’s own precedent, templates, and playbooks. Retrieval narrows the model to vetted sources instead of its parametric memory, which is the main lever for reducing hallucination.
Citations and grounding. Every material assertion carries an auditable citation back to the source document or authority. Grounding the output to retrieved text rather than the model’s recall is what makes a draft checkable, and checkability is the whole point in a filing.
Confidentiality and privilege filters. Apply privilege and confidentiality controls at the data boundary so client material is not exposed to an uncontrolled model. Under ABA Model Rule 1.6, putting client information into a self-learning tool can be improper disclosure, so data handling is a design constraint from the outset.⁵
Guardrails and human-in-the-loop. Add confidence thresholds, citation-existence and quote-verification checks, scope limits on autonomous actions, and full audit logs, ending in a mandatory attorney review-and-sign-off gate before anything is filed or sent. The agent proposes; a lawyer accepts, rejects, or refines.

The framing that keeps this defensible: the agent is built for verification, confidentiality, and supervision from day one, so a licensed attorney can stand behind the output. That is the work our AI agent development team does, grounded in the sector context our legal engineering practice brings.

How accurate is AI for legal research, and the sanctions risk

The defining risk of AI for legal research is hallucinated citations, and it is documented and sanctionable even in purpose-built legal tools. A peer-reviewed Stanford RegLab study found leading commercial legal research tools hallucinated roughly 17% to 34% of the time, and courts have sanctioned attorneys for filing AI-generated cases that do not exist, starting with Mata v. Avianca in 2023.³⁴ A tool being built for law does not make its output safe to file unverified.

Lead with the evidence before any reassurance. In Mata v. Avianca, Inc. (S.D.N.Y., 2023), plaintiff’s attorneys submitted a brief citing six cases that did not exist, generated by ChatGPT, and the court imposed a $5,000 sanction, stressing that the deeper failure was not catching the error and then standing behind the fake cases.⁴ The problem then scaled instead of fading: a research database tracking AI-hallucinated content in court filings logged hundreds of cases worldwide through late 2025, the majority decided that year, with sanctions commonly running from about $1,500 to $6,000 and escalating to mandatory training, bar referrals, and disqualification.²⁷

The Stanford finding is the part that should change how teams scope a build. The study, published in the Journal of Empirical Legal Studies, tested leading commercial tools and found Lexis+ AI answered about 65% of queries accurately while hallucinating more than 17%, and Westlaw AI-Assisted Research was accurate about 42% of the time while hallucinating more than 34%.³ The table below is the cautionary centerpiece of this page.

Sanctioned and audited: the accuracy reality

Documented court sanctions for AI-fabricated citations, plus the Stanford RegLab measurement that purpose-built legal tools still hallucinate. This is why every citation gets verified.

Documented AI-citation failures and a measured hallucination rate in legal tools
Case or study	What happened	Outcome
Mata v. Avianca, Inc. (S.D.N.Y., 2023)	Brief cited six cases that did not exist, generated by ChatGPT.	$5,000 sanction on the attorneys.
Mid Cent. Operating Eng’rs Fund v. Hoosiervac LLC (S.D. Ind., 2025)	Citations to non-existent cases in filings.	$6,000 fine.
Tercero v. Sacramento Logistics, LLC (E.D. Cal., 2025)	AI-fabricated citations.	$1,500 fine plus State Bar service.
Johnson v. Dunn (N.D. Ala., 2025)	Counsel relied on unverified AI output.	Disqualification and state bar referral.
Stanford RegLab study (2024 to 2025)	Audited leading commercial legal research tools across 202 queries.	Roughly 17% to 34% hallucination rate.

Sources: Mata v. Avianca court opinion (2023); Sterne Kessler 2025 sanctions year-in-review; Stanford RegLab and HAI, Hallucination-Free study (2024 to 2025). Hallucination rates are measured study results.

Why attorney review is non-delegable

Attorney review is non-delegable because ABA Formal Opinion 512 (2024) places the duties of competence, candor, confidentiality, and supervision on the lawyer, and lawyers cannot blindly rely on AI output. An agent can draft and cite, but only a licensed attorney can take professional responsibility for a filing, so every substantive AI-generated element must be reviewed and verified by a responsible lawyer before it is used.

The duties are specific. Under ABA Model Rule 1.1 as applied by Formal Opinion 512, a lawyer must understand a tool’s capabilities and failure modes, including hallucination and knowledge-cutoff limits.⁶ Rule 3.3 on candor to the tribunal means unverified AI citations cannot be filed; Rules 5.1 and 5.3 treat the tool as a non-lawyer assistant the firm must supervise; Rule 1.6 governs confidentiality, and Rule 1.5 means a client should not be billed for time the AI saved as if it had been done by hand.⁵

Put the three threads together and the conclusion is plain. Purpose-built tools still hallucinate, courts are actively sanctioning attorneys for unverified output, and the ABA rules make verification a duty the lawyer cannot hand to software. The honest position, and the one a defensible build is engineered around, is that an AI agent augments lawyers and never replaces attorney review and judgment. The agent does the repetitive drafting, research, and review; the lawyer supervises, verifies, and decides.

Frequently asked

AI agent for legal questions

What is an AI agent for legal work?

An AI agent for legal is software that executes multi-step legal workflows, researching, reviewing, drafting, and citing, by retrieving from a firm’s documents and primary law, then routing a fully cited draft to a lawyer for review. Unlike a basic chatbot, it plans and uses tools such as document systems, contract repositories, and case-law databases. It supports attorneys and preserves their judgment, and a licensed lawyer still signs off on every output.

Are AI agents safe for law firms to use?

They can be, with guardrails, but the risks are real and documented. Even purpose-built tools hallucinate roughly 17 to 34 percent of the time per a Stanford RegLab study, and courts have sanctioned lawyers for filing fabricated AI citations, as in Mata v. Avianca in 2023. Safe use requires retrieval grounding in vetted sources, citation verification, confidentiality-aware data handling, and mandatory attorney review under ABA Model Rules 1.1, 1.6, and 3.3.

Do AI agents replace lawyers?

No. ABA Formal Opinion 512 from 2024 makes verification a non-delegable lawyer duty, and only a licensed attorney can take professional responsibility for a filing. Agents handle the repetitive drafting, research, and review, while lawyers supervise, verify, and decide. The agent proposes and a lawyer disposes, which is also why these systems are built with a mandatory human sign-off gate.

How accurate are legal AI tools?

Accuracy varies and is well below 100 percent. A peer-reviewed Stanford RegLab study found leading legal research tools answered roughly 42 to 65 percent of queries accurately while still hallucinating about 17 to 34 percent of the time. That is why every AI-produced citation and assertion must be independently verified by a responsible attorney before it is used or filed.

How do you build an AI agent for a law firm?

Connect it to the firm’s document systems, contract repositories, and trusted legal databases, ground it with retrieval-augmented generation over case law and the firm’s own precedent, and require an auditable citation on every claim. Then wrap it in guardrails such as privilege filters, citation-existence checks, audit logs, and a human-in-the-loop sign-off. Build for verification, confidentiality, and supervision from day one instead of bolting them on later.

Kanika Mathur

Head of Service Delivery, Resourcifi

Kanika Mathur is Head of Service Delivery at Resourcifi, where her engineering pods build retrieval-grounded agents wired to a firm’s document systems and primary law, with citation verification and privilege-aware data handling. She has scoped the review gates and audit logging that keep a licensed attorney in the loop on every output, which is the lens this guide is written from. It never claims software can practice law.

Resourcifi on LinkedIn →