AI for legal research: where it helps, where it gets lawyers sanctioned
AI for legal research can locate case law, statutes, and secondary authority with linked citations and draft a first pass in minutes, but even purpose-built legal tools still hallucinate, and courts have fined attorneys for filing cases that do not exist. This guide covers the real use cases, the architecture of a defensible legal agent, the documented accuracy risks, and why attorney review is a duty you cannot hand to software.

The short version
- Adoption is real and accelerating. Per Thomson Reuters, the share of legal organizations actively using generative AI rose from 14% in 2024 to 26% in 2025, and 78% of legal professionals expect it to become central to their work within five years.
- The accuracy problem is documented rather than hypothetical. A peer-reviewed Stanford RegLab study found that leading commercial legal research tools still hallucinate roughly 17% to 34% of the time, so a tool being purpose-built for law does not make it safe.
- Courts are sanctioning lawyers for it. In Mata v. Avianca (2023) attorneys were fined $5,000 for a brief citing six cases that did not exist, and the problem scaled into a 2025 wave of sanctions across federal courts.
- Verification is a lawyer’s duty rather than the software’s job. ABA Formal Opinion 512 (2024) places non-delegable verification, candor, confidentiality, and supervision duties on the attorney; an agent can draft and cite, only a licensed lawyer can take responsibility for a filing.
- A defensible build is retrieval-grounded and human-in-the-loop: connected to the firm’s documents and primary law, citing every assertion to a source, filtering privileged data at the boundary, and routing a fully cited draft to an attorney for sign-off.
Where legal AI adoption stands
Legal adoption of generative AI is accelerating fast: per the Thomson Reuters 2025 Generative AI in Professional Services Report, the share of legal organizations actively using generative AI rose from 14% in 2024 to 26% in 2025, and 78% of legal professionals expect it to become central to their work within five years.1 The demand is coming from clients too, with 59% of corporate legal clients wanting their outside firms to use the technology.
Usage is becoming routine, well past the experimental stage. Nearly 70% of law-firm respondents who use generative AI report using it at least weekly, with the top reported tasks being document review at 77%, legal research at 74%, and document summarization at 74%.1 Sentiment has flipped with it: in 2024 hesitancy led at 35%, while in 2025 the leading feelings were hopefulness at 28% and excitement at 27%.
The gap that matters for any build is readiness, and enthusiasm has run well ahead of it. Only 41% of organizations have established a governance policy, only 40% provide staff training, and only 20% measure ROI, while 71% of corporate clients do not even know whether their outside counsel uses the technology.1 That readiness gap, more than raw adoption, is what a careful project has to close. This page supports the broader AI agents guide; here the focus is the legal angle and its specific duties of care.
| Metric (legal organizations) | Share |
|---|---|
| Actively using generative AI (2024) | 14% |
| Actively using generative AI (2025) | 26% |
| Have a generative AI governance policy | 41% |
| Provide staff training | 40% |
| Measure ROI | 20% |
Use cases that actually fit legal work
The use cases that fit are agentic workflows where the agent plans, retrieves, drafts, and cites, then routes the result to a lawyer: contract review and redlining, first-draft drafting, legal research, due diligence, e-discovery, matter intake and triage, and compliance monitoring. The pattern that matters is plan, retrieve, draft, cite, and route for human review, which is a step beyond a one-shot chatbot prompt.
The recurring patterns map cleanly to where firms already spend time. The table below lays them out; each is delivered through our AI agent development work, grounded for the legal sector in our legal engineering practice. Read every row with the same caveat, that the agent proposes and a lawyer disposes.
| Function | What the agent does | Human gate |
|---|---|---|
| Contract review and redlining | Reads contracts, flags missing or risky clauses, and compares against a firm playbook. | Lawyer accepts or rejects each redline. |
| Drafting | Generates first-draft memos, briefs, clauses, and routine correspondence in the firm’s templates. | Attorney verifies every assertion and citation. |
| Legal research | Locates case law, statutes, and secondary authority with linked citations. | Counsel confirms each authority exists and applies. |
| Due diligence | Reads data-room documents to surface liabilities, change-of-control, and assignment clauses. | Deal team reviews flagged items. |
| E-discovery | Scans large document sets for relevance and privilege and builds chronologies. | Review attorney validates privilege calls. |
| Intake and triage | Classifies matters, routes to the right team, and summarizes client submissions. | Lawyer takes the matter before advice is given. |
How to build a defensible legal AI agent
You build a defensible legal AI agent by connecting it to the firm’s document systems and trusted legal databases, grounding it with retrieval-augmented generation over primary law and the firm’s own precedent, requiring an auditable citation on every assertion, and wrapping it in privilege filters, citation checks, audit logs, and a mandatory attorney sign-off. It is an orchestrated, retrieval-grounded, human-in-the-loop system, a world away from a chatbot pointed at the open web.
Work through it as a layered architecture.
- System integrations. Connect to the document management system, contract repositories or contract lifecycle management, matter-management systems, and trusted legal databases, so the agent works from vetted institutional sources instead of the open internet.
- RAG over authoritative corpora. Ground generation in primary law, meaning case law, statutes, and regulations, plus the firm’s own precedent, templates, and playbooks. Retrieval narrows the model to vetted sources instead of its parametric memory, which is the main lever for reducing hallucination.
- Citations and grounding. Every material assertion carries an auditable citation back to the source document or authority. Grounding the output to retrieved text rather than the model’s recall is what makes a draft checkable, and checkability is the whole point in a filing.
- Confidentiality and privilege filters. Apply privilege and confidentiality controls at the data boundary so client material is not exposed to an uncontrolled model. Under ABA Model Rule 1.6, putting client information into a self-learning tool can be improper disclosure, so data handling is a design constraint from the outset.5
- Guardrails and human-in-the-loop. Add confidence thresholds, citation-existence and quote-verification checks, scope limits on autonomous actions, and full audit logs, ending in a mandatory attorney review-and-sign-off gate before anything is filed or sent. The agent proposes; a lawyer accepts, rejects, or refines.
The framing that keeps this defensible: the agent is built for verification, confidentiality, and supervision from day one, so a licensed attorney can stand behind the output. That is the work our AI agent development team does, grounded in the sector context our legal engineering practice brings.
How accurate is AI for legal research, and the sanctions risk
The defining risk of AI for legal research is hallucinated citations, and it is documented and sanctionable even in purpose-built legal tools. A peer-reviewed Stanford RegLab study found leading commercial legal research tools hallucinated roughly 17% to 34% of the time, and courts have sanctioned attorneys for filing AI-generated cases that do not exist, starting with Mata v. Avianca in 2023.34 A tool being built for law does not make its output safe to file unverified.
Lead with the evidence before any reassurance. In Mata v. Avianca, Inc. (S.D.N.Y., 2023), plaintiff’s attorneys submitted a brief citing six cases that did not exist, generated by ChatGPT, and the court imposed a $5,000 sanction, stressing that the deeper failure was not catching the error and then standing behind the fake cases.4 The problem then scaled instead of fading: a research database tracking AI-hallucinated content in court filings logged hundreds of cases worldwide through late 2025, the majority decided that year, with sanctions commonly running from about $1,500 to $6,000 and escalating to mandatory training, bar referrals, and disqualification.27
The Stanford finding is the part that should change how teams scope a build. The study, published in the Journal of Empirical Legal Studies, tested leading commercial tools and found Lexis+ AI answered about 65% of queries accurately while hallucinating more than 17%, and Westlaw AI-Assisted Research was accurate about 42% of the time while hallucinating more than 34%.3 The table below is the cautionary centerpiece of this page.
| Case or study | What happened | Outcome |
|---|---|---|
| Mata v. Avianca, Inc. (S.D.N.Y., 2023) | Brief cited six cases that did not exist, generated by ChatGPT. | $5,000 sanction on the attorneys. |
| Mid Cent. Operating Eng’rs Fund v. Hoosiervac LLC (S.D. Ind., 2025) | Citations to non-existent cases in filings. | $6,000 fine. |
| Tercero v. Sacramento Logistics, LLC (E.D. Cal., 2025) | AI-fabricated citations. | $1,500 fine plus State Bar service. |
| Johnson v. Dunn (N.D. Ala., 2025) | Counsel relied on unverified AI output. | Disqualification and state bar referral. |
| Stanford RegLab study (2024 to 2025) | Audited leading commercial legal research tools across 202 queries. | Roughly 17% to 34% hallucination rate. |
Why attorney review is non-delegable
Attorney review is non-delegable because ABA Formal Opinion 512 (2024) places the duties of competence, candor, confidentiality, and supervision on the lawyer, and lawyers cannot blindly rely on AI output. An agent can draft and cite, but only a licensed attorney can take professional responsibility for a filing, so every substantive AI-generated element must be reviewed and verified by a responsible lawyer before it is used.
The duties are specific. Under ABA Model Rule 1.1 as applied by Formal Opinion 512, a lawyer must understand a tool’s capabilities and failure modes, including hallucination and knowledge-cutoff limits.6 Rule 3.3 on candor to the tribunal means unverified AI citations cannot be filed; Rules 5.1 and 5.3 treat the tool as a non-lawyer assistant the firm must supervise; Rule 1.6 governs confidentiality, and Rule 1.5 means a client should not be billed for time the AI saved as if it had been done by hand.5
Put the three threads together and the conclusion is plain. Purpose-built tools still hallucinate, courts are actively sanctioning attorneys for unverified output, and the ABA rules make verification a duty the lawyer cannot hand to software. The honest position, and the one a defensible build is engineered around, is that an AI agent augments lawyers and never replaces attorney review and judgment. The agent does the repetitive drafting, research, and review; the lawyer supervises, verifies, and decides.
AI agent for legal questions
What is an AI agent for legal work?
Are AI agents safe for law firms to use?
Do AI agents replace lawyers?
How accurate are legal AI tools?
How do you build an AI agent for a law firm?
Sources
- LawNext (coverage of Thomson Reuters 2025 Generative AI in Professional Services Report), Over 95% of legal professionals expect gen AI to become central within five years (2025).
- Thomson Reuters Institute, GenAI hallucinations in court filings remain pervasive (2025).
- Stanford RegLab and HAI, Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools (2024 to 2025).
- Mata v. Avianca, Inc., sanctions opinion, S.D.N.Y. (2023).
- NCBE Bar Examiner, Generative AI tools and the ABA Formal Opinion 512 duties (2024).
- American Bar Association, ABA issues first ethics guidance on lawyers’ use of AI (Formal Opinion 512) (2024).
- Sterne Kessler, AI hallucinations in court filings: a 2025 review of sanctions (2025).
Agents & RAG
Agentic RAG: When to Use It and How to Build It
Agentic RAG explained: how it differs from naive and advanced RAG, the key patterns like corrective RAG and self-RAG, the...
Read guide →
Agents & RAG
AI Agent for Fintech: Risk, Compliance, Ops, Customer
AI agents in finance: fraud, AML, KYC and servicing use cases, how to build with money-movement guardrails and human appr...
Read guide →
Agents & RAG
AI Agent for Healthcare: Use Cases, Governance & Implementation
AI agents in healthcare: the use cases that pay off first, how to build one HIPAA-safe on FHIR with clinician review, and...
Read guide →
Agents & RAG
AI Agent for HR: Recruiting, Onboarding, People Ops
AI agents for HR: screening, employee Q and A and onboarding use cases, how to build them, and the bias, EEOC and Local L...
Read guide →
Agents & RAG
AI Agent for SaaS: How to Embed Autonomous Agents in Your Product
AI agents' disruptive impact on the SaaS industry in 2025: Gartner sees agentic AI at 30% of app-software revenue by 2035...
Read guide →
Agents & RAG
AI Agent for Sales: Pipeline & Outreach Automation
AI agents for sales: lead qualification, outreach and CRM use cases, how to build with guardrails on autonomous outreach...
Read guide →
Strategy, architecture & ops
AI Architecture Patterns
Agentic design patterns explained: reflection, tool use, planning, and multi-agent collaboration, with a framework to pic...
Read guide →
Strategy, architecture & ops
AI Architecture Patterns for SaaS: A Technical Guide
Generative AI architecture for SaaS: layered design, multi-tenant isolation, LLM gateway, RAG, and security. Built by Res...
Read guide →
Building AI
AI Copilots for SaaS: Build vs Buy Guide
AI copilot vs AI agent for SaaS: a copilot assists, an agent acts. How an in-app copilot works, the RAG and multi-tenant...
Read guide →
