In this paper, the authors construct a benchmark of long-form, open-ended questions and multiple-choice questions to evaluate the legal-reasoning performance of a range of LLMs. Legal reasoning requires applying deductive and inductive logic to complex scenarios, often with undefined parameters. Their results show that these models still “struggle with open questions that require structured, multi-step legal reasoning.”
Legal reasoning is a critical frontier for large language models (LLMs) specifically and artificial intelligence (AI) at large, requiring specialized domain knowledge and advanced reasoning abilities such as precedent interpretation, statutory analysis, and legal inference. Despite progress in general reasoning, legal reasoning remains difficult and under-assessed in NLP research. Moreover, the legal domain is inherently high-stakes, and a failure to thoroughly examine the capabilities and limitations of models could lead to serious real-world consequences …
Our analysis reveals substantial variability and limitations in LLM capabilities on MCQs and, especially, on complex open questions; notably, increasing the number of MCQ options consistently reduces model accuracy. Our evaluation framework offers a scalable approach for assessing legal reasoning quality beyond simple accuracy metrics, thereby facilitating future research aimed at enhancing the reliability and robustness of LLMs on challenging legal tasks.
