Conference on Data Science and Law

August 5-6, 2025


Welcome to the Conference on Data Science and Law! We aim to showcase the latest research applying computational methods to empirical legal studies, and convene researchers in this new field. The 3rd iteration of the conference will be held at Fordham Law School in New York, NY from August 5th-6th, 2025

Registration Link: here.

Key Dates

Date
Submission Deadline April 15th
Acceptance Notification May 1st
Conference Date August 5-6th

Call for Papers

Sponsored by Fordham University, ETH Zurich, and the University of Virginia.

Interest is growing among scholars from a variety of disciplines and across the globe in research at the intersection of data science and law. In recent years, the digitization of legal texts and developments in statistics and computer science have paved the way for new methodological approaches in this area. This constellation of new data and methods has created exciting opportunities for progress on old questions as well as the prospect of opening entirely new areas in empirical legal scholarship.

The 3rd Conference on Data Science and Law will convene researchers across disciplines who are interested in this burgeoning field. Scholars in law, social and behavioral sciences, digital humanities, computer science, machine learning, and data analytics are invited to present works-in-progress in all areas of empirical legal studies that involve big data sources or use data science techniques, including natural language processing, machine learning, algorithmic fairness, topic modeling, or network analysis. The purpose of the conference is to highlight the best and most innovative scholarship in data science and law and to help build an intellectual community in support of this new field. Works that are descriptive in nature are invited as well as research that focuses on causal inference.

The workshop will be held at Fordham Law School in New York City on August 5th – 6th, 2025. Researchers are invited to submit unpublished working papers for presentation at the conference by April 15, 2025. Papers will be selected through a peer review process. While full drafts are recommended, we will accept and review extended abstracts or slide decks.

Submissions for the conference have now closed. If you are interested in attending, please register at this link.

Please feel free to circulate this call to colleagues or graduate students who may be interested.
On behalf of the organizers,
Elliott Ash, ETH Zurich
Mike Livermore, University of Virginia
Aniket Kesari, Fordham University

Please see the tentative conference schedule below

CDSL 2025 – August 5

Time Room 4-04 Room 4-05
9:15 AM Welcome and Opening Remarks – Room 4-08
9:30 AM Adam Badawi (UC Berkeley)
Separating Fact and Opinion in Financial Disclosures
Discussant: Sabrina Arias

Weikun Dong (WashU)
AI Teaching Puffery
Discussant: Adam Badawi

Sabrina Arias (Lehigh)
Everything Old is New Again: Textual Recycling in UN Resolutions
Discussant: Weikun Dong
Jakob Merane (ETH Zurich)
Machine Learning Compliance Analysis for Email Regulation
Discussant: Janet Freilich

Emily Hua (Harvard)
How Close is Close Enough: Examining TransUnion’s Impact on Statutory Privacy Violations
Discussant: Jakob Merane

Janet Freilich (BU)
Using Legal Texts to Assess the Novelty of AI-Generated Drugs
Discussant: Emily Hua
11:30 AM Plenary – Room 4-08
Dan Milo (NYU)
The Costs of Housing Regulation: Evidence From Generative Regulatory Measurement
Discussant: Ryan Hubert
12:15 PM Lunch (on your own)
1:30 PM Kevin Cope (UVA)
Judicial Dimensions
Discussant: Josh Fischman

Jon Choi (WashU)
Large Language Models Are Unreliable Judges
Discussant: Kevin Cope

Joshua Fischman (UVA)
Ideology in the Circuit Courts: Estimates from Criminal Cases, 2008–23
Discussant: Jon Choi
Justin Simard (Michigan State)
Mapping the Canon: Case Selection in Law School Casebooks
Discussant: Mike Livermore

Eric Martinez (UChicago)
Traditional and Computational Canons
Discussant: Justin Simard

Mike Livermore (UVA)
Speaking With or Talking Past: Discursive Cooperation on the U.S. Supreme Court
Discussant: Eric Martinez
3:30 PM Break
4:00 PM Lightning Talks (12 talks – 5 min each):

1. Vivian Nastl (ETH Zurich), Extending Legal Databases with LLM Annotations: Opportunities and Challenges

2. Yu Fan (ETH Zurich), AI and Law School Exams

3. Ryan Leung (Northwestern University), Testing the Rule of Law: Authoritarian Institutional Shock in Hong Kong

4. Karolina Naranjo (University of Virginia), Arinbjörn Kolbeinsson (UVA, Regava), Benedikt Kolbeinsson (Imperial College London, Regava), Jonathan Kropko (UVA), Yangfeng Ji (UVA), and Thomas Hartvigsen (UVA), Extracting Patterns of Legal Citations over Time with Large Language Models: A Case Study of Colombia and Iceland

5. Matthew Dahl (Yale University), Bye-bye, Bluebook? Testing AI's Ability to Automate Legal Procedure

6. Rui Zuo (University of Texas at Austin), Algorithm as Manager: How Algorithmic Judge-Case Assignment Influences Court Performance

7. Yawri Carr (TU Munich), Explainable AI in Natural Language Processing: Enhancing Transparency in Legal Judgment Predictions with Shapely Values

8. Qin (Sky) Ma (Max Planck Institute for the Study of Crime, Security and Law), The Dual Impact of Generative AI on Judicial Efficiency: A Cross-Jurisdictional Analysis

9. Bao Kham (Cornell Tech), Measuring Legal Importance: From Case Citations to Linguistic Shifts

5:00 PM Cocktail Hour and Poster Session
7:00 PM Dinner. All & Sundry (312 W 58th St, New York, NY 10019)

CDSL 2025 – August 6

Time Room 4-04 Room 4-05
9:30 AM Zubin Jelveh (University of Maryland)
Perils and Pitfalls in the Use of Synthetic Control Methods to Study Public Safety Interventions
Discussant: Talia Gillis

David Abrams (UPenn)
Prose and Cons: Measuring Policing Disparities with Text Data
Discussant: Zubin Jelveh

Talia Gillis (Columbia)
Algorithmic UDAAP
Discussant: David Abrams

Aniket Kesari (Fordham University)
What Are Oral Arguments For?
Discussant: Dan Milo

Ryan Copus (UMKC) and Hannah Laqueur (UC Davis)
Error in the Loop: How Human Mistakes Can Improve Algorithmic Learning
Discussant: Daniel Chen

Daniel Chen (Harvard Radcliffe)
Large Language Models as Machini Moralis: Aligning AI with Social Preferences
Discussant: Ryan Copus
11:30 AM Plenary Discussion
12:15 PM Lunch (on your own)
1:30 PM Marshall Steinbaum (University of Utah)
The Balance of Power in Franchising
Discussant: Ben Chen

Ben Chen (HKU)
Official Discourses of Legality in China: A Computational Analysis of 30 Years of the People's Daily
Discussant: Marshall Steinbaum
David Schwartz (Northwestern)
SCALES: Systematic Content Analysis of Litigation Events
Discussant: Dominik Stammbach

Dominik Stammbach (Princeton)
An open-source and free Search Engine for Paragraph Retrieval from United States' Caselaw
Discussant: David Schwartz
2:50 PM Concluding Remarks