Close Menu
Fin Street NewsFin Street News
  • Home
  • Business
  • Finance
    • Banking
    • Stocks
    • Commodities & Futures
    • ETFs & Mutual Funds
    • Funds
    • Currencies
    • Crypto
  • Markets
  • Investing
  • Personal Finance
    • Loans
    • Credit Cards
    • Dept Management
    • Retirement
    • Mortgages
    • Saving
    • Taxes
  • Fintech
  • More Articles

Subscribe to Updates

Get the latest finance and business news and updates directly to your inbox.

Trending
The biggest winners and losers from US restrictions on Anthropic’s AI

The biggest winners and losers from US restrictions on Anthropic’s AI

June 17, 2026
Why traditional advertising is dead, according to Mastercard Senior Fellow Raja Rajamannar

Why traditional advertising is dead, according to Mastercard Senior Fellow Raja Rajamannar

June 17, 2026
75 Top Companies With Remote Jobs This Summer

75 Top Companies With Remote Jobs This Summer

June 17, 2026
The ,000 Line: This New Plan Protects Lower Earners But Caps COLA for Everyone Above It

The $45,000 Line: This New Plan Protects Lower Earners But Caps COLA for Everyone Above It

June 17, 2026
Ukraine says even its obsolete drone-war tech still has value for friendly countries bracing for Shahed-style attacks

Ukraine says even its obsolete drone-war tech still has value for friendly countries bracing for Shahed-style attacks

June 17, 2026
Facebook X (Twitter) Instagram
  • Privacy Policy
  • Terms of use
  • Press Release
  • Advertise
  • Contact
June 17, 2026 9:33 am EDT
|
Facebook X (Twitter) Instagram
  Market Data
Fin Street NewsFin Street News
Newsletter Login
  • Home
  • Business
  • Finance
    • Banking
    • Stocks
    • Commodities & Futures
    • ETFs & Mutual Funds
    • Funds
    • Currencies
    • Crypto
  • Markets
  • Investing
  • Personal Finance
    • Loans
    • Credit Cards
    • Dept Management
    • Retirement
    • Mortgages
    • Saving
    • Taxes
  • Fintech
  • More Articles
Fin Street NewsFin Street News
Home » One of legal’s hottest startups is helping lawyers finally answer: Is the AI’s work any good?
One of legal’s hottest startups is helping lawyers finally answer: Is the AI’s work any good?
Finance

One of legal’s hottest startups is helping lawyers finally answer: Is the AI’s work any good?

News RoomBy News RoomJune 17, 20262 ViewsNo Comments

Legal technology wants its vibe-coding moment. But first, it has to prove the tools can think like a lawyer.

Taking up the task is Crosby, a startup-meets-law-firm that sells basic legal services to companies, including Cursor and Rogo. On Wednesday, it released the Redline Bench, a tool built to measure how well artificial intelligence models perform real-world legal tasks, starting with contract review.

Software engineers have spent the past few years watching these systems get shockingly good at writing code and debugging errors. Now legal tech companies are chasing a similar prize: artificial intelligence that can review contracts, spot risks, and haggle terms faster and cheaper than lawyers.

But law has a problem that coding does not, says Ryan Daniels, a former in-house lawyer turned Crosby founder. “It’s really hard to define ‘good’ or ‘bad,'” he said.

Models can write code that either runs or breaks. Legal work is a murkier target. A sales contract can be edited, or “redlined,” in lots of defensible ways, Daniels explains. A change that one lawyer sees as prudent, another might call too aggressive.

That ambiguity has become a headache for companies racing to automate legal work, from the scrappy neofirms to the model labs themselves. Anthropic has spent the past few months courting in-house lawyers with tools built for them. That push has been closely watched by investors. Earlier this year, Anthropic’s new legal plugin stirred a sell-off in legal tech stocks.

Benchmarks are one of the main ways companies track progress. The labs building frontier models use them as stress tests, measuring whether a new system is better at tasks than the last one.

Coding has hundreds of benchmarks for evaluating models. But the legal industry still lacks a shared way to answer the question: Is the AI’s work any good?

Crosby has been working on a new yardstick. The company pulled its engineers and lawyers into a tactical unit called Crosby Intelligence to build agents for Crosby’s law firm and a benchmark to grade them against. That team includes engineer Sharan Ramjee, who worked on transformer models to sniff out fraud at Stripe, and Ross Weiser, a lawyer who joined from elite law firm Sullivan & Cromwell.

Crosby also partnered with Micro1, a company that helps model-makers recruit expert workers, to find more lawyers who could help define what counts as good legal work.

To build the benchmark, senior lawyers simulated software deals and marked the contract changes they considered most important at each stage of the negotiation. Those changes were turned into weighted criteria.

When Crosby runs a new test, it gives models the same contracts and asks them to make their own edits. Then a panel of three judges compares these redlines with the lawyer-built rubric. The judges vote pass or fail on each item, and the final score shows how often the models made the kinds of edits that lawyers considered important.

Redline Bench will be made public so any lab can put its models through Crosby’s paces. Crosby also plans to regularly release reports tracking how major models compare.

The first release of the Redline Bench put ChatGPT 5.5 at the top of the heap, with a score of 50.5%, meaning the model’s redlines matched half of the edits that lawyers prioritized. Gemini 3.5 Flash followed at 45.1%, and Claude Opus 4.8 scored 44.4%.

Crosby was able to test Anthropic’s highly capable new model, Fable 5, only once before Anthropic pulled it off the shelves. The results were promising, with a score of 47.3%. When access is restored, Crosby will run the benchmark again and update it.

Crosby isn’t the only company trying to measure how the models stack up. Harvey, one of the best-funded legal startups, has released benchmarks for case law research and contract review.

Anthropic and OpenAI also build their own benchmarks to measure performance on real-world tasks. But Daniels said those results can be hard to trust. Over time, the labs eventually tune their systems to perform well on their own tests, he said.

The stakes are bigger than a scoreboard. Billions of investment dollars are riding on the promise that artificial intelligence can lower legal bills and absorb work that used to pile up on the general counsel’s desk.

Lawyers will only use the tools if they trust them. Crosby wants to give them a reason to.



Read the full article here

AIs answer finally good helping hottest lawyers legals startups work
Share. Facebook Twitter LinkedIn Telegram WhatsApp Email

Keep Reading

The biggest winners and losers from US restrictions on Anthropic’s AI

The biggest winners and losers from US restrictions on Anthropic’s AI

Ukraine says even its obsolete drone-war tech still has value for friendly countries bracing for Shahed-style attacks

Ukraine says even its obsolete drone-war tech still has value for friendly countries bracing for Shahed-style attacks

Panera’s CEO regrets a cost-cutting move he approved as CFO

Panera’s CEO regrets a cost-cutting move he approved as CFO

I spent ,500 to watch the ‘Summer House’ reunion at the show’s Hamptons house. It felt like reliving my youth.

I spent $3,500 to watch the ‘Summer House’ reunion at the show’s Hamptons house. It felt like reliving my youth.

Leaked audio: Disney product chief lays out what’s part of its ‘super app’ plans — and what isn’t

Leaked audio: Disney product chief lays out what’s part of its ‘super app’ plans — and what isn’t

The 20 most peaceful countries in the world, ranked

The 20 most peaceful countries in the world, ranked

Pizza Hut is getting a new owner: private equity firm LongRange buys chain in .5 billion deal

Pizza Hut is getting a new owner: private equity firm LongRange buys chain in $1.5 billion deal

Microsoft walked away from a  billion deal to lease Oracle cloud capacity over security concerns

Microsoft walked away from a $3 billion deal to lease Oracle cloud capacity over security concerns

An Iran peace deal won’t lower airfares anytime soon, analysts say

An Iran peace deal won’t lower airfares anytime soon, analysts say

Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Why traditional advertising is dead, according to Mastercard Senior Fellow Raja Rajamannar

Why traditional advertising is dead, according to Mastercard Senior Fellow Raja Rajamannar

June 17, 2026
75 Top Companies With Remote Jobs This Summer

75 Top Companies With Remote Jobs This Summer

June 17, 2026
The ,000 Line: This New Plan Protects Lower Earners But Caps COLA for Everyone Above It

The $45,000 Line: This New Plan Protects Lower Earners But Caps COLA for Everyone Above It

June 17, 2026
Ukraine says even its obsolete drone-war tech still has value for friendly countries bracing for Shahed-style attacks

Ukraine says even its obsolete drone-war tech still has value for friendly countries bracing for Shahed-style attacks

June 17, 2026
Why a neuroscientist worries outsourcing thinking to AI could weaken your brain’s defenses against dementia

Why a neuroscientist worries outsourcing thinking to AI could weaken your brain’s defenses against dementia

June 17, 2026

Latest News

Panera’s CEO regrets a cost-cutting move he approved as CFO

Panera’s CEO regrets a cost-cutting move he approved as CFO

June 17, 2026
Bose is becoming a media company

Bose is becoming a media company

June 17, 2026
One of legal’s hottest startups is helping lawyers finally answer: Is the AI’s work any good?

One of legal’s hottest startups is helping lawyers finally answer: Is the AI’s work any good?

June 17, 2026

Subscribe to News

Get the latest finance and business news and updates directly to your inbox.

Advertisement
Demo
Facebook X (Twitter) Pinterest TikTok Instagram
2026 © Prices.com LLC. All Rights Reserved.
  • Privacy Policy
  • Terms
  • For Advertisers
  • Contact

Type above and press Enter to search. Press Esc to cancel.