Deep Dive into Semantic Formatting Score: A New Metric for Meaningful Document Formatting
LlamaIndex · 2026-04-28 03:05
A price tag shows $10, crossed out and replaced by $4. Your parser outputs both prices as plain text next to each other. Now the agent thinks there are two valid prices. Which one should it use? I'm Simon from LlamaIndex. Most document OCR benchmarks completely ignore text formatting. They strip it before evaluation, treating it as cosmetic. But in ParseBench, the first document OCR benchmark for AI agents, we introduced a semantic formatting score because formatting carries meaning. Strikethrough marks de ...
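The price-tag case can be illustrated with a minimal Python sketch. It assumes the parser emits Markdown-style `~~...~~` strikethrough, which the excerpt does not state; the function names and regular expressions are hypothetical and are not part of the benchmark.

```python
import re

# Assumption: struck-through text is marked Markdown-style as ~~text~~.
STRIKE = re.compile(r"~~(.*?)~~")
PRICE = re.compile(r"\$\d+(?:\.\d+)?")

def prices_plain(text: str) -> list[str]:
    """Drop the strikethrough markers but keep the text, as a format-blind parser would."""
    return PRICE.findall(STRIKE.sub(r"\1", text))

def prices_active(text: str) -> list[str]:
    """Remove struck-through spans first, so only prices still in force remain."""
    return PRICE.findall(STRIKE.sub("", text))

tag = "~~$10~~ $4"
print(prices_plain(tag))   # ['$10', '$4']
print(prices_active(tag))  # ['$4']
```

With the formatting stripped, both prices look equally valid; with it preserved, a downstream agent can tell that only $4 applies.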
Deep Dive into TableRecordMatch: A New Metric for Evaluating Parsing Accuracy on Complex Tables
LlamaIndex · 2026-04-15 13:58
If a parser swaps two column headers in a financial table, every single row gets misread. But the industry-standard metric, TEDS, barely notices. That's a problem. Hi, I'm Preston from LlamaIndex. We just released ParseBench, the first document OCR benchmark designed specifically for AI agents. In ParseBench, we introduced the GTRM metric, which changes how we evaluate table extraction. TEDS (Tree-Edit-Distance-based Similarity) has been the go-to table metric for years. It compares the HTML tree structure of a predicted ...
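The header-swap problem can be shown with a small Python sketch. It implements neither TEDS nor the metric from the video: `cell_agreement` is a hypothetical stand-in for a structure-oriented score, `record_agreement` is a hypothetical record-level comparison, and the table values are made up for illustration.

```python
def to_records(table):
    """Turn a header row plus data rows into one dict per data row."""
    header, *rows = table
    return [dict(zip(header, row)) for row in rows]

def cell_agreement(pred, truth):
    """Fraction of cells that match position by position (header row included)."""
    pairs = [(p, t) for pr, tr in zip(pred, truth) for p, t in zip(pr, tr)]
    return sum(p == t for p, t in pairs) / len(pairs)

def record_agreement(pred, truth):
    """Fraction of data rows whose header-to-value mapping matches exactly."""
    pred_records, truth_records = to_records(pred), to_records(truth)
    matches = sum(p == t for p, t in zip(pred_records, truth_records))
    return matches / len(truth_records)

# Illustrative data only.
truth = [["Quarter", "Revenue", "Cost"],
         ["Q1", "120", "80"],
         ["Q2", "150", "90"],
         ["Q3", "170", "95"]]
# The prediction swaps the last two headers and leaves the body untouched.
pred = [["Quarter", "Cost", "Revenue"]] + truth[1:]

print(round(cell_agreement(pred, truth), 3))  # 0.833
print(record_agreement(pred, truth))          # 0.0
```

Only two of twelve cells differ, so a cell-oriented score stays high, yet no row keeps its correct header-to-value mapping.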