Deep Dive into TableRecordMatch: A New Metric for Evaluating Parsing Accuracy on Complex Tables
LlamaIndex · 2026-04-15 13:58
If a parser swaps two column headers in a financial table, every single row gets misread. But the industry-standard metric, TEDS, barely notices. That's a problem.

Hi, I'm Preston from LlamaIndex. We just released ParseBench, the first document OCR benchmark designed specifically for AI agents. In ParseBench, we introduced the GTRM metric, which changes how we evaluate table extraction.

TEDS, or tree-edit-distance-based similarity, has been the go-to table metric for years. It compares the HTML tree structure of a predicted ...
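
To make the header-swap point concrete, here is a toy sketch in Python. It is neither the TEDS algorithm nor the GTRM implementation: the table, the function names, and both scoring rules are assumptions made up for illustration. It only contrasts a position-by-position cell comparison (a rough stand-in for structure-oriented scoring) with a comparison of rows read as header-to-value records.

```python
# Hypothetical 3-row financial table; all names and values are illustrative.
HEADERS = ["Item", "Revenue", "Cost"]
ROWS = [
    ["Q1", "100", "60"],
    ["Q2", "120", "70"],
    ["Q3", "90", "55"],
]


def swap_headers(headers, i, j):
    """Return a copy of the header row with columns i and j swapped."""
    out = list(headers)
    out[i], out[j] = out[j], out[i]
    return out


def cell_mismatch_ratio(h_true, rows_true, h_pred, rows_pred):
    """Fraction of cells (headers included) whose text differs by position."""
    true_cells = list(h_true) + [c for row in rows_true for c in row]
    pred_cells = list(h_pred) + [c for row in rows_pred for c in row]
    differing = sum(a != b for a, b in zip(true_cells, pred_cells))
    return differing / len(true_cells)


def to_records(headers, rows):
    """Read each body row as a {header: value} record."""
    return [dict(zip(headers, row)) for row in rows]


def record_mismatch_ratio(h_true, rows_true, h_pred, rows_pred):
    """Fraction of rows whose header-to-value record differs."""
    true_records = to_records(h_true, rows_true)
    pred_records = to_records(h_pred, rows_pred)
    differing = sum(a != b for a, b in zip(true_records, pred_records))
    return differing / len(true_records)


# Simulate a parser that swaps the "Revenue" and "Cost" headers.
pred_headers = swap_headers(HEADERS, 1, 2)

cell_ratio = cell_mismatch_ratio(HEADERS, ROWS, pred_headers, ROWS)
record_ratio = record_mismatch_ratio(HEADERS, ROWS, pred_headers, ROWS)

print(f"cells that differ:   {cell_ratio:.0%}")
print(f"records that differ: {record_ratio:.0%}")
```

In this toy setup only 2 of the 12 cells change, yet every row now pairs its numbers with the wrong header, so all three records differ.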