Baidu has developed a new OCR system named Unlimited OCR that can process dozens of document pages in a single pass, significantly surpassing previous systems that handled about ten pages at most, according to The Decoder.
The key innovation lies in a modified attention mechanism that maintains consistent memory usage regardless of the number of pages processed. This advancement allows the system to efficiently handle large volumes of documents without increased computational costs.
Currently, Baidu's Unlimited OCR holds the top position on the leading OCR benchmark, reflecting its superior performance. For Japanese markets, where document digitization and automation are critical in sectors like finance and legal services, this technology could enhance efficiency in processing large data sets.
