Paddle OCR with Lightweight Models

H2O.ai Launches New Multimodal Foundation Models to Undertake Document AI Use Cases

H2OVL Mississippi 0.8B Model Surpasses Leading Small Vision Language Models (SVLMs) and Impressively Outperforms Larger State-of-the-Art Vision Language Models (VLMs) in OCR Benchmarks for Text ...

Gizmochina

Baidu releases PP-OCRv5, a compact AI model that beats large rivals in OCR tests

Baidu just dropped something pretty interesting in the AI scene. After their recent launch of Ernie X1.1 deep thinking model, they’ve now released PP-OCRv5, a new optical character recognition model ...

Geeky Gadgets

Top AI Vision-Language Models : What You Need to Know

Imagine a world where your devices not only see but truly understand what they’re looking at—whether it’s reading a document, tracking where someone’s gaze lands, or answering questions about a video.

scmp.com

DeepSeek taps Alibaba open-source AI technology to boost OCR performance

Chinese artificial intelligence start-up DeepSeek on Tuesday unveiled an upgraded version of its optical character recognition (OCR) model, incorporating an Alibaba Cloud-developed open-source system ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results