Universal OCR Text Recognition
Free high-precision universal OCR text recognition API interface, can submit images via local file, image link or Base64, accurately extract complete text, paragraph-by-paragraph text and coordinate information. Suitable for receipt recognition, document scanning, screenshot text extraction, document digitization and automated form entry scenarios. · Suitable for receipt screenshot recognition, form auto-entry, contract scanned document text extraction, image text annotation box selection, photo text extraction, document information recognition and structured data extraction scenarios · Universal OCR Text Recognition
Free OCR interface, Text recognition API, Universal OCR, Image to text, Extract text from image, Screenshot text extraction, OCR with coordinates, Receipt recognition interface, Document recognition API, Photo text extraction, image ocr api, ocr coordinate api
POST /image/ocr - Universal OCR Text Recognition
Whether you need to implement automated receipt entry, or highlight text coordinates on images in web frontend, this high-precision OCR endpoint can provide you with powerful basic capabilities.
Overview
[!IMPORTANT]
If you only care about what's written on the image (such as screenshot text extraction or content security review), it is strongly recommended to set needlocation to false. This will significantly reduce the size of returned JSON data, improving network transmission and system parsing efficiency.
In addition to regular image-to-text conversion, this endpoint has some practical designs for actual development scenarios:
- Frontend Text Highlighting and Structured Analysis: By default returns rectangular coordinates and four vertex coordinates for each paragraph of text. This is very suitable for using Canvas to draw boxes and highlight on the original image, or extracting key-value pair information from receipts based on relative positions on the backend.
- Anti-Distortion in Complex Shooting Environments: For rotation or tilt caused by mobile phone shooting, you can enable enablecls=true. The server will automatically perform direction pre-correction before recognition, significantly improving recognition accuracy.
- Flexible Input and Request Requirements: The endpoint supports three input methods: file, url or imagebase64. Please ensure the request format is multipart/form-data, and the image link is directly accessible from the public internet.