Universal OCR Text Recognition

4 credits / call

Whether you are automating receipt entry or highlighting recognized text on images in a web frontend, this high-precision OCR endpoint provides the underlying capability.

Overview

Important

If you only need the text in the image (for example, extracting text from screenshots or reviewing content for safety), it is strongly recommended to set `need_location` to `false`. This significantly reduces the size of the returned JSON, which speeds up network transfer and parsing.

Beyond plain image-to-text conversion, the endpoint includes several features aimed at real development scenarios:

Frontend Text Highlighting and Structured Analysis: By default, the response includes the bounding rectangle and the four vertex coordinates for each paragraph of text. This suits drawing highlight boxes over the original image with Canvas, or extracting key-value pairs from receipts on the backend based on relative positions.
Anti-Distortion in Complex Shooting Environments: For images rotated or tilted by mobile phone capture, set `enable_cls=true`. The server corrects the text direction before recognition, which can significantly improve accuracy.
Flexible Input and Request Requirements: The endpoint accepts three input methods: `file`, `url`, or `image_base64`. The request must be sent as `multipart/form-data`, and any image URL must be directly accessible from the public internet.

Request body

Form data containing the image to recognize and optional settings. Regardless of the input method, the request body must use the `multipart/form-data` format. Provide exactly one of `file`, `url`, or `image_base64` as the input source.

Name	Type	Attributes	Description
`file`	file	optional	Image file to recognize. Supports common formats such as JPG, JPEG, PNG, BMP, GIF, and WebP, up to 10 MB. Do not submit together with `url` or `image_base64`.
`url`	string	optional	Publicly accessible image address. Do not submit together with `file` or `image_base64`.
`image_base64`	string	optional	Base64 string of the image. Accepts either a complete Data URI or the bare Base64 content. Do not submit together with `file` or `url`.
`image_name`	string	optional	Custom image filename. Recommended when passing a URL or bare Base64, so that the file extension can be retained or inferred.
`need_location`	string	optional	Whether to return text coordinate information. Pass `true` or `false`; defaults to `true`.
`return_markdown`	string	optional	Whether to additionally return the text organized as Markdown. Pass `true` or `false`; defaults to `false`.
`enable_cls`	string	optional	Whether to enable additional text direction correction. Pass `true` or `false`; defaults to `false`.
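
The rules in the table (exactly one input source, boolean switches sent as the strings `true` or `false`) can be sketched as a small helper. This is an illustration only: `build_ocr_fields` is a hypothetical name, it covers just the `url` and `image_base64` sources, and a `file` upload would be attached separately as a multipart file part.

```python
from typing import Dict, Optional


def build_ocr_fields(
    url: Optional[str] = None,
    image_base64: Optional[str] = None,
    image_name: Optional[str] = None,
    need_location: bool = True,
    return_markdown: bool = False,
    enable_cls: bool = False,
) -> Dict[str, str]:
    """Return the text form fields for one OCR request (hypothetical helper)."""
    sources = {"url": url, "image_base64": image_base64}
    chosen = {name: value for name, value in sources.items() if value}
    # The documentation asks for exactly one input source per request.
    if len(chosen) != 1:
        raise ValueError("provide exactly one of url or image_base64")

    fields = dict(chosen)
    if image_name:
        fields["image_name"] = image_name
    # The table types the switches as strings, so send "true" / "false".
    switches = (
        ("need_location", need_location),
        ("return_markdown", return_markdown),
        ("enable_cls", enable_cls),
    )
    for name, value in switches:
        fields[name] = "true" if value else "false"
    return fields
```

Checking the inputs on the client side avoids a round trip that would otherwise end in a 400 response for a missing or conflicting image source.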

Response

200 / OK

Recognition succeeded; the response is a unified OCR result object. Coordinate information is included by default; when `need_location=false`, coordinate-related fields are omitted.

400 / Bad Request

The request parameters are invalid, for example no image source was provided, multiple image sources were submitted, a boolean parameter is invalid, or the Base64 content is malformed.

Format 1: Missing Image Source

Format 2: Input Source Conflict

Format 3: Base64 Format Error

413 / Payload Too Large

The image size exceeds the current limit.

415 / Unsupported Media Type

The uploaded content is not a recognizable common image format.

502 / Bad Gateway

Recognition processing failed. Please try again later.

503 / Service Unavailable

The text recognition service is temporarily unavailable. Please try again later.
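
Since 502 and 503 are both described as "try again later", a client may want to retry only those two statuses and return everything else immediately. The sketch below is one way to do that with exponential backoff. The names `call_with_retry` and `send`, the attempt count, and the delays are all assumptions for illustration; the documentation does not prescribe a retry policy.

```python
import time
from typing import Callable, Tuple

# Statuses the documentation describes as temporary failures.
RETRYABLE = {502, 503}


def call_with_retry(
    send: Callable[[], Tuple[int, dict]],
    attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, dict]:
    """Call send() up to `attempts` times, retrying only on 502/503.

    `send` is a caller-supplied function returning (status_code, json_body).
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    status, body = send()
    for attempt in range(attempts - 1):
        if status not in RETRYABLE:
            break
        # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
        sleep(base_delay * 2 ** attempt)
        status, body = send()
    return status, body
```

Errors such as 400, 413, and 415 are returned on the first attempt, because resending the same request would fail in the same way.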

Quick start

cURL command

curl -X POST 'https://uapis.cn/api/v1/image/ocr' \
  -F 'url=https://uapis.cn/ocr-samples/bilingual-poetry-sample.png'
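
For readers who prefer Python, the same request can be sketched with the standard library alone. The helper names `encode_text_fields` and `build_request` are hypothetical, the encoder handles plain text fields only (not `file` uploads), and any authentication headers your account may require are not shown because this page does not describe them.

```python
import urllib.request
import uuid
from typing import Dict

OCR_ENDPOINT = "https://uapis.cn/api/v1/image/ocr"  # taken from the cURL command


def encode_text_fields(fields: Dict[str, str], boundary: str) -> bytes:
    """Encode plain text fields as a multipart/form-data body."""
    lines = []
    for name, value in fields.items():
        lines.append(f"--{boundary}")
        lines.append(f'Content-Disposition: form-data; name="{name}"')
        lines.append("")  # blank line separates part headers from the value
        lines.append(value)
    lines.append(f"--{boundary}--")  # closing boundary
    lines.append("")
    return "\r\n".join(lines).encode("utf-8")


def build_request(fields: Dict[str, str]) -> urllib.request.Request:
    """Build (but do not send) a POST request to the OCR endpoint."""
    boundary = uuid.uuid4().hex
    return urllib.request.Request(
        OCR_ENDPOINT,
        data=encode_text_fields(fields, boundary),
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


request = build_request(
    {"url": "https://uapis.cn/ocr-samples/bilingual-poetry-sample.png"}
)
# To send it:
#   import json
#   with urllib.request.urlopen(request, timeout=30) as response:
#       result = json.load(response)
```

Building the request separately from sending it keeps the encoding step easy to inspect and test without network access.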

Frequently asked questions

How do I draw text boxes on a frontend Canvas from the coordinates?

Iterate through the `words_result` array and read the `location` object of each item. `left` and `top` are the coordinates of the top-left corner, and `width` and `height` are the dimensions of the rectangle. Pass these four values directly to Canvas's `strokeRect` to draw the recognition box.
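
The same iteration can be sketched on the backend, for example to prepare the rectangles before handing them to the frontend. The function name `stroke_rect_args` and the `scale` parameter are hypothetical, and the sketch assumes you pass in the object that directly contains `words_result`; the full response shape is not shown on this page.

```python
from typing import Any, Dict, List, Tuple

Rect = Tuple[float, float, float, float]  # (left, top, width, height)


def stroke_rect_args(result: Dict[str, Any], scale: float = 1.0) -> List[Rect]:
    """Collect one (x, y, width, height) tuple per recognized item.

    `scale` converts image pixels to canvas pixels when the canvas is
    displayed at a different size than the original image.
    """
    rects: List[Rect] = []
    for item in result.get("words_result", []):
        location = item.get("location")
        if not location:
            # No coordinates, e.g. when need_location=false was sent.
            continue
        rects.append(
            (
                location["left"] * scale,
                location["top"] * scale,
                location["width"] * scale,
                location["height"] * scale,
            )
        )
    return rects
```

Each tuple is in the argument order that `strokeRect(x, y, width, height)` expects, so the frontend can spread it straight into the call.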

Why am I more likely to get file-size errors when passing Base64?

Base64 encoding inflates the image by roughly one third. If the original image is already large, the encoded form is more likely to exceed the request body size limit. For large images, prefer passing `file` directly, or upload the image to a public URL first and pass `url`.
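
The inflation is easy to quantify: padded Base64 emits 4 characters for every 3 input bytes. The sketch below computes the encoded length and the largest raw size that still fits under a given limit. It assumes "10MB" means 10 MiB and that the limit is measured on the encoded text; the page states neither, so treat the numbers as illustrative.

```python
import base64


def base64_length(raw_bytes: int) -> int:
    """Length of the padded Base64 text for raw_bytes bytes of input."""
    return 4 * ((raw_bytes + 2) // 3)


def max_raw_bytes(encoded_limit: int) -> int:
    """Largest raw size whose Base64 form fits within encoded_limit characters."""
    return (encoded_limit // 4) * 3


LIMIT = 10 * 1024 * 1024  # assumption: "10MB" means 10 MiB

# Sanity check of the formula against the real encoder.
assert base64_length(5) == len(base64.b64encode(b"x" * 5))

print(base64_length(LIMIT))   # 13981016 characters for a 10 MiB image
print(max_raw_bytes(LIMIT))   # 7864320 bytes (7.5 MiB) is the largest that fits
```

A Data URI prefix and the multipart framing add a little more on top of this, so leaving some headroom below the computed maximum is sensible.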

What if recognition results are unstable for skewed receipts or screenshots?

In this case, set `enable_cls` to `true`. The server attempts text direction correction before recognition, which usually gives more stable results than recognizing the skewed image directly.
