101-People-4538-Images-Japanese-Handwriting-OCR-Data

Description

This dataset contains 4,538 images of Japanese handwriting collected from 101 people. The text is written on A4 paper. The content covers social livelihood, entertainment, tourism, sports, movies, compositions, and other fields. Each image is annotated with character-level rectangular bounding boxes and text transcriptions. The dataset can be used for tasks such as Japanese handwriting OCR.

For more details, please refer to the link: https://www.nexdata.ai/datasets/ocr/1087?source=Github

Data size

101 people, 4,538 images

Collecting environment

A4 paper

Device

scanner

Photographic angle

eye-level angle

Data format

the image data format is .jpg; the annotation file format is .json
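
As a rough illustration of working with the two file types, the sketch below pairs each .jpg image with a .json annotation file. It assumes that an image and its annotation share the same file name apart from the extension; this layout is not documented here, and the function name `pair_images_with_annotations` is hypothetical. The JSON schema is also not described here, so the sketch does not parse the annotation contents.

```python
from pathlib import Path


def pair_images_with_annotations(root):
    """Return (image_path, annotation_path) pairs that share a file stem.

    Assumption: "page_001.jpg" is annotated by "page_001.json".
    The dataset description does not confirm this naming scheme.
    """
    root = Path(root)
    # Index every .json file by its stem (file name without extension).
    annotations = {path.stem: path for path in root.rglob("*.json")}

    pairs = []
    for image in sorted(root.rglob("*.jpg")):
        annotation = annotations.get(image.stem)
        if annotation is not None:
            pairs.append((image, annotation))
    return pairs
```

Images without a matching annotation file are skipped rather than reported, which keeps the sketch short; a real loader would probably want to log them.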

Data content

including social livelihood, entertainment, tourism, sports, movies, compositions, and other fields

Annotation content

character-level rectangular bounding box annotation and text transcription

Accuracy

a bounding box is a qualified annotation when the error of each of its vertices is within 2 pixels; the accuracy of bounding boxes is not less than 98%; the character transcription accuracy is not less than 98%
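
The bounding box criterion can be sketched as a small check. The code below assumes that the vertex error is measured as Euclidean distance to a reference vertex and that boxes are given as lists of (x, y) tuples in matching order; neither detail is specified here, and the function names are hypothetical.

```python
import math


def box_is_qualified(annotated, reference, tolerance=2.0):
    """True if every annotated vertex is within `tolerance` pixels of its reference.

    Assumption: error is the Euclidean distance between matching vertices.
    """
    if len(annotated) != len(reference):
        return False
    return all(
        math.hypot(ax - rx, ay - ry) <= tolerance
        for (ax, ay), (rx, ry) in zip(annotated, reference)
    )


def box_accuracy(box_pairs, tolerance=2.0):
    """Fraction of (annotated, reference) box pairs that are qualified."""
    box_pairs = list(box_pairs)
    if not box_pairs:
        return 0.0
    qualified = sum(box_is_qualified(a, r, tolerance) for a, r in box_pairs)
    return qualified / len(box_pairs)
```

Under these assumptions, a sample of boxes would meet the stated threshold when `box_accuracy(...) >= 0.98`.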

Licensing Information

Commercial License