Generate line by line images #48

Open
omarabb315 opened this issue Feb 18, 2024 · 3 comments

Comments

@omarabb315

How can I extract a dataset of images of each line of the Quran, paired with the corresponding text?
I want to use it to train an OCR model.

@ahmedre
Contributor

ahmedre commented Feb 18, 2024

salam 3alaikum,
The table records which line each glyph appears on and which word each glyph represents, so you should be able to write some SQL queries and/or a script to extract this data fairly easily.
walsalam 3alaikum

@omarabb315
Author

And peace be upon you, and the mercy of God and His blessings. May God reward you with good for the quick reply.
Could you give me some details about using the table? I am not familiar with this framework, and I want to build a dataset of line images paired with their text.

@ahmedre
Contributor

ahmedre commented Feb 18, 2024

The repository comes bundled with a MySQL database - you can extract the data from it without running the included scripts.
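
For readers following along, here is a minimal sketch of the grouping step described above, once glyph rows have been read from the database. The row layout `(page, line, position, word_text, min_x, max_x, min_y, max_y)` and the function name `build_line_dataset` are assumptions made for illustration; the actual table and column names in the bundled database may differ, and the sketch assumes one glyph per word and that bounding boxes are available.

```python
from collections import defaultdict

# Hypothetical row layout (an assumption, not the confirmed schema):
# (page, line, position, word_text, min_x, max_x, min_y, max_y)


def build_line_dataset(rows):
    """Group per-glyph rows into one record per (page, line)."""
    grouped = defaultdict(list)
    for page, line, position, text, min_x, max_x, min_y, max_y in rows:
        grouped[(page, line)].append((position, text, min_x, max_x, min_y, max_y))

    dataset = []
    for (page, line), glyphs in sorted(grouped.items()):
        # Restore the order of the glyphs within the line.
        glyphs.sort(key=lambda g: g[0])
        dataset.append({
            "page": page,
            "line": line,
            # Line text: the words joined in position order.
            "text": " ".join(g[1] for g in glyphs),
            # Union of the glyph boxes as (left, upper, right, lower).
            "box": (
                min(g[2] for g in glyphs),
                min(g[4] for g in glyphs),
                max(g[3] for g in glyphs),
                max(g[5] for g in glyphs),
            ),
        })
    return dataset
```

The `box` tuple uses the (left, upper, right, lower) order that Pillow's `Image.crop` accepts, so each record could be used to cut a line image out of the page image, assuming the coordinates are in that image's pixel space.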


2 participants