Fix list content duplication #396

Open: wants to merge 2 commits into base: master

adityachandak287

Issue #, if available: #391

Description of changes:

What?

Prevent duplication of list contents.

How?

Exclude all LAYOUT_* elements that are children of LAYOUT_LIST elements when returning layout blocks from _get_layout_blocks.
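
For illustration only, the filtering idea can be sketched on plain Textract-style block dicts. This is not the code in this PR or the body of `_get_layout_blocks`; the helper name `top_level_layout_blocks` and the sample block Ids are hypothetical, and the sketch assumes each block carries `Id`, `BlockType`, and an optional `Relationships` list with `CHILD` entries, as in the Textract JSON response.

```python
from typing import Any, Dict, List

Block = Dict[str, Any]


def top_level_layout_blocks(blocks: List[Block]) -> List[Block]:
    """Return LAYOUT_* blocks, skipping those that are children of a LAYOUT_LIST.

    Hypothetical helper: a sketch of the idea, not the library's implementation.
    """
    layout_blocks = [
        b for b in blocks if b.get("BlockType", "").startswith("LAYOUT_")
    ]
    layout_ids = {b["Id"] for b in layout_blocks}

    # Collect Ids of layout blocks referenced as CHILD of any LAYOUT_LIST.
    list_child_ids = set()
    for block in layout_blocks:
        if block["BlockType"] != "LAYOUT_LIST":
            continue
        for rel in block.get("Relationships") or []:
            if rel.get("Type") == "CHILD":
                list_child_ids.update(
                    i for i in rel.get("Ids", []) if i in layout_ids
                )

    # Keep document order; drop only the list's own layout children.
    return [b for b in layout_blocks if b["Id"] not in list_child_ids]


# Made-up sample: one list with two text children, plus one standalone text.
sample_blocks = [
    {"Id": "list-1", "BlockType": "LAYOUT_LIST",
     "Relationships": [{"Type": "CHILD", "Ids": ["text-1", "text-2"]}]},
    {"Id": "text-1", "BlockType": "LAYOUT_TEXT",
     "Relationships": [{"Type": "CHILD", "Ids": ["line-1"]}]},
    {"Id": "text-2", "BlockType": "LAYOUT_TEXT",
     "Relationships": [{"Type": "CHILD", "Ids": ["line-2"]}]},
    {"Id": "text-3", "BlockType": "LAYOUT_TEXT",
     "Relationships": [{"Type": "CHILD", "Ids": ["line-3"]}]},
    {"Id": "line-1", "BlockType": "LINE"},
    {"Id": "line-2", "BlockType": "LINE"},
    {"Id": "line-3", "BlockType": "LINE"},
]

kept_ids = [b["Id"] for b in top_level_layout_blocks(sample_blocks)]
print(kept_ids)
```

In this sample the list and the standalone text block are kept, while the two text blocks nested under the list are dropped, so their content would be emitted once (through the list) rather than twice.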

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

LAYOUT_LIST elements can have LAYOUT_TEXT elements as children; see
https://docs.aws.amazon.com/textract/latest/dg/layoutresponse.html.
Exclude LAYOUT_TEXT elements that are children of LAYOUT_LIST elements
from the list of layouts returned by LinearizeLayout._get_layout_blocks.