Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Full visit without any selection criteria is dropping handful of documents with discrepancy in documentCount #32645

Open
nehajatav opened this issue Oct 24, 2024 · 2 comments
Assignees

Comments

@nehajatav
Copy link

Describe the bug
Every time we run a full visit of the cluster we are losing out on few documents exactly same in number as the #documents misreported in documentCount of response. The missed documents were not consistently the same, but they were not created nor reindexed during the visit (created in 2021)

To Reproduce
Run full visit using slices
Very few visit calls results in #documents in json lesser than documentCount json key provided in visit response

Expected behavior
We expected the missed document to be there in a full visit run esp since the document was an old one and not recently created

Environment (please complete the following information):

  • OS: RHEL8
  • Infrastructure: Podman

Vespa version
8.332.5

@vekterli
Copy link
Member

From my understanding this is observed when using stream=true, correct? Does this also happen if stream=false, i.e. buffered response mode?

@nehajatav
Copy link
Author

We only use stream=true

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

3 participants