feat: Integrate Weaviate DB #642
I'm confused about how we will add data to Weaviate. Weaviate needs data in dictionary format, whereas we chunk the data given by users and store it as a list of documents. So I need help here :)
Hey @Dev-Khant, can you tell me more about what help you need here?
Hi @taranjeet, since every data source is chunked into documents, I'm confused about how to define a schema for every use case.
Hey @Dev-Khant, I am not sure I follow the issue here. In Chroma DB, we do chunking, and each chunk has a unique ID. This means a single document can have multiple chunks in the DB. Weaviate seems quite similar to me in that sense, or maybe I have completely missed something. Do you mind elaborating? Happy to hop on a call to discuss further.
Hi @rupeshbansal, if you follow https://weaviate.io/developers/weaviate/manage-data/create and https://weaviate.io/developers/weaviate/manage-data/import, then according to them I don't think a single document can be pushed. And even if we somehow push it, I'm not sure it will be able to query relevant data. And sure, let me know if you want to discuss over a call.
That's correct, a single document cannot be pushed, which is why the data has to be chunked before pushing. You are correct in saying that each chunk will then be mapped to a different ID. But all this holds true for any database. Embedchain has a concept of chunkers, which use a basic technique of recursively breaking the context into chunks of equal size and pushing them. Do you mind if I take over this task of integrating Weaviate, and maybe you can try integrating some other databases? Let me know if you want to continue with this.
Yes, you can pick this up. I wanted to learn how Weaviate works with this type of data, but I'll go through your code once you build it. I'll look for other databases.
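For anyone following the thread, here is a minimal sketch of the chunk-then-store idea discussed above: one generic schema with a text property can cover every data source, because each chunk becomes its own dictionary with its own ID. Everything named here is an assumption for illustration and not Embedchain's or this PR's actual code: the class name `EmbedchainChunk`, the property names, and the helpers `chunk_text` and `to_objects` are hypothetical, and the fixed-size split is a simpler stand-in for the recursive chunkers mentioned in the thread. The schema dictionary follows the general shape of a Weaviate class definition as I understand it, so check it against the Weaviate docs before relying on it.

```python
import hashlib
from typing import Dict, List

# Hypothetical generic schema: one class for all data sources, so no
# per-use-case schema is needed. Names are illustrative assumptions.
CHUNK_SCHEMA = {
    "class": "EmbedchainChunk",
    "properties": [
        {"name": "text", "dataType": ["text"]},
        {"name": "source", "dataType": ["text"]},
        {"name": "chunk_index", "dataType": ["int"]},
    ],
}


def chunk_text(text: str, chunk_size: int = 200) -> List[str]:
    """Split text into consecutive pieces of at most chunk_size characters.

    This is a plain fixed-size split, not the recursive splitting
    that Embedchain's chunkers perform.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def to_objects(text: str, source: str, chunk_size: int = 200) -> List[Dict]:
    """Turn one document into a list of dictionaries, one per chunk."""
    objects = []
    for index, chunk in enumerate(chunk_text(text, chunk_size)):
        # Derive a deterministic ID from the source, position, and content,
        # so each chunk of the same document gets a different ID.
        raw = f"{source}:{index}:{chunk}".encode("utf-8")
        objects.append(
            {
                "id": hashlib.sha256(raw).hexdigest(),
                "text": chunk,
                "source": source,
                "chunk_index": index,
            }
        )
    return objects
```

Each dictionary returned by `to_objects` matches the properties in `CHUNK_SCHEMA`, so a list of documents and a dictionary-per-object store are not in conflict: the list is just iterated and each entry pushed as its own object.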
Description
Add support for Weaviate database.
Fixes #436
Type of change
Please delete options that are not relevant.
How Has This Been Tested?
Not yet
Checklist: