Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

There is no archive for the manually reconciled entries for MusicBrainz #194

Closed
candlecao opened this issue Sep 23, 2024 · 5 comments
Closed
Assignees
Labels
Priority: high high priority

Comments

@candlecao
Copy link
Contributor

candlecao commented Sep 23, 2024

This issue follows #186
Especially those "sub-types" for the entities are more or less reconciled manually. For example,
e.g.: type of area:

<style> </style>
type type_uri
city https://www.wikidata.org/wiki/Q515
country https://www.wikidata.org/wiki/Q6256
county https://www.wikidata.org/wiki/Q28575
district https://www.wikidata.org/wiki/Q149621
Indigenous territory / reserve  
island https://www.wikidata.org/wiki/Q23442
mahakuma https://www.wikidata.org/wiki/Q15637757
military base https://www.wikidata.org/wiki/Q245016
municipality https://www.wikidata.org/wiki/Q15284
@candlecao candlecao added the Priority: high high priority label Sep 23, 2024
@candlecao candlecao self-assigned this Sep 23, 2024
@candlecao candlecao changed the title There are no record for the manually reconciled entries for MusicBrainz There is no archive for the manually reconciled entries for MusicBrainz Sep 23, 2024
@candlecao
Copy link
Contributor Author

The archive should be put in linkedmusic-datalake/musicbrainz/data/reconciledEntries/archiveForManuallyReconciledEntries.xlsx

@dchiller
Copy link
Contributor

Does OpenRefine not have an output for this?

@candlecao
Copy link
Contributor Author

Does OpenRefine not have an output for this?

I believe OpenRefine can output this. But what I'm suggesting is that we store those manually reconciled records. This way, in the future, when updating the database, we don't need to manually reconcile them again.

@dchiller
Copy link
Contributor

I don't think I totally understand the use case. Is the point that we have reconciled the various entities listed above (city, district, etc.) in a particular dataset with Wikidata, we have done this reconciliation manually, and we want to be able to repeat that reconciliation with updated data/new datasets, etc?

How do we currently support reconciliation of updated data?

@candlecao
Copy link
Contributor Author

https://github.com/DDMAL/linkedmusic-datalake/tree/main/ArchiveForReconciledEntries
Please check this "archive"--as a particular dataset you mentioned.

Is the point that we have reconciled the various entities listed above (city, district, etc.) in a particular dataset with Wikidata, we have done this reconciliation manually,
--Yes.

and we want to be able to repeat that reconciliation with updated data/new datasets
--I don't quite understand this sentence. Anyway, we have this archive so that next time we won't need to manually reconcile again.

How do we currently support reconciliation of updated data?
--This is an issue to be solved or in further discussion: #144

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Priority: high high priority
Projects
None yet
Development

No branches or pull requests

2 participants