Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update content type for Web Archive Crawl objects #486

Open
edsu opened this issue Jun 21, 2022 · 1 comment
Open

Update content type for Web Archive Crawl objects #486

edsu opened this issue Jun 21, 2022 · 1 comment
Labels
web archiving for June-July 2022 work cycle

Comments

@edsu
Copy link
Contributor

edsu commented Jun 21, 2022

Due to improvements in wasCrawlDissemination it is now possible to browse Web Archive content using the webarchive-binary content-type:

https://argo.stanford.edu/catalog?f%5Bcontent_type_ssim%5D%5B%5D=webarchive-binary

However prior to this change this type was being overwritten with content type file. You can see some of these here:

https://argo.stanford.edu/catalog?f%5Bcontent_type_ssim%5D%5B%5D=file&q=web+archive&search_field=text

For consistency and to ease discoverability we should rewrite these content-types to be webarchive-binary.

@edsu edsu added the web archiving for June-July 2022 work cycle label Jun 21, 2022
@mjgiarlo
Copy link
Member

@andrewjbtw believes this could be handled via a bulk action. Would need work in Argo.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
web archiving for June-July 2022 work cycle
Projects
None yet
Development

No branches or pull requests

2 participants