You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Create an index of DOIs in VuFind. DOIs occur in some 856 fields, and in 024 fields.
We are interested in indexing 024|a fields for 024 cases where 024|2 is equal to ‘doi’. We also need to index 856|u fields that contain valid DOIs.
Some records may contain multiple 856|u’s with valid DOIs. Ex: 8883838 . We should index each DOI in this case.
The standard syntax for DOIs can be found in Bib # 2352930, and the value is http://dx.doi.org/10.1787/16812328. In this case, we can still strip the string http://dx.doi.org/. Only the portion starting with “10.*” is needed to retrieve the material.
DOIs in the 856|u can occur in a variety of non-standard syntaxes. Bib # 11761529 has an 856|u with the value http://link.springer.com/10.1007/978-981-10-6026-7 . In this case, 10.1007/978-981-10-6026-7 is the meaningful DOI value.
What kind of index should this be? That is, it seems like an exactly string match on the DOI itself. No text tokenizing or anything, just an exact string match on something like:
Create an index of DOIs in VuFind. DOIs occur in some 856 fields, and in 024 fields.
We are interested in indexing 024|a fields for 024 cases where 024|2 is equal to ‘doi’. We also need to index 856|u fields that contain valid DOIs.
Some records may contain multiple 856|u’s with valid DOIs. Ex: 8883838 . We should index each DOI in this case.
The standard syntax for DOIs can be found in Bib # 2352930, and the value is http://dx.doi.org/10.1787/16812328. In this case, we can still strip the string http://dx.doi.org/. Only the portion starting with “10.*” is needed to retrieve the material.
DOIs in the 856|u can occur in a variety of non-standard syntaxes. Bib # 11761529 has an 856|u with the value http://link.springer.com/10.1007/978-981-10-6026-7 . In this case, 10.1007/978-981-10-6026-7 is the meaningful DOI value.
Bib # 9130371 has an 856|u of http://onlinelibrary.wiley.com/book/10.1029/GM093 . This can be trimmed to 10.1029/GM093
There are regular expressions for filtering valid DOIs available here:
https://www.crossref.org/blog/dois-and-matching-regular-expressions/
We should use a Solr analyzer to similarly trim search terms that are directed to this index.
The text was updated successfully, but these errors were encountered: