TODO.md

Action Items

Bugs

  • ep3ds2citations, citations from CaltechDATA seem to be missing orcid even when present in the data.ds record.
  • eprint2rdm missing metadata attributes
    • journal related fields (e.g. journal:journal) in custom fields
    • thesis related fields
    • meeting:meeting in custom fields
    • migrate resolver id (eprint.IDNumber) to metadata.identifiers
    • Map CaltechTHESIS custom fields, issue #44
    • Group transfer problem, see issue #42
  • Local group items are including an empty "{}" as an entry when retrieved with ep3util (e.g. thesis repository, ep3util get_record 7320)
  • progress.go is producing nonsensical estimates of the time remaining, need to review my math (this was a really bad SQL query on my part)
  • rdmutil
    • Send to Community not working
    • Submit Draft not working
    • Review Draft not working
    • Submit Publish not working
    • Upload files to Draft
    • Delete files from Draft
    • Import Files to Draft
    • Put eprints 'suggests' private notes into review comments, see issue #16
    • Do file mapping, see issue #3
  • migrate_record.py (running Python Fixup code)
    • resolver id migrated
    • migrate DOI to metadata.identifiers if already in repository, issue #15
  • doi2rdm
    • Migrate monographs from CrossRef more effectively, see issue #40
    • figure out how to transform mml markup, issue #36
  • rdm2eprint should always populate the official URL. In some cases it shows up as /records/{rdmid} and in others it is populated, but when I checked authors record efd3g-p2669 it wasn't populated in the JSON output
  • ep3ds2citations, authors array isn't including orcid and clpid as found in authors.ds, thesis.ds and data.ds
  • citation record 'id' is winding up with keys like 'authors:authors:XXXXX-XXXXX' for CaltechAUTHORS, CaltechDATA and then 'caltechthesis:thesis:XXXX' for CaltechTHESIS.
  • ep3ds2citations, publication_date isn't showing up in the stored JSON
  • Need the ability to apply an explicit prefix to keys ingested by ep3ds2citation; this avoids the problem where some records pick up an EPrint collection name versus the dataset collection name fallback
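The explicit key prefix idea in the last few items could look something like the sketch below. It is only an illustration in Python, not code from irdmtools: the function name `apply_key_prefix` is hypothetical, and it assumes the record identifier is the final colon-separated segment of the key and never contains a colon itself.

```python
def apply_key_prefix(key: str, prefix: str) -> str:
    """Return key with an explicit prefix, replacing any prefix it already has.

    Hypothetical helper. Assumes the record id is the last ':' separated
    segment, e.g. 'authors:authors:abcde-12345' has the id 'abcde-12345'.
    """
    # Drop whatever collection names were picked up during ingest.
    local_id = key.rsplit(":", 1)[-1]
    # An empty prefix means "store the bare id".
    return f"{prefix}:{local_id}" if prefix else local_id


if __name__ == "__main__":
    for k in ("authors:authors:abcde-12345", "caltechthesis:thesis:7320", "7320"):
        print(apply_key_prefix(k, "thesis"))
```

With a rule like this, every key ends up in one `prefix:id` shape regardless of whether the ingest picked up the EPrint collection name or the dataset collection name.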

Next

  • irdmtools Go based commands should not use the RDM JSON API, they should always go direct to the Postgres database
  • ep3ds2citation needs to be able to work from a key list or JSON list of keys. When working from a key list it should read the entire key list in, then start processing the keys and display progress
  • Integrate a YAML options file into doi2rdm so that we can easily map our customized mappings via configuration instead of hard-coding them.
  • rdmutil get_all_ids needs a get_all_stale_ids counterpart, see issue #68 (implemented get_record_versions)
  • add put_record to rdmutil, actually done as many steps, new_record, new_draft, update_draft, ...
  • Implement a CrossRef to Invenio RDM record
  • Figure out a faster way to retrieve RDM ids without using the API or OAI-PMH. Possible options would be to create an rdmapid service, or direct query via PostgreSQL.
    • PostgREST can provide a RESTful JSON API to our Invenio RDM data stored in Postgres
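The key list item above (read the whole list first, then process and report progress) can be sketched roughly as follows. This is a Python illustration under stated assumptions, not the ep3ds2citation implementation: the names `read_keys`, `estimate_remaining` and `process_keys` are hypothetical, the plain key list is assumed to be one key per line, and the time remaining is estimated from the average time per key processed so far.

```python
import json
import sys
import time
from typing import Callable, List


def read_keys(text: str) -> List[str]:
    """Read every key up front from a JSON array or a one-key-per-line list."""
    text = text.strip()
    if text.startswith("["):
        # JSON list of keys, e.g. '["key1", "key2"]'
        return [str(k) for k in json.loads(text)]
    # Plain key list: one key per line, blank lines ignored (an assumption).
    return [line.strip() for line in text.splitlines() if line.strip()]


def estimate_remaining(elapsed: float, done: int, total: int) -> float:
    """Seconds remaining, using the average time per key processed so far."""
    if done <= 0:
        return float("inf")  # nothing processed yet, no basis for an estimate
    return (elapsed / done) * (total - done)


def process_keys(keys: List[str], handle: Callable[[str], None],
                 report_every: int = 100, out=sys.stderr) -> int:
    """Process each key in order, reporting progress every report_every keys."""
    total = len(keys)
    start = time.monotonic()
    for i, key in enumerate(keys, start=1):
        handle(key)
        if i % report_every == 0 or i == total:
            remaining = estimate_remaining(time.monotonic() - start, i, total)
            print(f"{i}/{total} processed, about {remaining:.0f}s remaining", file=out)
    return total


if __name__ == "__main__":
    # Usage: python keys.py < keys.txt
    process_keys(read_keys(sys.stdin.read()), lambda key: None)
```

Reading the full list before processing is what makes the total known, which in turn is what lets the progress line show a meaningful estimate.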

Someday, maybe

  • figure out a faster way to back up stats in RDM other than elasticdump, which takes a very long time (single thread, single CPU)