Add tabulation method to Outputs class #689

nfahlgren (Maintainer) · 2021-02-20T23:36:23Z

An issue that has come up a few times (most recently #688) is that a PlantCV workflow results file (JSON format) cannot be converted to CSV using the json2csv method in the utils subpackage (typically run with the script plantcv-utils.py json2csv). The json2csv function accepts a JSON-formatted file that is output by plantcv.parallel.process_results, which is not quite the same as a workflow results file. process_results reads an input directory of workflow results files and combines them into a single output JSON file. While processing the individual results files, process_results builds a global variable index (which lets us fill in missing values when we export to CSV later) and combines the observations into an aggregate list of "entities" (i.e., one set of results per image). A workflow results file has no variable index and no list of entities, since it contains only a single entity.
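To make the difference between the two file shapes concrete, here is a minimal sketch of the aggregation step described above. It is not the process_results implementation: the function names (combine_results, to_rows) are hypothetical, and the dictionary keys ("observations", "variables", "entities", "value") and the "NA" fill value are assumptions chosen for illustration.

```python
def combine_results(results):
    """Combine single-image result dicts into one aggregate structure.

    Key names here are illustrative assumptions, not the confirmed
    PlantCV file schema.
    """
    combined = {"variables": {}, "entities": []}
    for result in results:
        # Record every variable name seen in any result (the global index)
        for name in result.get("observations", {}):
            combined["variables"][name] = True
        # Each single-image result becomes one entity in the aggregate list
        combined["entities"].append(result)
    return combined


def to_rows(combined):
    """Flatten the aggregate into a header plus one row per entity."""
    header = sorted(combined["variables"])
    rows = []
    for entity in combined["entities"]:
        obs = entity.get("observations", {})
        # The global index tells us which columns to fill with "NA"
        rows.append([obs[name]["value"] if name in obs else "NA" for name in header])
    return header, rows


# Two toy single-image results; the second one lacks "height"
image1 = {"observations": {"area": {"value": 120}, "height": {"value": 15}}}
image2 = {"observations": {"area": {"value": 98}}}

header, rows = to_rows(combine_results([image1, image2]))
```

A single results dict like image1 has neither the variable index nor the entity list, which is why a converter written for the aggregate shape cannot read it directly.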

It is fairly common for folks to want a table of results for a single workflow. It is currently possible to hack this together by running plantcv.parallel.process_results and then json2csv, but that is not very user-friendly.

I propose that we begin to deprecate pcv.print_results and create one or more new methods in the Outputs class. One option is a single method that takes the file format as an input (pcv.outputs.save_results(filename, format)); an alternative is a separate method for each format (pcv.outputs.save_json(filename) and pcv.outputs.save_csv(filename)). For an interim period, pcv.print_results() could continue to work but emit a deprecation warning.
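As a rough sketch of the first option, the single-method design with an interim deprecation wrapper might look something like the following. The Outputs class here is a simplified stand-in and not the PlantCV class; the two-argument add_observation, the outformat parameter name (used instead of format to avoid shadowing the Python built-in), and the two-column CSV layout are all assumptions made for illustration.

```python
import csv
import json
import warnings


class Outputs:
    """Simplified stand-in for illustration only; not the PlantCV class."""

    def __init__(self):
        self.observations = {}

    def add_observation(self, variable, value):
        # Hypothetical reduced signature: store one value per variable
        self.observations[variable] = {"value": value}

    def save_results(self, filename, outformat="json"):
        """Write the stored observations as JSON or as a simple CSV table."""
        if outformat == "json":
            with open(filename, "w") as f:
                json.dump({"observations": self.observations}, f)
        elif outformat == "csv":
            with open(filename, "w", newline="") as f:
                writer = csv.writer(f)
                writer.writerow(["variable", "value"])
                for variable, obs in self.observations.items():
                    writer.writerow([variable, obs["value"]])
        else:
            raise ValueError(f"Unsupported format: {outformat}")


outputs = Outputs()


def print_results(filename):
    """Interim wrapper: keeps working but warns that it is deprecated."""
    warnings.warn(
        "print_results is deprecated; use outputs.save_results instead",
        DeprecationWarning,
        stacklevel=2,
    )
    outputs.save_results(filename, outformat="json")
```

With this layout, existing workflows that call print_results keep producing the same JSON file during the interim period, while new workflows can request CSV directly with outputs.save_results(filename, outformat="csv").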
