Assumption of sorting data for sklearn agents is too strong. #31

maffettone · 2023-12-13T22:27:53Z

Decomposition generally performs better and is more intuitive when the data is sorted. However, the current sorting assumption in DecompositionAgentBase
and ClusterAgentBase only works for 1-d independent data.

Expected Behavior

Work for n-d independent variables.
Potentially skip sorting for clustering?

Current Behavior

Sorting independent variables with more than one dimension raises a ValueError.
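
A minimal reproduction sketch, assuming the caches hold NumPy arrays (the issue does not say so explicitly; the variable names and data here are made up for illustration). Sorting tuples forces Python to compare the first elements, and comparing two multi-element arrays yields an array whose truth value is ambiguous:

```python
import numpy as np


def raises_value_error(independents, observables):
    """Return True if naive tuple sorting of the paired caches fails."""
    try:
        sorted(zip(independents, observables))
    except ValueError:
        # NumPy refuses to reduce an element-wise comparison to one bool.
        return True
    return False


# Hypothetical 1-d case: each independent is a single-element array.
one_d = [np.array([2.0]), np.array([1.0])]
# Hypothetical 2-d case: each independent is a point with two coordinates.
two_d = [np.array([1.0, 2.0]), np.array([0.0, 5.0])]
obs = [np.array([10.0]), np.array([20.0])]

print(raises_value_error(one_d, obs))
print(raises_value_error(two_d, obs))
```

If the caches store something other than arrays, the failure mode may differ.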

Possible Solution

A partial solution for the 2-d case, which still needs to be extended to n-d:

try:
    sorted_independents, sorted_observables = zip(
        *sorted(zip(self.independent_cache, self.observable_cache))
    )
except ValueError:
    # Multidimensional case
    sorted_independents, sorted_observables = zip(
        *sorted(zip(self.independent_cache, self.observable_cache), key=lambda x: (x[0][0], x[0][1]))
    )
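
One possible way to extend the snippet above to n-d is sketched here. It is not the project's implementation: `sort_caches` and `_as_key` are hypothetical names, and the sketch assumes each independent is either a scalar or a flat sequence of numbers (list, tuple, or 1-d array). Each point is converted to a plain tuple of floats, so the comparison is lexicographic over all coordinates and never compares the observables.

```python
from collections.abc import Iterable


def _as_key(point):
    """Convert a scalar or flat sequence into a tuple of floats."""
    if isinstance(point, Iterable):
        return tuple(float(v) for v in point)
    return (float(point),)


def sort_caches(independents, observables):
    """Sort paired caches lexicographically by the independent variable."""
    keys = [_as_key(p) for p in independents]
    # Sort indices by key only, so ties never fall through to the observables.
    order = sorted(range(len(keys)), key=keys.__getitem__)
    sorted_independents = [independents[i] for i in order]
    sorted_observables = [observables[i] for i in order]
    return sorted_independents, sorted_observables


# Hypothetical 3-d independents paired with labelled observables.
points = [(1.0, 2.0, 0.0), (0.0, 5.0, 1.0), (1.0, 0.0, 3.0)]
values = ["a", "b", "c"]
print(sort_caches(points, values))
```

Lexicographic order is only one choice. For n-d data there is no single natural ordering, so whether any sort helps decomposition in higher dimensions is left open here.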

maffettone self-assigned this Dec 13, 2023
