[BUG] Linear regression model predicts NaN values only #3210

wrigleyDan · 2024-11-11T07:37:46Z

What is the bug?
I trained a linear regression model with 5000 features and apparently when calling the _predict API only NaN values are returned.

I cannot exclude that I'm using parameters that are not ideal and as a consequence lead to the NaN predictions. I unsuccessfully tried smaller learning rates but did not experiment with all available parameters and parameter values.

How can one reproduce the bug?
Steps to reproduce the behavior:

Get the features at https://gist.github.com/wrigleyDan/a83a5d8294aa0ed493e4feb8cc9d7433
Get the notebook to see how I ingest the data, train a model, predict a value: https://gist.github.com/wrigleyDan/16deb9cd8201ec502acda036c0b150b5
Run the notebook with the feature data
See NaN as the predicted value

What is the expected behavior?
The expected behavior is to receive not only NaN values but reasonable predictions, in the given example values between 0 and 1.

What is your host/environment?

OpenSearch v 2.16.0

Do you have any screenshots?
See the linked Gist with a notebook example and the data used as features.

Do you have any additional context?
Initially reported in the #ml OpenSearch Slack channel: https://opensearch.slack.com/archives/C05BGJ1N264/p1731077205560749

The text was updated successfully, but these errors were encountered:

b4sjoo · 2024-11-19T18:29:43Z

Taking a look

wrigleyDan · 2024-11-27T06:51:10Z

Any news on this one @b4sjoo?

dhrubo-os · 2024-12-03T18:36:17Z

@b4sjoo any update on this?

rithin-pullela-aws · 2024-12-16T22:24:52Z

@dhrubo-os can you please assign this issue to me?

dhrubo-os · 2024-12-16T22:42:48Z

@rithin-pullela-aws I just assigned to you. Thanks for looking into this.

rithin-pullela-aws · 2024-12-19T18:45:03Z

Hi @wrigleyDan, experimenting with different optimiser and learning rates results in better model weights and responses.

I used ADA_GRAD and got the output between 0 and 1:

url = "http://localhost:9200/_plugins/_ml/_train/linear_regression"

payload = {
    "parameters": {
      "target": "neuralness",
      "learningRate": 0.01,
      "optimiser": "ADA_GRAD"
    },
    "input_query": {
        "_source": ["neuralness", "f_1_num_of_terms", "f_2_query_length", "f_3_has_numbers", "f_4_has_special_char", "f_5_num_results",
                    "f_6_max_title_score", "f_7_sum_title_scores", "f_8_max_semantic_score", "f_9_avg_semantic_score"],
        "size": 10000
    },
    "input_index": [
        "features"
    ]
}

response = requests.request("POST", url, headers=headers, data=json.dumps(payload))
print(response.json())
linear_model_id = response.json()['model_id']
print(f"Created model {linear_model_id}")

Open Search uses Tribuo to perform linear regression, please find this bug report on Tribuo for better explanation.

Craigacp · 2024-12-20T02:09:09Z

For Tribuo's linear regressions, it's probably better to default to using AdaGrad with some reasonable learning rate rather than a constant learning rate SGD as it's very tricky to tune that correctly. We provide a default LogisticRegressionTrainer which uses AdaGrad and other default parameter choices, but we didn't provide one for linear regression (mostly because that didn't appear in our demos as much as logistic regression did).

wrigleyDan added bug Something isn't working untriaged labels Nov 11, 2024

wrigleyDan mentioned this issue Nov 11, 2024

Explore model inference options in OpenSearch o19s/opensearch-hybrid-search-optimization#5

Closed

b4sjoo removed the untriaged label Nov 19, 2024

b4sjoo self-assigned this Nov 19, 2024

dhrubo-os added this to ml-commons projects Dec 3, 2024

dhrubo-os moved this to In Progress in ml-commons projects Dec 3, 2024

dhrubo-os assigned rithin-pullela-aws and unassigned b4sjoo Dec 16, 2024

rithin-pullela-aws linked a pull request Dec 23, 2024 that will close this issue

Use Adagrad optimiser for Linear regression by default #3291

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Linear regression model predicts NaN values only #3210

[BUG] Linear regression model predicts NaN values only #3210

wrigleyDan commented Nov 11, 2024

b4sjoo commented Nov 19, 2024

wrigleyDan commented Nov 27, 2024

dhrubo-os commented Dec 3, 2024

rithin-pullela-aws commented Dec 16, 2024

dhrubo-os commented Dec 16, 2024

rithin-pullela-aws commented Dec 19, 2024

Craigacp commented Dec 20, 2024

[BUG] Linear regression model predicts NaN values only #3210

[BUG] Linear regression model predicts NaN values only #3210

Comments

wrigleyDan commented Nov 11, 2024

b4sjoo commented Nov 19, 2024

wrigleyDan commented Nov 27, 2024

dhrubo-os commented Dec 3, 2024

rithin-pullela-aws commented Dec 16, 2024

dhrubo-os commented Dec 16, 2024

rithin-pullela-aws commented Dec 19, 2024

Craigacp commented Dec 20, 2024