GaussNewton params and cost out of sync in state #342

gmilleramilar · 2023-03-21T22:34:13Z

    fn next_iter(
        &mut self,
        problem: &mut Problem<O>,
        state: IterState<P, (), J, (), F>,
    ) -> Result<(IterState<P, (), J, (), F>, Option<KV>), Error> {
        let param = state.get_param().ok_or_else(argmin_error_closure!(
            NotInitialized,
            concat!(
                "`GaussNewton` requires an initial parameter vector. ",
                "Please provide an initial guess via `Executor`s `configure` method."
            )
        ))?;
        let residuals = problem.apply(param)?;
        let jacobian = problem.jacobian(param)?;

        let p = jacobian
            .clone()
            .t()
            .dot(&jacobian)
            .inv()?
            .dot(&jacobian.t().dot(&residuals));

        let new_param = param.sub(&p.mul(&self.gamma));

        Ok((state.param(new_param).cost(residuals.l2_norm()), None))
    }

The code above from gaussnewton_method.rs. Here, residuals are calculated calling apply(params). Those residuals are used to create a new parameter vector. However there is no new set of residuals calculated. So at the end of the function, the new parameter vector and the old residuals (reduced to a cost) are stored in the state. This causes problems for early termination schemes dependent on a correct cost value (like tolerance).

The text was updated successfully, but these errors were encountered:

stefan-k · 2023-03-22T07:10:49Z

Hi,

thanks for reporting this! That's a very good point. I'd like to avoid computing the residuals again at the end of the function just for the cost. I think IterState needs to be able to carry the residuals as well. I'll have to think about this.

gmilleramilar · 2023-03-30T14:20:22Z

Yeah, I think that's probably right. In our application re-computing the residuals is quite expensive. Would this require you to make IterState generic over the type of the residuals?

stefan-k · 2023-03-31T11:41:27Z

Sorry that I wasn't able to work on this, I'm swamped with other responsibilities lately.

Would this require you to make IterState generic over the type of the residuals?

Yes, probably. Well, at least I think that this would be the approach that would make the most sense. Alternatively one could just say that residuals must be of the same type as the parameter vectors. This will probably work well in most cases but will be a difficult to justify limitation in other cases.

stefan-k · 2024-01-17T10:21:48Z

Thanks again for the bug report and for the initial implementation of residual handling in IterState (#343). I've extended upon your work and fixed the bug in GaussNewton in #392. The stopping criteria should now work properly. Let me know if you run into any issues :)

stefan-k added the bug Something isn't working label Mar 22, 2023

gmilleramilar mentioned this issue Mar 31, 2023

Keep track of residuals in IterState, adapted GaussNewton accordingly #343

Merged

stefan-k closed this as completed Jan 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GaussNewton params and cost out of sync in state #342

GaussNewton params and cost out of sync in state #342

gmilleramilar commented Mar 21, 2023 •

edited

Loading

stefan-k commented Mar 22, 2023

gmilleramilar commented Mar 30, 2023

stefan-k commented Mar 31, 2023

stefan-k commented Jan 17, 2024

GaussNewton params and cost out of sync in state #342

GaussNewton params and cost out of sync in state #342

Comments

gmilleramilar commented Mar 21, 2023 • edited Loading

stefan-k commented Mar 22, 2023

gmilleramilar commented Mar 30, 2023

stefan-k commented Mar 31, 2023

stefan-k commented Jan 17, 2024

gmilleramilar commented Mar 21, 2023 •

edited

Loading