Fix actions feedback race #2612

mauropasse · 2024-08-27T11:29:21Z

Fixes issue mentioned in #2451.

When the ActionClient executes, action feedback and status take precedence over action results, so if the client spins more slowly than the server publishes feedback, it never processes the action result (it stays busy processing feedback).

This PR also adds a unit test to verify that the fix works. The test also covers many other action interactions.
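The starvation described above can be illustrated with a toy model. This is only a sketch, not the rclcpp_action implementation: `ToyClient`, `spin_once` and `simulate` are hypothetical names, and the model assumes the client handles at most one ready entity per spin and that feedback is checked before the result.

```cpp
#include <cassert>
#include <cstddef>

// Simplified model (an assumption, not rclcpp_action code): each call to
// spin_once() handles at most one ready entity, checking feedback first.
struct ToyClient {
  std::size_t pending_feedback = 0;
  bool pending_result = false;
  std::size_t feedback_handled = 0;
  bool result_handled = false;

  void spin_once() {
    if (pending_feedback > 0) {  // feedback wins whenever it is ready
      --pending_feedback;
      ++feedback_handled;
      return;
    }
    if (pending_result) {  // only reached when no feedback is waiting
      pending_result = false;
      result_handled = true;
    }
  }
};

// Runs `spins` client iterations while `feedback_per_spin` feedback messages
// arrive between consecutive spins. The result is already waiting when the
// loop starts.
ToyClient simulate(std::size_t spins, std::size_t feedback_per_spin) {
  ToyClient client;
  client.pending_result = true;
  for (std::size_t i = 0; i < spins; ++i) {
    client.pending_feedback += feedback_per_spin;
    client.spin_once();
  }
  return client;
}
```

In this model, `simulate(1000, 2)` ends with `result_handled` still false because every spin finds feedback waiting, while `simulate(1, 0)` handles the result on the first spin.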

Signed-off-by: Mauro Passerino <[email protected]>

fujitatomoya · 2024-08-27T17:01:49Z

this PR includes #2471 with latest code base, so i will close #2471.

jmachowinski · 2024-08-29T10:42:10Z

Hard no-go from my side. This introduces a new bug: you would no longer receive all feedback before the result.
See rclcpp_action.TestClientAgainstServer.async_send_goal_with_feedback_callback_wait_for_result for more details.

fujitatomoya · 2024-08-30T00:43:59Z

even though https://design.ros2.org/articles/actions.html#clientserver-interaction-examples does not specify that feedback messages need to be delivered before the result, all the examples tell me the result comes to the client after the feedback messages. besides, if a feedback message comes after the result response, that is strange behavior; i am not sure how an application is supposed to process that message in the callback.

and i think this action design and requirement is already broken with the current implementation. because rmw uses different channels for feedback and result, the messages are not queued in order, which means there is always a possibility that feedback messages come after the result response. (changing the order in which the data is taken can mitigate this a bit, but it is not a perfect solution...)

IMO, if we can make it better for the user experience, changing the order would be acceptable? maybe the order needs to be reconsidered carefully? (goal response->cancel response->feedback->status->result response)

any thoughts? @mauropasse @jmachowinski @ahcorde @alsora

mauropasse · 2024-08-30T11:27:17Z

My thoughts are that ROS 2 actions' feedback and status messages are useful during the action's execution, as they allow the user to understand how the process is progressing.

However, the responses (goal/cancel/result) are the final pieces of information when an action has completed, and these should be considered the most important. Feedback and status messages received by the client after the action has finished could (should?) be ignored if the user already knows the action's outcome.

This is why I think responses should be prioritized over feedback. Moreover, a response is sent only once, whereas there can be a large number of feedback messages.

I also want to point out that this issue affects only the single-threaded executor. It does not impact the events executor, as events are processed in the order they are generated.

Also I'm unsure about judging the correctness for the new proposed priority of execution, based on the results of previous tests that fail now?

fujitatomoya · 2024-08-30T15:31:23Z

@mauropasse thanks for your comment!

> Feedback and status messages received by the client after the action has finished could (should?) be ignored if the user already knows the action's outcome.

i am not sure yet whether this is by design. To be honest, i thought it is okay for feedback and status messages to come after the result (or even cancel), and either the ActionClient or the application can ignore them. Let's wait for more feedback on this.

> Also I'm unsure about judging the correctness for the new proposed priority of execution, based on the results of previous tests that fail now?

I believe that @jmachowinski just wanted to confirm this behavior, just like me. I think it is totally fine to change the test once the behavior is changed.

jmachowinski · 2024-08-31T10:22:04Z

I checked the initial bug report, and must say, the test itself is highly flawed.

What it comes down to is that you have a provider running at a higher frequency than the consumer can process.

From my point of view, the bug report makes wrong assumptions about how spin_some works / should work. To be fair, the documentation of the function is misleading, as you need really deep knowledge of the executor internals to know what 'Collect all work' really means. The obvious fix for the problem is to use spin_all.

As to the action code in general, as I stated before I think the design is highly flawed. But I don't see a 'simple' fix for the issue, like the one proposed here.

As to the importance of receiving the last feedback before the goal, I agree with @mauropasse that in a (our) real-world application, one can normally ignore feedback and it's not important at all. The problem with this change, though, is that it will break the tutorial https://docs.ros.org/en/jazzy/Tutorials/Intermediate/Writing-an-Action-Server-Client/Cpp.html and possibly break user code that relies on this behavior.

mauropasse · 2024-09-18T07:09:33Z

In my last commits I addressed comments from this PR.
I ended up lowering only the Feedback priority, so:

Feedback > Status > Goal Response > Result Response > Cancel Response (original)
Status > Goal Response > Result Response > Cancel Response > Feedback (new priority)

In this way:

The modifications to test_client.cpp needed for it to pass are minimal.
The demo ros2/demos/action_tutorials/action_tutorials_cpp/src/fibonacci_* has the same behavior as before.
There's still the risk of breaking user code.
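The two priority orders listed above can be sketched as lookup tables. This is a minimal illustration under assumptions: `Entity`, `Order`, `ReadySet` and `next_ready` are hypothetical names chosen for the example and are not taken from client.cpp.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

enum class Entity { Feedback, Status, GoalResponse, ResultResponse, CancelResponse };

using Order = std::array<Entity, 5>;

// Feedback > Status > Goal Response > Result Response > Cancel Response
constexpr Order kOriginalOrder = {
  Entity::Feedback, Entity::Status, Entity::GoalResponse,
  Entity::ResultResponse, Entity::CancelResponse};

// Status > Goal Response > Result Response > Cancel Response > Feedback
constexpr Order kNewOrder = {
  Entity::Status, Entity::GoalResponse, Entity::ResultResponse,
  Entity::CancelResponse, Entity::Feedback};

// ready[i] is true when the entity whose enum value is i has data waiting.
using ReadySet = std::array<bool, 5>;

// Writes the first ready entity in `order` to `out` and returns true,
// or returns false when nothing is ready.
bool next_ready(const Order & order, const ReadySet & ready, Entity & out) {
  for (Entity candidate : order) {
    if (ready[static_cast<std::size_t>(candidate)]) {
      out = candidate;
      return true;
    }
  }
  return false;
}
```

With both feedback and a result response ready, `kOriginalOrder` selects `Entity::Feedback` while `kNewOrder` selects `Entity::ResultResponse`, which is the difference the new priority is meant to make.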

fujitatomoya

overall lgtm, one comment.

rclcpp_action/test/test_client.cpp

fujitatomoya · 2024-10-29T20:57:14Z

@mauropasse can you also resolve @ahcorde's comments?

fujitatomoya · 2024-11-01T19:34:16Z

@jmachowinski since you commented on this, what do you think of the current change?

jmachowinski · 2024-11-04T15:46:46Z

I don't like this change. It does not really fix the problem itself, but instead fixes it for one obscure case.

I wonder if we can be smarter about which ready event to return next, instead of returning them in a fixed order.

Do we have access to the receive timestamp at that point?
Returning the event with the oldest timestamp seems a way better solution...

Or could we introduce a sequence number in the messages and use that to return the next ready event?
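The oldest-first idea above could look roughly like the sketch below. It assumes a receive timestamp is available for every ready event, which is exactly the open question in the comment; `ReadyEvent`, `receive_stamp_ns` and `oldest_ready` are hypothetical names and not part of rclcpp_action.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

enum class Entity { Feedback, Status, GoalResponse, ResultResponse, CancelResponse };

struct ReadyEvent {
  Entity entity;
  std::int64_t receive_stamp_ns;  // hypothetical receive timestamp
};

// Returns the index of the ready event with the smallest timestamp,
// or -1 when no event is ready. Ties go to the earlier index.
int oldest_ready(const std::vector<ReadyEvent> & events) {
  int best = -1;
  for (std::size_t i = 0; i < events.size(); ++i) {
    if (best < 0 ||
        events[i].receive_stamp_ns <
        events[static_cast<std::size_t>(best)].receive_stamp_ns) {
      best = static_cast<int>(i);
    }
  }
  return best;
}
```

The same selection would work with a per-message sequence number in place of `receive_stamp_ns`, provided the numbers are comparable across the feedback, status and response channels.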

Mauro Passerino added 2 commits August 27, 2024 12:19

Add test for actions

07dc81a

Signed-off-by: Mauro Passerino <[email protected]>

Fix actions feedback race

eb57f71

- See ros2#2451 Signed-off-by: Mauro Passerino <[email protected]>

mauropasse requested review from ivanpauno, hidmic and wjwwood as code owners August 27, 2024 11:29

mauropasse mentioned this pull request Aug 27, 2024

Humble backport and new fixes irobot-ros/rclcpp#154

Merged

ahcorde requested changes Aug 27, 2024

fujitatomoya mentioned this pull request Aug 27, 2024

rclcpp_action: take and execute service entities in priority. #2471

Closed

alsora reviewed Aug 28, 2024

rclcpp_action/src/client.cpp

fujitatomoya reviewed Aug 28, 2024

rclcpp_action/src/client.cpp

rclcpp_action/test/test_actions.hpp (outdated)

rclcpp_action/CMakeLists.txt (outdated)

rclcpp_action/test/test_actions.cpp (outdated)

Mauro Passerino added 2 commits September 18, 2024 07:38

Address PR comments and fix test

13a27f6

Signed-off-by: Mauro Passerino <[email protected]>

add comment

e66d6e5

Signed-off-by: Mauro Passerino <[email protected]>

wjwwood assigned mauropasse Oct 3, 2024

fix cpplint

ca5856d

Signed-off-by: Mauro Passerino <[email protected]>

fujitatomoya reviewed Oct 29, 2024

rclcpp_action/test/test_client.cpp (outdated)

Test: Correct feedback expected count

1888892

Signed-off-by: Mauro Passerino <[email protected]>

mauropasse requested a review from ahcorde November 1, 2024 18:38
