Simplify getInputPortSignal and getOutputPortSignal regarding dynamic memory allocations? #27

traversaro · 2019-01-26T15:19:04Z

For several reasons, typically involving performance (due to use of system calls and violation of cache locality) but also determinism on some hard real-time OS, a good practice is to avoid, sometimes it is considered good practice to avoid any kind of dynamic memory allocation in code that will be called in "tight loop", for example the code that is called at every step of a control, simulation or optimization algorithm.

Related to that, I notice that the current BlockFactory API induces the users to perform two dynamic memory allocations whenever, from an output method, and input or output signal is accessed. The situation could get worse if eventually tackle #8, as some of the same signal would need to be accessed also from the updateDiscreteState and the BlockFactory equivalent of mdlDerivatives. Note that this two memory allocation are avoided in the "CoderBlockInformation" implementation only because in that case the IO signals are stored internally already as std::shared_ptr<Signal>, but for example are necessary in the SimulinkBlockInformation implementation (and similarly they will probably be present in <something>BlockInformation implementations for which the addresses of IO signals could change at each simulation step).

These two dynamic memory allocations are caused by:

Use of pimpl pattern in blockfactory::core::Signal (added in diegoferigo/WB-Toolbox@75c1d36#diff-e32b9f2c904cb22f7a9187e2a76861a3). I guess this change can be relatively safely reverted, if we think it is the case.
Use of std::shared_ptr<Signal> to return a signal in getInputPortSignal and getOutputPortSignal , in place of standard copy on the stack, that in most cases will be optimized away due to Return Value Optimization (RVO). This it is a more complicated change, as it was explicitly introduced in Signal const correctness wb-toolbox#126 to ensure that input data is only read and not wrote without introducing a new type ConstSignal or Signal<isConst>.

The change 2) is definitely more complicated (if we want to continue to enfore the const correctnes of input signals), while 1) should be straightforward.

The text was updated successfully, but these errors were encountered:

traversaro · 2019-01-26T16:20:31Z

For solving 2), a good solution may be to wait for when we will adopt C++17, and switch to use std::optional<Signal> and std::optional<const Signal>, to preserve both the const behavior, and the possibility of explicitly returning a null/invalid signal.

diegoferigo · 2019-01-29T16:41:27Z

Some time ago, the Signal class was rewritten (robotology/wb-toolbox@deed69e) to minimize dynamic allocations. The main trick there was the introduction of a new type of buffer, the CONTIGUOUS_ZEROCOPY, that provided a non-owning access to the buffer.

Today this is the default behavior for both input and output signals, and in Simulink we had to configure input signals as follows:

blockfactory/sources/Simulink/src/BlockFactory.cpp

Lines 175 to 176 in 39b3d44

    
           // Set input signals to be allocated in a contiguous memory storage 
        
           ssSetInputPortRequiredContiguous(S, i, true);

otherwise they are treated as NONCONTIGUOUS and a copy of the buffer is performed.

Said this, here are my comments:

Use of pimpl pattern in blockfactory::core::Signal (added in diegoferigo/WB-Toolbox@75c1d36#diff-e32b9f2c904cb22f7a9187e2a76861a3). I guess this change can be relatively safely reverted, if we think it is the case.

In the case of the Simulink engine, this is correct. The Simulink Coder implementation instead does not have this problem since the core::Signal objects are cached. We can either revert the pImpl for the entire core::Signal class, or convert also the Simulink implementation to something similar (the number of signals cannot change during the simulation, so even if the buffer addresses change there should be no problem).

Use of std::shared_ptr to return a signal in getInputPortSignal and getOutputPortSignal , in place of standard copy on the stack, that in most cases will be optimized away due to Return Value Optimization (RVO). This it is a more complicated change, as it was explicitly introduced in robotology/wb-toolbox#126 to ensure that input data is only read and not wrote without introducing a new type ConstSignal or Signal.

Why do you think that there is a copy in this case? I recently forced in #33 to use the move semantic to return input signals, and this was necessary only due to a defect in ISO C++11:

blockfactory/sources/Simulink/src/SimulinkBlockInformation.cpp

Line 123 in ee0c696

return std::move(signal);

traversaro · 2019-01-29T17:35:42Z

Why do you think that there is a copy in this case?

I probably did not expressed myself clearly, but I am not sure about what you are referring to. I do not think I ever said there is a copy in this case. I just think that if Signal was a relatively standard class with no pimpl, a simple return will avoid the need of calling make_shared to create an ad hoc shared_ptr just to satisfy the interface.

We can either revert the pImpl for the entire core::Signal class, or convert also the Simulink implementation to something similar (the number of signals cannot change during the simulation, so even if the buffer addresses change there should be no problem).

I strongly suspect that eventually removing the pimpl from core::Signal is the preferable solution, as it reduces the complexity of the Signal class, and do not force to increase every the complexity of each engine BlockInformation's implementation just to avoid memory allocations.
The benefits of pimpl in this case (facilitating ABI stability) seems minimal compared with the increased complexity.

diegoferigo · 2019-01-29T20:16:56Z

I probably did not expressed myself clearly, but I am not sure about what you are referring to. I do not think I ever said there is a copy in this case. I just think that if Signal was a relatively standard class with no pimpl, a simple return will avoid the need of calling make_shared to create an ad hoc shared_ptr just to satisfy the interface.

The benefits of pimpl in this case (facilitating ABI stability) seems minimal compared with the increased complexity.

Sorry, I misunderstood your point if it was on the std::make_unique call. I agree on this, the pimpl does not contain much stuff in this case, it should be a straightforward change.

traversaro changed the title ~~Simplify get***PortSignal w.r.t. to memory allocations?~~ Simplify getInputPortSignal and getOutputPortSignal w.r.t. to dynamic memory allocations? Jan 26, 2019

traversaro changed the title ~~Simplify getInputPortSignal and getOutputPortSignal w.r.t. to dynamic memory allocations?~~ Simplify getInputPortSignal and getOutputPortSignal regarding dynamic memory allocations? Jan 26, 2019

diegoferigo added complexity::medium component::core issue::type::enhancement labels Jan 29, 2019

diegoferigo mentioned this issue Jan 29, 2019

Reduce dynamic allocations during simulation loop #37

Merged

diegoferigo self-assigned this Jan 30, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify getInputPortSignal and getOutputPortSignal regarding dynamic memory allocations? #27

Simplify getInputPortSignal and getOutputPortSignal regarding dynamic memory allocations? #27

traversaro commented Jan 26, 2019 •

edited

Loading

traversaro commented Jan 26, 2019

diegoferigo commented Jan 29, 2019

traversaro commented Jan 29, 2019 •

edited

Loading

diegoferigo commented Jan 29, 2019

Simplify getInputPortSignal and getOutputPortSignal regarding dynamic memory allocations? #27

Simplify getInputPortSignal and getOutputPortSignal regarding dynamic memory allocations? #27

Comments

traversaro commented Jan 26, 2019 • edited Loading

traversaro commented Jan 26, 2019

diegoferigo commented Jan 29, 2019

traversaro commented Jan 29, 2019 • edited Loading

diegoferigo commented Jan 29, 2019

traversaro commented Jan 26, 2019 •

edited

Loading

traversaro commented Jan 29, 2019 •

edited

Loading