ByteBuf returned by BookKeeper triggers CRC Checksum calculation when calling "readBytes" #4372

eolivelli · 2024-05-17T07:42:06Z

BUG REPORT

Describe the bug
I have developting a Pulsar BrokerInterceptor. The BrokerInterceptor is able to process the data that Pulsar read from BookKeeper, without memory copies.
While analysing a flamegraph I have seen that the ByteBuf returned by BookKeeper shows this weird behaviour and uses lot of CPU.

This is a flame graph. The version of BookKeeper is based on latest 4.16.x

To Reproduce

See the flamegraph

Expected behavior

readBytes has very little overhead

eolivelli · 2024-05-17T07:47:43Z

Interceptor code is here:
https://github.com/datastax/pulsar-jms/blob/master/pulsar-jms-filters/src/main/java/com/datastax/oss/pulsar/jms/selectors/JMSPublishFilters.java#L277

lhotari · 2024-05-17T07:52:03Z

that seems like a strange flamegraph. I don't see how readBytes could trigger CRC calculation.

ByteBufVisitor is used in checksum calculations:

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/proto/checksum/DigestManager.java

Lines 69 to 71 in f8eb7a0

    
           UpdateContext updateContext = new UpdateContext(digest); 
        
           ByteBufVisitor.visitBuffers(buffer, offset, len, byteBufVisitorCallback, updateContext); 
        
           return updateContext.digest;

PR was #4196

eolivelli · 2024-05-17T08:11:31Z

The ByteBuf was not coming from a read from the BookKeeper client, because the ByteBug is coming from the network (it is the Pulsar producer that is sending a message and the interceptor processes it)

But maybe it is a ByteBuf recycled ?

lhotari · 2024-05-17T08:27:10Z

The ByteBuf was not coming from a read from the BookKeeper client, because the ByteBug is coming from the network (it is the Pulsar producer that is sending a message and the interceptor processes it)

But maybe it is a ByteBuf recycled ?

It's hard to see how it could result in the stacktrace even if there was a recycling bug.
GetBytesCallbackByteBuf instance is not stored as a reference anywhere and gets passed as a parameter here:

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/util/ByteBufVisitor.java

Line 138 in f8eb7a0

visitBuffer.getBytes(visitIndex, callbackByteBuf, 0, visitLength);

.

Perhaps it's a profiler issue.

Please add -XX:+UnlockDiagnosticVMOptions -XX:+DebugNonSafepoints to JVM options to prevent any issues in this area:

When agent is not loaded at JVM startup (by using -agentpath option) it is
highly recommended to use -XX:+UnlockDiagnosticVMOptions -XX:+DebugNonSafepoints JVM flags.
Without those flags the profiler will still work correctly but results might be
less accurate. For example, without -XX:+DebugNonSafepoints there is a high chance
that simple inlined methods will not appear in the profile. When the agent is attached at runtime,
CompiledMethodLoad JVMTI event enables debug info, but only for methods compiled after attaching.

It might also be useful to compare Async Profiler 2.9 and 3.0 results. Just to be sure that the new stacktrace solution in 3.0 isn't causing the problem.

hangc0276 · 2024-05-29T02:35:11Z

I also found the checksum cost a lot of CPU

lhotari · 2024-05-29T06:53:08Z

I also found the checksum cost a lot of CPU

@hangc0276 I guess it is expected to consume a lot of CPU? In Enrico's case, the flamegraph doesn't seem to be valid and my assumption was that adding -XX:+UnlockDiagnosticVMOptions -XX:+DebugNonSafepoints to JVM options would fix it since it's recommended to use these JVM options to get proper results while profiling.

eolivelli added the type/bug label May 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ByteBuf returned by BookKeeper triggers CRC Checksum calculation when calling "readBytes" #4372

ByteBuf returned by BookKeeper triggers CRC Checksum calculation when calling "readBytes" #4372

eolivelli commented May 17, 2024

eolivelli commented May 17, 2024

lhotari commented May 17, 2024

eolivelli commented May 17, 2024

lhotari commented May 17, 2024

hangc0276 commented May 29, 2024

lhotari commented May 29, 2024

ByteBuf returned by BookKeeper triggers CRC Checksum calculation when calling "readBytes" #4372

ByteBuf returned by BookKeeper triggers CRC Checksum calculation when calling "readBytes" #4372

Comments

eolivelli commented May 17, 2024

eolivelli commented May 17, 2024

lhotari commented May 17, 2024

eolivelli commented May 17, 2024

lhotari commented May 17, 2024

hangc0276 commented May 29, 2024

lhotari commented May 29, 2024