Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add DataLoader observability support #1034

Open
mmuth opened this issue Jul 22, 2024 · 3 comments
Open

Add DataLoader observability support #1034

mmuth opened this issue Jul 22, 2024 · 3 comments
Assignees
Labels
in: core Issues related to config and core support type: enhancement A general enhancement
Milestone

Comments

@mmuth
Copy link

mmuth commented Jul 22, 2024

Feature Request
Hi there - I am currently migrating from graphql-kickstart to spring-graphql. The new observations in the GraphQlObservationInstrumentation are really great!

However I suffer from a huge amount of "noise spans" when using dataloaders, as the traces contain a Span for every resolved Child node (with an graphql.field.path in an array-like iteration /myfield/edges[x]). That means for a GQL request with a 5000 page size, 5000 spans are added even though only one single database request happens in the end and this is the interesting part to investigate on the resulting trace.

I'd propose that we can somehow recognize the "dataloaded fields" (e.g by attaching a graphql.datafetcher.completeablefuture=true KevValue as an extra highCardinalityKeyValue for each observation; or something similar), then one would be able to filter them later. To make this complete, an Observation for all called dataloaders would be very helpful.

@spring-projects-issues spring-projects-issues added the status: waiting-for-triage An issue we've not yet triaged label Jul 22, 2024
@bclozel bclozel added the in: core Issues related to config and core support label Jul 22, 2024
@bclozel bclozel self-assigned this Aug 20, 2024
@bclozel bclozel added type: enhancement A general enhancement and removed status: waiting-for-triage An issue we've not yet triaged labels Aug 20, 2024
@bclozel bclozel added this to the 1.4.x milestone Aug 20, 2024
@bclozel
Copy link
Member

bclozel commented Aug 20, 2024

Thanks for reaching out. I had a look at our instrumentation and wrote a test case for this. We can consider several changes here and some of them are quite involved, so I have scheduled this for our next feature release.

As far as I can see, it is completely transparent to a DataFetcher caller whether it is using a DataLoader internally or not. Also, there are various strategies that can trigger the DataLoader dispatch - by default this is called at each depth level of the query.

We can extend our SelfDescribingDataFetcher to expose whether this is a batching operation. I believe we could automatically expose this for @BatchMapping handlers, but of course not for custom data fetcher registrations.
With that extra information, we could then make it available to our DataFetcherObservationContext and turn this information into a low cardinality key-value for the data fetcher observation. We would need to think about a proper name, but graphql.datafetcher.batching=true comes to mind.

We can also consider instrumenting DataLoader operations, but this is more challenging. Instrumenting DataLoader as they are being registered in the DataLoaderRegistry is complex, because we would need to involve the Observation API there and get the actual ObservationRegsitry instance at that point. Right now, instrumentation is neatly separated in its GraphQlObservationInstrumentation class.

We could, when the request execution starts, get the DataLoaderRegistry and for each registered data loader:

  1. get the DataLoader from the registry
  2. unregister it
  3. wrap it for instrumentation purposes
  4. re-register it under the original key

We would then need to create a new context/convention/key-values set for this new graphql.dataloader observation.
This would certainly make the observability better but we should also consider instrumentation overhead.

@rstoyanchev
Copy link
Contributor

rstoyanchev commented Sep 5, 2024

For the route with SelfDescribingDataFetcher and @BatchMapping methods, we can also support cases where DataLoader is injected into a controller method. That would cover all of @Controller usage, which would be a good place to be.

@rstoyanchev rstoyanchev changed the title Allow to distinguish between dataloader fetched fields in datafetcher observations and offer dataloader instrumentation Allow to distinguish DataLoader fetched fields in DataFetcher observations and provide DataLoader instrumentation Dec 4, 2024
@bclozel bclozel changed the title Allow to distinguish DataLoader fetched fields in DataFetcher observations and provide DataLoader instrumentation Add DataLoader observability support Dec 5, 2024
@bclozel bclozel modified the milestones: 1.4.x, 1.4.0-M1 Dec 5, 2024
@bclozel
Copy link
Member

bclozel commented Dec 5, 2024

We have discussed this as a team and here's our plan for this feature currently scheduled for 1.4.0-M1.

We can improve the SelfDescribingDataFetcher contract to let the instrumentation know that it's a @BatchMapping annotated method or that a @SchemaMapping method has a DataLoader argument. This means that the data fetching operation will involve a data loader and batch loading in general.

From there, we could just get that information and add a low cardinality KeyValue to the data fetching observation. This is useful, but we think that skipping the observations in this case and instrumenting the DataLoader operation provides more useful information to developers: when multiple data fetching operations are optimized into a single batch loading operation, this should be reflected in your metrics dashboard and traces.

To achieve this, the instrumentation should get the DataLoaderRegistry when the request execution starts and wrap all the registered data loaders to instrument their execution. As a result, we should create a new type of observation for batch loading.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
in: core Issues related to config and core support type: enhancement A general enhancement
Projects
None yet
Development

No branches or pull requests

4 participants