-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for structured stream checkpointing on azure filesystem #587
Comments
Created branch feature/LOGISLAND-587-Support-for-structured-stream-checkpointing-on-azure-filesystem to implement this feature. |
Usage with currently commited code in the branch: In StructuredStream configuration, put something like (replace <blabla> with relevant strings): spark.base.checkpoint.path: wasbs://<myContainer>@<myStorageAccount>.blob.core.windows.net/spark-checkpointing Then in KafkaStreamProcessingEngine global configuration, set for instance the matching account key using a custom spark configuration key like: spark.custom.config.fs.azure.account.key.<myStorageAccount>.blob.core.windows.net: +H5IuOtsebY7fO6QyyntmlRLe3G8Rv0jcye6kzE2Wz4NrU3IdB4Q8ocJY2ScY9cQrJNXxUg2WbYJPndMuQWUCQ== Generic custom spark configuration keys are introduced in this branch and allow to call spark.custom.config.xxx.yyy.zzz: someValue See this page for examples on the custom Currently identified problem is that although needed azure clients libs The current workaround I found is to put in every |
merged in release/1.4.0, but requires the workaround |
Also used to store the current kafka offset while writing to the structured stream sink
The text was updated successfully, but these errors were encountered: