Update roadmap.md #6475

Open
wants to merge 3 commits into
base: master
Conversation

CharlieTLe
Member

What this PR does:
Updates roadmap

Which issue(s) this PR fixes:
Fixes #5251

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

Fixes #5251

Signed-off-by: Charlie Le <[email protected]>

Copilot reviewed 1 out of 1 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (2)

docs/roadmap.md:14

  • The word 'utilised' should be spelled 'utilized' to match American English conventions.
We have a [helm chart](https://github.com/cortexproject/cortex-helm-chart) but it needs work before it can be effectively utilised by different backends.

docs/roadmap.md:18

  • The sentence 'adds' is incomplete. It should be 'adds the following features:' or something similar.
[Prometheus Remote Write 2.0](https://prometheus.io/docs/specs/remote_write_spec_2_0/) adds
Comment on lines -32 to -34
## Downsampling
Downsampling means storing fewer samples, e.g. one per minute instead of one every 15 seconds.
This makes queries over long periods more efficient. It can reduce storage space slightly if the full-detail data is discarded.
Member

We have #4322, but it doesn't need to be part of the roadmap. I don't see this as a priority.

Contributor

I think downsampling support is a nice feature for our long term roadmap

Member Author

I can add this back in the longer-term roadmap so that attention can still be brought towards it in the future.
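
For readers following this thread, here is a minimal sketch of the downsampling idea described in the removed section: reducing samples scraped every 15 seconds to one per minute. Averaging each window is an assumption made for illustration, and the `downsample` helper is hypothetical; it does not reflect how Cortex or #4322 implement or propose the feature.

```python
from collections import defaultdict
from statistics import mean


def downsample(samples, step_seconds=60):
    """Reduce (timestamp_seconds, value) pairs to one averaged sample per window.

    Hypothetical helper: averaging is an assumed aggregation, chosen only
    to illustrate "one sample per minute instead of one every 15 seconds".
    """
    buckets = defaultdict(list)
    for ts, value in samples:
        # Align each timestamp to the start of its window.
        buckets[ts - ts % step_seconds].append(value)
    return [(start, mean(values)) for start, values in sorted(buckets.items())]


# Eight samples scraped every 15 seconds, covering two minutes.
raw = [(t, float(t)) for t in range(0, 120, 15)]
reduced = downsample(raw)
```

Here the eight raw samples collapse to two, which is why queries over long ranges get cheaper. Storage only shrinks if the full-detail data is then discarded, as the removed text notes.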

docs/roadmap.md Outdated
Comment on lines 36 to 38
## Per-metric retention

Cortex blocks storage supports deleting all data for a tenant after a time period (e.g. 3 months, 1 year), but we would also like to have custom retention for subsets of metrics (e.g. delete server metrics but retain business metrics).
Member

This can be removed. Today you can already use separate tenants, each with its own retention, for different sets of metrics.
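
A rough sketch of that workaround, assuming metrics are split across tenants by name prefix. The prefixes, tenant IDs, and helper names below are hypothetical; the only Cortex-specific detail is that the tenant is taken from the `X-Scope-OrgID` request header. Each tenant would then be given its own retention period through per-tenant limits, which is not shown here.

```python
# Hypothetical mapping from metric-name prefix to tenant ID. Each tenant
# is assumed to be configured separately with its own retention period.
TENANT_BY_PREFIX = {
    "business_": "acme-long-retention",
    "node_": "acme-short-retention",
}
DEFAULT_TENANT = "acme-short-retention"


def tenant_for(metric_name):
    """Pick the tenant whose retention policy should apply to this metric."""
    for prefix, tenant in TENANT_BY_PREFIX.items():
        if metric_name.startswith(prefix):
            return tenant
    return DEFAULT_TENANT


def headers_for(metric_name):
    """Build the request header that selects the tenant on the write path."""
    return {"X-Scope-OrgID": tenant_for(metric_name)}
```

In a real deployment this split would more likely live in the sender's configuration (for example, separate remote-write targets with different headers) than in application code. The trade-off is that queries spanning both groups of metrics then have to read from more than one tenant.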

## Exemplar support
[Exemplars](https://docs.google.com/document/d/1ymZlc9yuTj8GvZyKz1r3KDRrhaOjZ1W1qZVW_5Gj7gA/edit)
let you link metric samples to other data, such as distributed tracing.
As of early 2021 Prometheus will collect exemplars and send them via remote write, but Cortex needs to be extended to handle them.
Member

This can be removed; exemplar support is already there.

Member

I think there is an effort in Prometheus to create a storage for exemplars, but I agree, it is still early to put on the roadmap.


## Scalability

Scalability has always been a focus for the project, but there is a lot more work to be done. We can now scale to 100s of Millions of active series but 1 Billion active series is still an unknown.
Member

1 billion active series in a single tenant is doable today.
There are of course other scalability improvements we could still implement, but I don't think we should put them in the roadmap; they are not that critical.


## Billing and Usage analytics

We have all the metrics to track how many series, samples and queries each tenant is sending but don't have dashboards that help with this. We plan to have dashboards and UIs that will help operators monitor and control each tenants usage out of the box.
Member

I created cortexproject/cortex-jsonnet#64 because this would be nice to have, but it doesn't need to be part of the roadmap.

dosubot bot added the lgtm label (This PR has been approved by a maintainer) on Jan 3, 2025
Member

alanprot commented Jan 3, 2025

We can discuss some ideas on the next community call, I guess. Some that I can think of:

  • Better way to discover metrics with lots of labels (cardinality analyses)
  • Ingestion Pipeline?
    • Aggregation? Ingestion recording rules? (create recording rules that will perform the transform on the ingestion and not ingest the raw data?)
    • Batching?
  • Improve replication protocol between distributors and ingesters?
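
To make the "ingestion recording rules" idea above concrete, here is a minimal sketch of aggregating at ingestion so that only the aggregate is stored and the raw series are not. The `aggregate_on_ingest` function and the sample data are hypothetical, and summing over a dropped label is an assumed transform; nothing here describes an existing or planned Cortex API.

```python
from collections import defaultdict


def aggregate_on_ingest(samples, drop_labels):
    """Sum values of series that become identical once drop_labels are removed.

    Hypothetical sketch: each sample is a (labels_dict, value) pair, and the
    result maps the remaining label set to the summed value.
    """
    totals = defaultdict(float)
    for labels, value in samples:
        # Build a hashable, order-independent key from the labels we keep.
        kept = tuple(sorted((k, v) for k, v in labels.items() if k not in drop_labels))
        totals[kept] += value
    return dict(totals)


incoming = [
    ({"__name__": "http_requests", "job": "api", "pod": "a"}, 3.0),
    ({"__name__": "http_requests", "job": "api", "pod": "b"}, 4.0),
    ({"__name__": "http_requests", "job": "web", "pod": "c"}, 5.0),
]
stored = aggregate_on_ingest(incoming, drop_labels={"pod"})
```

Three incoming series become two stored series, because the high-cardinality `pod` label is dropped before anything is written. This also ties back to the first bullet: the approach depends on knowing which labels drive cardinality.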

@CharlieTLe
Member Author

We can discuss some ideas on the next community call, I guess. Some that I can think of:

  • Better way to discover metrics with lots of labels (cardinality analyses)

  • Ingestion Pipeline?

    • Aggregation? Ingestion recording rules? (create recording rules that will perform the transform on the ingestion and not ingest the raw data?)
    • Batching?
  • Improve replication protocol between distributors and ingesters?

Added these items to the agenda for our next community call on Jan 30, 2025.

https://docs.google.com/document/d/1shtXSAqp3t7fiC-9uZcKkq3mgwsItAJlH6YW6x1joZo/edit?usp=sharing

alolita commented Jan 3, 2025

Hi folks, Happy New Year! Sorry I missed this week's meeting. I will catch up on the notes and on Slack.

docs/roadmap.md Outdated

## Auth Gateway
## Support for Prometheus Remote Write 2.0
Contributor

Should we mention that this is a short-term roadmap, IIUC?
This sounds like something we can support soon, as there is already a PR for it.

Contributor

We should have roadmap items for the longer term.

Member Author

I can add a new section for longer-term roadmap items (2+ months) and add a date at the top showing when the document was last updated, so the reader has an anchor for the expected completion dates.

Labels
component/documentation lgtm This PR has been approved by a maintainer size/M
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Update project roadmap
5 participants