Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sentry (Collect Analytics) can not be disabled in Open Source Version #6430

Open
schnaker85 opened this issue Sep 24, 2024 · 8 comments
Open

Comments

@schnaker85
Copy link

schnaker85 commented Sep 24, 2024

Describe the bug
We tried to install label-studio on our GKE Cluster and it worked but we are not able to disable the collection of analytics on FE and BE to sentry.io.
We tried all the described options in our Env / Chart

Also tried the solution from #5787 and #4612 but it does not seem to work.
Any thoughts?

To Reproduce
We tried the following (all together, or 1 by 1) none of it does work
Set in the values.yaml for the helm chart

app:
  extraEnvironmentVars:
    LABEL_STUDIO_COLLECT_ANALYTICS: False
    COLLECT_ANALYTICS: False
    collect_analytics: False

Expected behavior
Sentry calls are not triggered.

Screenshots
In the backend log we get a lot of errors like:

{"asctime": "24/Sep/2024:12:49:08 +0000", "name": "urllib3.connectionpool", "funcName": "urlopen", "lineno": 826, "levelname": "WARNING", "user_id": null, "message": "Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Cannot connect to proxy.', OSError('Tunnel connection failed: 403 Forbidden'))': /api/5820521/envelope/", "request_id": null}

as our proxy won't allow any calls to https://o227124.ingest.sentry.io/

Environment (please complete the following information):

  • Label Studio Version 1.13.1 on GKE

Additional context
Helm chart "1.6.3"

Thanks for the help.

@ashishmahamuni21
Copy link

facing the same issue ...
let me know @schnaker85 if you find something...

@KkemChen
Copy link

KkemChen commented Nov 8, 2024

i have same question

@jin-eld
Copy link

jin-eld commented Dec 1, 2024

My observations:

FRONTEND_SENTRY_DSN="" SENTRY_DSN="" COLLECT_ANALYTICS=False label-studio start seems to prevent the backend from calling home, but in the web developer console I still see some attempts:

image

What really made me furious, that creating an account locally leaked my e-mail and I immediately received a spam mail from labelstud.io, most likely I oversaw the checkbox about "receiving news", I guess I thought that it'd be the "tip of the day" on a local setup, but no, it actually subscribed me to some newsletter.

@jin-eld
Copy link

jin-eld commented Dec 1, 2024

Noticed another interesting thing when running locally, got caught by uBlock:

image

@antoine-buisson
Copy link

Same problem here, I set in the Helm Chart :

  • global.extraEnvironmentVars.COLLECT_ANALYTICS: false

and pods have the correct :

  • name: COLLECT_ANALYTICS
    value: "false"

    I am still seeing analytics to sentry and google going out of the pods.

@jin-eld
Copy link

jin-eld commented Dec 6, 2024

My current solution was to rip out all mentions of Sentry/Google from the web part and remove all sentry URLs in the Python code. Still need go over the Python part and double check that it's not "calling home" in the background. I am not a web guy, but the need to leave unsafe-eval in CSP_SCRIPT_SRC still leaves me suspicious of what may still be going on, the UI would not work without it though.

So this is what I am using now, still checking if I did not remove too much or too little:
https://github.com/jin-eld/label-studio-nospy

Don't get me wrong, Label Studio is great software for labeling and I appreciate it being open source, but the automatically enabled and data collection even when running it locally which makes it really not obvious to new users and the inability to truly disable it is a total "no go" for me.

Also, for anyone who is struggling to build/run it locally, at least on Fedora you seem to need to go via poetry to make it work, i.e.:

make frontend-setup
poetry run python label_studio/manage.py collectstatic

Then for a pip installable .whl run:

poetry build

To run it from the git checkout directory:

poetry run python label_studio/manage.py runserver

@antoine-buisson
Copy link

great job, I'm going to try your fork later in the day.

It's very sad to waste such a good product with so many analytics.

@GuitaristForEver
Copy link

My observations:

FRONTEND_SENTRY_DSN="" SENTRY_DSN="" COLLECT_ANALYTICS=False label-studio start seems to prevent the backend from calling home, but in the web developer console I still see some attempts:

image

What really made me furious, that creating an account locally leaked my e-mail and I immediately received a spam mail from labelstud.io, most likely I oversaw the checkbox about "receiving news", I guess I thought that it'd be the "tip of the day" on a local setup, but no, it actually subscribed me to some newsletter.

This worked for me 👍🏼 thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants