You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I stumbled upon the user_agent function from pip/_internal/download and the combination of values returned seems to be pretty detailed. In my case it looks like this:
pip/10.0.0 {"cpu":"x86_64","distro":{"id":"Xenial Xerus","libc":{"lib":"glibc","version":"2.23"},"name":"Ubuntu","version":"16.04"},"implementation":{"name":"CPython","version":"3.6.5"},"installer":{"name":"pip","version":"10.0.0"},"openssl_version":"OpenSSL 1.0.2g 1 Mar 2016","python":"3.6.5","setuptools_version":"39.0.1","system":{"name":"Linux","release":"4.13.0-38-generic"}}
I would expect such combination to be unique (or at least have low cardinality), which could then be used as an implicit identifier of users. Even if this is the case, I assume the information linked to this is not sensitive, but it's hard to say without having access to the logs. If this could be used as an identifier (even implicit), then it could be that at least in Europe, the data collection falls under the jurisdiction of General Data Protection Regulation. I understand this data is very valuable but to stay on the safe side, it would be nice to understand the following:
Can this fingerprints be used to identify users? (at least some of them)
Does this fall under GDPR?
What would be the implications regarding this data collection (opt-out, access to the data for users, etc.)?
Thank you for your time.
The text was updated successfully, but these errors were encountered:
I'm going to go ahead and close this, for the reasons @pfmoore mentioned, and also because AIUI GDPR is targeting service operators, not software. For a similar issue, but on PyPI (with the assumption you're using PyPI), see pypi/warehouse#3532.
Description:
I stumbled upon the
user_agent
function frompip/_internal/download
and the combination of values returned seems to be pretty detailed. In my case it looks like this:You can obtain yours using:
$ python -c 'from pip.download import user_agent; print(user_agent())'
It contains the following information:
I would expect such combination to be unique (or at least have low cardinality), which could then be used as an implicit identifier of users. Even if this is the case, I assume the information linked to this is not sensitive, but it's hard to say without having access to the logs. If this could be used as an identifier (even implicit), then it could be that at least in Europe, the data collection falls under the jurisdiction of General Data Protection Regulation. I understand this data is very valuable but to stay on the safe side, it would be nice to understand the following:
Thank you for your time.
The text was updated successfully, but these errors were encountered: