The pulpproject.org website uses Google Analytics.
This is already problematic and will become more so, because Google’s data collection practices involve transferring user data from the EU to the US. Data collection is also subject to increasingly strict state-specific regulation in the USA.
Yesterday, I updated the Google Analytics preferences to anonymize IP addresses before sending data.
However, after chatting with @duck and @misc, I have been wondering whether we should just move from Google Analytics to a FOSS alternative.
As far as I am aware, we have never used Google Analytics for anything more than getting a general idea of our page views. From time to time, I look at the page view counter as a sign of what information Pulp users might be most interested in, so that I can do a better job of providing relevant information.
There are alternatives of varying complexity that can provide a range of metrics. However, I am strongly opposed to gathering so much information that users would have to opt in. You’ve probably all seen this tweet about the joys of trying to access a website in 2022.
I would prefer that we build a community where people feel safe to self-report any shortfalls we have. I’ve tried to fix any issues with the website that have been reported during my time helping out with Pulp, whether the issues relate to content or layout.
Discourse itself provides some nice anonymous statistics. We don’t require anyone to use any identifying information. People can use Tor or throwaway email accounts and still ask and answer questions here. The only requirement is to follow our general community code of conduct.
I know that there is also a separate special interest group focusing on telemetry in Pulp itself. It is still in its early stages, but it is progressing in the same spirit.
Google Analytics and alternatives can provide statistics on devices, for example, desktop or mobile access to the site.
I consider the website a platform for information and updates. I don’t plan to move past a static site with text and images. Gathering device information is therefore not really relevant. What do you think?
I would appreciate feedback on whether you’d prefer Google Analytics or a FOSS alternative.
I would also appreciate it if you would +1 the collection of just page views.
If you think device type information or anything else would be interesting or relevant, please tell me why so I can understand.
Please let me know if there is anything I have not considered as part of this.