NHS.uk allowing Google, Facebook, and others to track you
The NHS is allowing Google, Facebook, and others to track your http://www.nhs.uk/ browsing habits, regardless of the fact that people use the page to seek medical advice. It was recently pointed out to me that the NHS Choices website’s social features include the Facebook Like button (see e.g. the page on Testicular Cancer). Due to the fact that the standard method of Facebook Like button deployment is intrusive to say the least, I thought I would look into identifying which third party companies have been given permission to track users on NHS Choices, and my results are rather disconcerting.
In short there are four third-party, advertising/tracking companies which are informed every time a user visits one of the “conditions pages” on the NHS Choices website. These listed below, all get to make a call from the user’s browser, in turn allowing the four companies to access their cookies, tracking the users (explained in a previous blog post of mine, and in Bala’s research). This means, that if one has ever logged into a Google account, or a Facebook account and then visits one of the pages on the NHS site, the company will then know that their user X was just looking at a page about condition Y on the NHS website.
These are the four third party companies that make requests every time a “conditions page” on http://www.nhs.uk/ is viewed by a user:
jambi:~ mt $ grep "Host" tcpdump.ext.20101121.log | sort -u<br />
Host: l.addthiscdn.com<br />
Host: statse.webtrendslive.com<br />
Host: www.facebook.com<br />
Host: www.google-analytics.com
Two of the four third-party sites (facebook.com and addthiscdn.com) are contacted in order to provider the “social functionality” shown in the following screenshot. This intrusive OPT-OUT method of adding social features to the NHS website, in my opinion is NOT acceptable. I would only deem this to be acceptable if NHS has written declarations from the two aforementioned services stating that they WOULDN’T be tracking peoples’ browsing habits on http://www.nhs.uk/.
And the other two sites contacted (webtrendslive.com and google-analytics.com) seemed to be used for analytics purposes. In my view, this task should NOT be outsourced to a third party. If this was a website about pub reviews these third-party services would be acceptable, but due to the nature of the information on the Choices website, I feel the NHS should be hosting their own analytics code. Ok, I understand that the NHS needs to gather statistics about their website usage, but their user’s privacy should be of utmost importance, there do exist a high number of open sourced analytics software which the NHS should run themselves.
In order to show that I am not making this up, I have captured all of the HTTP requests made by my browser when loading the HIV and AIDS information page on NHS Choices.
http://www.nhs.uk/conditions/HIV/Pages/Introduction.aspx
The below two files are logs of all HTTP requests made when loading the HIV page:
https://mmt.me.uk/blog/misc/nhscookies/tcpdump.full.20101121.log
And this cut down log file shows all of the third-party HTTP requests made by one’s browser when loading the aforementioned page:
https://mmt.me.uk/blog/misc/nhscookies/tcpdump.ext.20101121.log
The above logs where captured using the following bash command:
tcpdump -A -s 1024 -i en0 dst port 80
An example:
My colleague Steve captured output from the HTTP trace via the NHS website, it can be found on http://pastebin.com/4TfDRRZJ
The browser (Safari) had it’s history cleared, logged into facebook, the facebook window closed, then sent to the NHS page.
Bits of confidential data replaced with XXXs
GET /plugins/like.php?href=http%3A%2F%2Fwww.nhs.uk%2fConditions%2fHIV%2fPages%2fIntroduction.aspx&layout=button_count&show_faces=true&width=450&action=like&colorscheme=light&height=21 HTTP/1.1<br />
Host: www.facebook.com<br />
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_4; en-gb) AppleWebKit/533.18.1 (KHTML, like Gecko) Version/5.0.2 Safari/533.18.5<br />
Accept: application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5<br />
Referer: http://www.nhs.uk/conditions/HIV/Pages/Introduction.aspx<br />
Accept-Language: en-gb<br />
Accept-Encoding: gzip, deflate<br />
Cookie: presence=DJ290173073BchADhA_22112.channelH1L60XXXXXXXXXXXXXXXXX4104WMblcMsndPBXXXXXXXXXXXsbPBtA_5b_5dBfAnullBuctMsA0QBblADacP0VXXXXXXXXXXXX0K290173073QQQ; x-referer=http%3A%2F%2Fwww.facebook.com%2Fhome.php%23%2Fhome.php; xs=976XXXXXXXXXXXXXXXXXXX0e11a0e600; sid=2; sct=12XXXXXX70; made_write_conn=12XXXXXX70; lu=ghXXXXXXXXXXXXXXXXXXYgqQ; datr=12XXXXXXXX-XXXXXXXXXXXf24
Possible Data Protection Violation
_It was pointed out to me that I was reference the incorrect ICO filing. The data controller for the NHS Choices website is the Department of Health and not NHS Direct. </p>
Find below, my amended version of this and the following section of this blog post. – Mischa 2010-11-24 11:00:00.</em>
In order to see the NHS’s Data Protection Policy, we had a look at their ICO filing, which led me to the following page:
http://www.ico.gov.uk/ESDWebPages/DoSearch.asp?reg=4693360
http://www.ico.gov.uk/ESDWebPages/DoSearch.asp?reg=4906007
I should start this section by saying that I am not a lawyer. But it seems like sections 6 and 4 purpose 2 are is relevant to my question of “how come the NHS website has third-party tracking enabled, especially given that the tracking is provided by for profit advertising companies?”. Firstly, it should be noted that by contacting facebook and google, data is being sent outside of the European Economic Area. Which is in violation of their Data Protection commitment of “Transfers: None outside the European Economic Area”.
And as per the ICO filing the potential recipients are: Business associates and other professional advisers, Central Government, Data subjects themselves, Employees and agents of the data controller, Healthcare, social and welfare advisers or practitioners, Local Government, Ombudsmen and regulatory authorities, Other companies in the same group as the data controller, Persons making an enquiry or complaint, Police forces, Relatives, guardians or other persons associated the data subject, Survey and research organisations.</p>
I will like to point out that no where in the ICO filing can one see that the NHS will be sharing data with advertising companies.</del>
It should be noted that the ICO filing does not make it explicit that the Department of Health would not:
- Sell advertising to patients
- Sell/Provide user data for third party advertising
Next Steps: FOI
I am about to post off a Freedom Of Information request (tomorrow morning), asking the NHS to please supply the minutes of all policy and technical meetings involved in the decision to deploy iframes referencing non-NHS sites and to use third-party analytics software on NHS choices pages.
Next Steps: Official Complaint
Further to the FOI request I am going to submit an official complaint via the official NHS Choices feedback form :http://www.nhs.uk/aboutNHSChoices/Pages/ContactUs.aspx.
I have the latest copy of the FOI Request updated FOI Request and the Letter of Complaint up in .pdf format on my website.
Note that:
There is a way to deploy the Facebook Like button which would resemble an OPT-IN based user interaction, instead of the intrusive standard iframe based approach. This involves the use of an “onClick” function call in Javascript which would tell Facebook only when explicitly “liked”. Obviously this method of interaction does not display the “social information” such as like counts, and whether or not you would be the first of your friends to “like” a given page. The German social networking site jetzt.de moved from the iframe to the self-hosted version after vigorous backlash from the userbase about being tracked (see for instance http://jetzt.sueddeutsche.de/texte/anzeigen/385237, line 350). This example was given to me by Sören Preibusch from the University of Cambridge.
And Finally…
I would like to thank Steve Harris and Dan Brickley for helping me decide how to take this forward. And I would like to thank Richard Northover for the link.
Amendment 2010-11-24 16:57 Conflict in terms of NHS privacy policy
It has been pointed out to me that in relation to the NHS stating that how only on pages with the Like button…. communications with Facebook.com occurs, this is also not true :
See the following page :
http://www.nhs.uk/livewell/depression/pages/depressionhome.aspx
I see no “Like” button on this page. But according to firebug, There is still an HTTP request made to facebook.com from my browser.
The following quote from the NHS Choices privacy policy states :
“While we only share your information with the Data Processors, when you visit pages on our site that display a Facebook Like button, Facebook will collect information about your visit. For more information, read the relevant section of the Facebook privacy policy.”
Which is NOT true.
The following screenshot illustrates this. People are free to replicate, all you need is Firefox and the Firebug plugin (both free and open source).
NHS privacy policy
I thought I would cut and paste the NHS’s privacy policy, in case it changes dated 2010-11-25 15:58:00 GMT
“As a government department, we do not share data with other organisations unless the law permits us to do so. We do not sell individual information. We will share it only with our authorised Data Processors, who must act at all times on our instructions as the Data Controller under the Data Protection Act 1998. Before you submit any information, we will notify you as to why we are asking for specific information and it is up to you whether you provide it.</p>
While we only share your information with the Data Processors, when you visit pages on our site that display a Facebook Like button, Facebook will collect information about your visit. For more information, read the relevant section of the Facebook privacy policy.”</em>
Furthermore
It should be noted that it has been pointed out to me that AddThis’ privacy policy reveals they monetise their product through behavourial targeting. Would you be somewhat surprised if you started to received adverts about their ailments on third-party websites.?
Disabling Referer Headers in Firefox
Given the awesome work detailed by Bala from AT&T, and some recent privacy related measures I have been taking in my Firefox browser (see https-everywhere and adblocking fb), I have decided to instruct my browser to stop sending the Referrer Header (nb: incorrectly referred to as the ‘referer header’), when I am clicking around on the web.
The following example shows the Referrer header of the HTTP request telling facebook.com, that I have just been looking at a page about HIV on the NHS choices website.
GET /<br />
Host: www.facebook.com<br />
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_6_4; en-gb) AppleWebKit/533.18.1 (KHTML, like Gecko) Version/5.0.2 Safari/533.18.5<br />
Accept: application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5
Referer: http://www.nhs.uk/conditions/HIV/Pages/Introduction.aspx
Accept-Language: en-gb<br />
Accept-Encoding: gzip, deflate<br />
Cookie: presence=DJ290173073BchADhA_22112.channelH1L60X...
I followed instructions on the following blog post http://cafe.elharo.com/privacy/privacy-tip-3-block-referer-headers-in-firefox/ to configure my Firefox instance to not send the “referer header”.
In short, the steps needed are as follows:
- Type about:config into your firefox awesome bar, to bring up your settings
- find the setting
network.http.sendRefererHeader
. This is probably set to 2. - Choose one of the following values:
- 0: Completely disables the referer header (mischa’s setting)
- 1: Sends a referer header when following a link to another page, but not when loading images on the page
- 2: Always sends the referer header (default)
I am going to experiment with setting it to 0, disabling the referer header all the time, I will post back here to say if it causes me any problems.
</p>
HTTPS: Making more use of SSL
There has been a lot of talk about how more and more people are using their laptops on public wifi connections, and with the advent of the Firesheep plugin, there has been a number of scares around session hijacking, and unencrypted login details being sent through the ether.
As a result, I thought I would describe the steps I have taken in securing my Firefox instance on my laptop. These are :
- Installing the HTTPS Everywhere plugin from the eff, which attempts to select https if available when accessing a site. I have tested it with Facebook, Google, Hotmail, LinkedIn and a few other sites
- I have set my homepage to be encrypted.google.com
- I have changed the search engine in top right hand of my Firefox instance to use the encrypted google service, by installing their plugin
- I have set a master password on my Firefox keychain, which gives my stored passwords some level of protection
- And I run Adblocking software, (with a custom Facebook Like Button blocking extension) as per an earlier blog post
Furthmore, I use Firefox as my main browser, I have chrome installed, but I hardly ever use it, and I have a locked down, stateless Safari instance which I wrote about earlier.