Identifying the anonymous in today’s digital world
A few years ago, I was having a discussion with an acquaintance who was involved in an investigation. One individual they were tracking kept changing his mobile phone every few days. Each new mobile was typically pay as you go or stolen and personal information connected to the mobile was either false or not available. Yet the investigators were able to very quickly determine the new number of the individual each time they switched mobile numbers. How they did this at the time impressed me, and I use the logic to this day.
Throughout the course of the investigation they were able to determine who this individual contacted. A few of the mobiles that the individual contacted did not routinely change their mobile number. As a result, by watching the calling patterns of the mobile phones where the numbers did not change, the investigators could quickly determine a new number that suddenly was calling each of the static numbers in a similar pattern. This of course requires access to mobile network data, but it worked. Even though this individual thought they were not being tracked, their efforts to remain anonymous unknown to them were ineffective. As a side note, there is software that will search for and detect these types of calling patterns automatically. The same logic here can easily be applied to a Internet connection.
A more common example is when you are ever pulled over by a police officer and you don’t have your license. Aside from them giving you a ticket for not having your license on your person, they will most likely ask you for your full name and birth date. The reason for the birth date is to help assure them that when they go back to the cruiser to search on their laptop, the records they obtain are actually yours and not someone else with the same name. How many Michael Dundas’ are there in Canada? Not sure, but the number of Michael Dundas’ with the exact same birth date really lowers the probability of a false positive. This same logic can be applied to social networking and there is interesting research in this area including twitter.
The EFF recently published a post on information theory and privacy. In it they discuss the concept of Entropy and how it applies to information and privacy. It touches a bit on some of the math behind it, but if you are interested it is a good explanation of why when you think you are anonymous you may not be, even when you take precautions. If you skip the math, their example of how a ‘user-agent’ header transmitted by your browser can narrow you down to one of 1500 people can start to give people that are new to information and anonymity a good perspective.

