Decentralized Privacy-Preserving Proximity Tracing: Difference between revisions

Developed by École Polytechnique Fédérale de LausanneETH ZurichKU LeuvenDelft University of TechnologyUniversity College LondonHelmholtz Centre for Information SecurityUniversity of TorinoISI Foundation[1] April 4, 2020[2] Digital contact tracing Android & iOS smartphones ~10 m (33 ft)[3]

Decentralized Privacy-Preserving Proximity Tracing (DP-3T, stylized as dp3t) is an open protocol developed in response to the COVID-19 pandemic to facilitate digital contact tracing of infected participants.[4] The protocol, like competing protocol Pan-European Privacy-Preserving Proximity Tracing (PEPP-PT), uses Bluetooth Low Energy to track and log encounters with other users.[5][6] The protocols differ in their reporting mechanism, with PEPP-PT requiring clients to upload contact logs to a central reporting server, whereas with DP-3T, the central reporting server never has access to contact logs nor is it responsible for processing and informing clients of contact.[1] Because contact logs are never transmitted to third parties, it has major privacy benefits over the PEPP-PT approach,[7][8][9] however this comes at the cost of requiring more computing power on the client side to process infection reports.[10]

According to Google, the Google / Apple Exposure Notification project was "heavily inspired" by the DP-3T protocol.[11][12]

The DP-3T SDK and calibration apps intend to support the Apple/Google API as soon as it is released to iOS and Android devices.[13][14]

On the 21 April 2020, the Swiss Federal Office of Public Health announced that the Swiss national coronavirus contact tracing app will be based on DP-3T.[15] On the 22 April 2020, the Austrian Red Cross, leading on the national digital contact tracing app, announced its migration to the approach of DP-3T.[16] Estonia also confirmed that their app would be based on DP-3T.[17] On April 28, 2020, it was announced that Finland was piloting a version of DP-3T called "Ketju".[18] In Germany, a national app is being built upon DP-3T by SAP SE and Deutsche Telekom alongside CISPA, one of the organisations that authored the protocol.[19]

Overview

The DP-3T protocol works off the basis of Ephemeral IDs (EphID), semi-random rotating strings that uniquely identify clients.[20] When two clients encounter each other, they exchange EphIDs and store them locally in a contact log.[21] Then, once a user tests positive for infection, a report is sent to a central server. Each client on the network then collects the reports from the server and independently checks their local contact logs for an EphID contained in the report. If a matching EphID is found, then the user has come in close contact with an infected patient, and is warned by the client. Since each device locally verifies contact logs, and thus contact logs are never transmitted to third parties, the central reporting server cannot by itself ascertain the identity or contact log of any client in the network. This is in contrast to competing protocols like PEPP-PT, where the central reporting server receives and processes client contact logs.[22]

Ephemeral ID

A diagram demonstrating how the different components of the Ephemeral ID algorithm feed into each other

Similar to the TCN Protocol and its Temporary Contact Numbers, the DP-3T protocol makes use of 16 byte Ephemeral IDs (EphID) to uniquely identify devices in the proximity of a client. These EphIDs are logged locally on a receiving client's device and are never transmitted to third parties.[1]

To generate an EphID, first a client generates a secret key that rotates daily (${\displaystyle SK_{t}}$) by computing ${\displaystyle SK_{t}=H(SK_{t-1})}$, where ${\displaystyle H()}$ is a cryptographic hash function such as SHA-256. ${\displaystyle SK_{0}}$ is calculated by a standard secret key algorithm such as Ed25519. The client will use ${\displaystyle SK_{t}}$ during day ${\displaystyle t}$ to generate a list of EphIDs. At the beginning of the day, a client generates a local list of size ${\displaystyle n=(24*60)/l}$ new EphIDs to broadcast throughout the day, where ${\displaystyle l}$ is the lifetime of an EphID in minutes. To prevent malicious third parties from establishing patterns of movement by tracing static identifiers over a large area, EphIDs are rotated frequently. Given the secret day key ${\displaystyle SK_{t}}$, each device computes ${\displaystyle S\_EphID(BK)=PRG(PRF(SK_{t},BK))}$, where ${\displaystyle BK}$ is a global fixed string, ${\displaystyle PRF()}$ is a pseudo-random function like HMAC-SHA256, and ${\displaystyle PRG()}$ is a stream cipher producing ${\displaystyle n*16}$ bytes. This stream is then split into 16-byte chunks and randomly sorted to obtain the EphIDs of the day.[1]

Technical specification

The DP-3T protocol is made up of two separate responsibilities, tracking and logging close range encounters with other users (device handshake), and the reporting of those encounters such that other clients can determine if they have been in contact with an infected patient (infection reporting). Like most digital contact tracing protocols, the device handshake uses Bluetooth Low Energy to find and exchange details with local clients, and the infection reporting stage uses HTTPS to upload a report to a central reporting server. Additionally, like other decentralized reporting protocols, the central reporting server never has access to any client's contact logs; rather the report is structured such that clients can individually derive contact from the report.[1]

Device handshake

In order to find and communicate with clients in proximity of a device, the protocol makes use of both the server and client modes of Bluetooth LE, switching between the two frequently.[23] In server mode the device advertises its EphID to be read by clients, with clients scanning for servers.[24] When a client and server meet, the client reads the EphID and subsequently writes its own EphID to the server. The two devices then store the encounter in their respective contact logs in addition to a coarse timestamp and signal strength. The signal strength is later used as part of the infection reporting process to estimate the distance between an infected patient and the user.[1]

Infection reporting

When reporting infection, there exists a central reporting server controlled by the local health authority. Before a user can submit a report, the health authority must first confirm infection and generate a code authorizing the client to upload the report. The health authority additionally instructs the patient on which day their report should begin (denoted as ${\displaystyle t}$). The client then uploads the pair ${\displaystyle SK_{t}}$ and ${\displaystyle t}$ to the central reporting server, which other clients in the network download at a later date. By using the same algorithm used to generate the original EphIDs, clients can reproduce every EphID used for the period past and including ${\displaystyle t}$, which they then check against their local contact log to determine whether the user has been in close proximity to an infected patient.[1]

In the entire protocol, the health authority never has access to contact logs, and only serve to test patients and authorize report submissions.[1]:p. 11

Epidemiological analysis

When a user installs a DP-3T app, they are asked if they want to opt in to sharing data with epidemiologists. If the user consents, when they are confirmed to have been within close contact of an infected patient the respective contact log entry containing the encounter is scheduled to be sent to a central statistics server. In order to prevent malicious third parties from discovering potential infections by detecting these uploads, reports are sent at regular intervals, with indistinguishable dummy reports sent when there is no data to transmit.[1]

Health authority cooperation

To facilitate compatibility between DP-3T apps administered by separate health authorities, apps maintain a local list of the regions a user has visited. Regions are large areas directly corresponding to health authority jurisdiction; the exact location is not recorded. The app will later connect these regions to their respective foreign central reporting server, and fetch reports from these servers in addition to its normal home reporting server. Apps will also submit reports to these foreign reporting servers if the user tests positive for infection.[1]

Attacks on DP-3T and criticism

Cryptography and security scholar Serge Vaudenay, analyzing the security of DP-3T[25] argued that:

some privacy protection measurements by DP3T may have the opposite affect of what they were intended to. Specifically, sick and reported people may be deanonymized, private encounters may be revealed, and people may be coerced to reveal the private data they collect.

— Serge Vaudenay, [25]:p. 1

Vaudenay's work presents several attacks against DP-3T and similar systems. In response, the DP-3T group claim that out of twelve risks Vaudenay presents, eight are also present in centralized systems, three do not work, and one, which involves physical access to the phone, works but can be mitigated.[26] In a subsequent work[27] Vaudenay reviews attacks against both centralized and decentralized tracing systems and referring to identification attacks of diagnosed people concludes that:

By comparing centralized and decentralized architectures, we observe that attacks against decentralized systems are undetectable, can be done at a wide scale, and that the proposed countermeasures are, at best, able to mitigate attacks in a limited number of scenarios. Contrarily, centralized systems offer many countermeasures, by accounting and auditing.

— Serge Vaudenay, [27]:p. 6

In the same work[27] Vaudenay advocates that, since neither the centralized nor the decentralized approaches offer sufficient level of privacy protection, different solutions should be explored, in particular suggesting the ConTra Corona[28], Epione[29] and Pronto-C2[30] systems as a "third way".

Tang[31] surveys the major digital contact tracing systems and shows that DP-3T is subject to what he calls "targeted identification attacks".

Theoretical attacks on DP-3T have been simulated[32] showing that persistent tracking of users of the first version of the DP-3T system who have voluntarily uploaded their identifiers can be made easy to any 3rd party who can install a large fleet of Bluetooth Low Energy devices. This attack leverages the linkabilty of a user during a day, and therefore is possible on within a day on all users of some centralized systems such as the system proposed in the United Kingdom,[33] but does not function on 'unlinkable' versions of DP-3T where infected users' identifiers are not transmitted using a compact representation such as a key or seed.[34]

References

1. "DP-3T whitepaper" (PDF). GitHub. Retrieved 2020-04-22.
2. ^ "Initial commit". GitHub. 2020-04-04. Retrieved 2020-04-22.
3. ^ Sponås, Jon Gunnar. "Things You Should Know About Bluetooth Range". blog.nordicsemi.com. Retrieved 2020-04-12.
4. ^ "Rift Opens Over European Coronavirus Contact Tracing Apps". The New York Times. Reuters. 2020-04-20. ISSN 0362-4331. Retrieved 2020-04-21.
5. ^ Jason Bay, Joel Kek, Alvin Tan, Chai Sheng Hau, Lai Yongquan, Janice Tan, Tang Anh Quy. "BlueTrace: A privacy-preserving protocol for community-driven contact tracing across borders" (PDF). Government Technology Agency. Retrieved 2020-04-12.CS1 maint: multiple names: authors list (link)
6. ^ "Is Apple and Google's Covid-19 Contact Tracing a Privacy Risk?". Wired. ISSN 1059-1028. Retrieved 2020-04-18.
7. ^ "Controversy around privacy splits Europe's push to build COVID-19 contact-tracing apps". Fortune. Retrieved 2020-04-21.
8. ^ "European Contact Tracing Consortium Faces Wave of Defections". CoinDesk. 2020-04-20. Retrieved 2020-04-21.
9. ^ "Rift opens over European coronavirus contact tracing apps". Reuters. 2020-04-20. Retrieved 2020-04-21.
10. ^ "DP-3T 3 page brief" (PDF). GitHub. Retrieved 2020-04-22.
11. ^ "Apple and Google update joint coronavirus tracing tech to improve user privacy and developer flexibility". TechCrunch. Retrieved 2020-04-26.
12. ^ Farr, Christina (2020-04-28). "How a handful of Apple and Google employees came together to help health officials trace coronavirus". CNBC. Retrieved 2020-04-29.
13. ^ "DP3T-SDK for iOS". GitHub. Retrieved 2020-05-06.
14. ^ "DP3T-SDK for Android". GitHub. Retrieved 2020-05-06.
15. ^ swissinfo.ch, S. W. I.; Corporation, a branch of the Swiss Broadcasting. "Contact tracing app could be launched in Switzerland within weeks". SWI swissinfo.ch. Retrieved 2020-04-21.
16. ^ "Stopp Corona-App: Weiterentwicklung mit Hilfe der Zivilgesellschaft". OTS.at (in German). Retrieved 2020-04-22.
17. ^ "How do you trace Covid-19 while respecting privacy?". e-Estonia. 2020-04-24. Retrieved 2020-04-26.
18. ^ "Vaasa Central Hospital pilots the Ketju application for helping in the identification of coronavirus exposures". Sitra. Retrieved 2020-04-29.
19. ^ "Corona-Tracking: Helmholtz-Zentrum erwartet Start der Corona-App in den nächsten Wochen". www.handelsblatt.com (in German). Retrieved 2020-04-29.
20. ^ "France's Inria and Germany's Fraunhofer detail their ROBERT contact-tracing protocol". TechCrunch. Retrieved 2020-04-22.
21. ^ "Protecting Lives & Liberty: How Contact Tracing Can Foil COVID-19 & Big Brother". ncase.me. Retrieved 2020-04-19.
22. ^ Liauw, 🇸🇬 Frank (2020-04-09). "TraceTogether: under the hood". Medium. Retrieved 2020-04-18.
23. ^ "DP-3T/dp3t-sdk-android/dp3t-sdk/sdk/src/main/java/org/dpppt/android/sdk/internal/TracingService.java". GitHub. Retrieved 2020-04-24.
24. ^ "What is a client and server in BLE?". Nordic DevZone. Retrieved 2020-04-24.
25. ^ a b "Analysis of DP3T Between Scylla and Charybdis" (PDF). IACR ePrint archive. Retrieved 2020-05-07.
26. ^ The DP-3T Project (23 April 2020). "Response to 'Analysis of DP3T: Between Scylla and Charybidis'".
27. ^ a b c "Centralized or Decentralized? The Contact Tracing Dilemma" (PDF). IACR ePrint archive. Retrieved 2020-05-07.
28. ^
29. ^ "Lightweight Contact Tracing with Strong Privacy". arXiv. Retrieved 2020-05-07.
30. ^ "Towards Defeating Mass Surveillance and SARS-CoV-2: The Pronto-C2 Fully Decentralized Automatic Contact Tracing System". IACR ePrint archive. Retrieved 2020-05-07.
31. ^ "Privacy-Preserving Contact Tracing: current solutions and open questions". arXiv. Retrieved 2020-05-07.
32. ^ "BLE contact tracing sniffer PoC". github. Retrieved 2020-05-07.
33. ^ "NHS COVID App: Application and system architecture" (PDF). github. Retrieved 2020-05-08.
34. ^ "Privacy and Security Attacks on Digital Proximity Tracing Systems" (PDF). github. Retrieved 2020-05-08.