Data exhaust

From Wikipedia, the free encyclopedia

Data exhaust, or exhaust data, is the trail of data left by Internet users through their online activity, behavior, and transactions. It is part of a broader category of unconventional data[1] that includes geospatial, network, and time-series data and may be useful for predictive analytics. Every website visited, link clicked, and even mouse hover can be recorded, leaving behind a trail of data.[2] An enormous amount of often raw data is created, which can take the form of cookies, temporary files, log files, storable choices, and more.[3] This information can help to improve the online experience, for example through customized content. It can be used to track trends, and studying data exhaust can also inform improvements to the user interface and layout design. On the other hand, it can also compromise privacy, as it offers valuable insight into the user's habits. For example, Google, operator of the world's most popular website, uses this data exhaust to refine the predictive value of its products.[4]

Companies often collect data that does not seem immediately useful. Even if a company does not use the information right away, it can be stored for future use or sold to another party that can use it. The data can help with quality control, performance, and revenue.[5]

Unlike primary content, these data are not purposefully created by the user, who is often unaware of their existence. A bank, for example, would treat the amounts and parties of a transaction as primary data, whereas secondary data might include the percentage of transactions carried out at a cash machine rather than at a bank branch.[6]
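The bank example can be illustrated with a minimal sketch. The record layout, the field names, and the `atm_share` helper below are hypothetical and are not drawn from any cited source; the point is only that the secondary figure is derived from a by-product field (the channel) rather than from the primary amounts and parties.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transaction:
    # Primary data: what the bank records on purpose.
    amount: float
    payer: str
    payee: str
    # By-product of processing (assumed field): "atm" or "branch".
    channel: str


def atm_share(transactions):
    """Return the percentage of transactions made at a cash machine."""
    if not transactions:
        return 0.0
    atm_count = sum(1 for t in transactions if t.channel == "atm")
    return 100.0 * atm_count / len(transactions)


# Made-up sample records, for illustration only.
transactions = [
    Transaction(50.0, "alice", "bob", "atm"),
    Transaction(1200.0, "carol", "dave", "branch"),
    Transaction(20.0, "alice", "erin", "atm"),
    Transaction(75.0, "bob", "carol", "atm"),
]

print(f"{atm_share(transactions):.1f}% of transactions used a cash machine")
```

None of the customers set out to produce this percentage; it exists only because each transaction also leaves a record of where it took place.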

Medical Exhaust Data

Most medical devices emit some form of exhaust data, including many pacemakers, dialysis machines, and cameras used during surgery.[7] Most of this data is never captured and is discarded once the surgery is completed or the device makes its next routine check. Concerns have arisen over how the data captured by devices such as pacemakers is used, and these can lead to larger questions about the use of medical exhaust data in general.[8] Using electronic medical records (EMR) for research poses many challenges, the most prevalent being the sheer volume of data. There is too much data for people to sort through and analyze manually, which creates a need for algorithms.[9]

Solutions

Although data exhaust is not a new concept, it plays a much larger role in the contemporary world. The growth of technology has caused an increase in data exhaust. The collection and distribution of this data is not illegal, but steps must be taken to ensure that it is used ethically. To protect users' privacy, the information can be kept anonymous when it is sold. Users can also be given the opportunity to opt out of the sale of their information. Lastly, to avoid negative connotations, websites can update their privacy policies to list all the data that they will collect about the user.[10]
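As a rough illustration of the first two measures, the sketch below drops records from users who have opted out and replaces the remaining identifiers with keyed hashes before anything is shared. The event layout, the `SECRET_KEY` value, and the `prepare_for_sharing` helper are assumptions made for this example. Keyed hashing is strictly pseudonymization rather than full anonymization, since behavior patterns alone may still identify a person.

```python
import hashlib
import hmac

# Hypothetical secret held only by the data owner; a real deployment
# would load it from secure storage rather than hard-code it.
SECRET_KEY = b"replace-with-a-real-secret"


def pseudonymize(user_id: str) -> str:
    """Replace an identifier with a keyed SHA-256 hash (HMAC)."""
    return hmac.new(SECRET_KEY, user_id.encode("utf-8"), hashlib.sha256).hexdigest()


def prepare_for_sharing(events, opted_out):
    """Drop opted-out users and keep only non-identifying fields."""
    shared = []
    for event in events:
        if event["user_id"] in opted_out:
            continue  # respect the user's opt-out choice
        shared.append({
            "user": pseudonymize(event["user_id"]),
            "page": event["page"],
            # The IP address is deliberately not copied over.
        })
    return shared


# Made-up sample events, for illustration only.
events = [
    {"user_id": "alice@example.com", "page": "/home", "ip": "203.0.113.7"},
    {"user_id": "bob@example.com", "page": "/pricing", "ip": "203.0.113.8"},
]

shared = prepare_for_sharing(events, opted_out={"bob@example.com"})
print(shared)
```

Only the fields that are explicitly copied reach the shared output, so any field added to the events later stays private unless someone decides to include it.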

References

  1. ^ "What is Unconventional Data? - Definition from EU Glossary". Retrieved 2019-04-28.
  2. ^ Kosciejew, M. (2013). "The individual and big data". Feliciter, 59(6), 47.
  3. ^ "What is Data Exhaust? - Definition from Techopedia". Techopedia.com. Retrieved 2018-11-01.
  4. ^ Zuboff, Shoshana (2015). "Big other: surveillance capitalism and the prospects of an information civilization". Journal of Information Technology. doi:10.1057/jit.2015.5.
  5. ^ "What is Data Exhaust and What Can You Do With It?". www.datasciencecentral.com. Retrieved 2018-11-01.
  6. ^ "5 things you need to know about data exhaust".
  7. ^ Kitchin, Rob (2014-08-26). The Data Revolution: Big Data, Open Data, Data Infrastructures & Their Consequences. Los Angeles, California. ISBN 1446287483. OCLC 871211376.
  8. ^ "Our Medical Data Must Become Free". WIRED. Retrieved 2017-10-12.
  9. ^ "Healthcare data everywhere: the waste problem - AI Med". AI Med. 2018-05-09. Retrieved 2018-11-01.
  10. ^ "Dealing with data exhaust. - Free Online Library". www.thefreelibrary.com. Retrieved 2018-11-01.