EMRBots: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
AnomieBOT (talk | contribs)
m Dating maintenance tags: {{Cleanup-link rot}}
Filled in 10 bare reference(s) with reFill 2
Line 9: Line 9:


==Academic use==
==Academic use==
In April 2018 [[Bioinformatics (journal)]] published a study that relied on EMRBots data to create a new R package denoted as "comoRbidity".<ref>{{cite journal |last1=Gutiérrez-Sacristán |first1=Alba |last2=Bravo |first2=Àlex |last3=Giannoula |first3=Alexia |last4=Mayer |first4=Miguel A |last5=Sanz |first5=Ferran |last6=Furlong |first6=Laura I |last7=Kelso |first7=Janet |title=comoRbidity: an R package for the systematic analysis of disease comorbidities |journal=Bioinformatics |date=15 September 2018 |volume=34 |issue=18 |pages=3228–3230 |doi=10.1093/bioinformatics/bty315 |pmid=29897411 |pmc=6137966 }}</ref> Co-authors on the study included scientists from [[Universitat Pompeu Fabra]] and [[Harvard University]]. The repositories have been used to accelerate research, e.g., researchers from [[Michigan State University]], [[IBM Research]], and [[Cornell University]] published a study in the Knowledge Discovery and Data Mining (KDD) conference.<ref>{{cite web |url= http://www.kdd.org/kdd2017/papers/view/patient-subtyping-via-time-aware-lstm-networks |title= Patient Subtyping via Time-Aware LSTM Networks |website= Kdd.org |accessdate= 24 May 2018}}</ref><ref>{{cite web |url= http://www.kdd.org |title=SIGKDD |website= Kdd.org |accessdate= 24 May 2018}}</ref><ref>http://biometrics.cse.msu.edu/Presentations/InciBaytas_PatientSubtypingViaTimeAwareLSTMNetworks_KDD_2017.pdf</ref><ref>http://biometrics.cse.msu.edu/Publications/Thesis/InciBaytas_ContributionsToMatchineLearningInBiomedicalInformation.pdf</ref> Their study describes a novel neural network that performs better than the widely used [[long short-term memory]] neural network developed by [[Sepp Hochreiter]] and [[Jürgen Schmidhuber]] in 1997.<ref>{{cite journal |title=Long short-term memory |journal=Neural Comput.|volume=9 |issue=8 |pages=1735–1780 |year=1997 |last1=Hochreiter | first1 = Sepp | last2=Schmidhuber| first2 = Jürgen |doi=10.1162/neco.1997.9.8.1735|pmid=9377276}}</ref> In May 2018 scientists from [[IBM Research]] and [[Cornell University]] have used the repositories to test a new deep architecture denoted as Health-ATM. To demonstrate superiority over traditional neural networks, they applied their architecture to a congestive heart failure use case.<ref>{{Cite book | doi=10.1137/1.9781611975321.30|chapter = Health-ATM: A Deep Architecture for Multifaceted Patient Health Record Representation and Risk Prediction|title = Proceedings of the 2018 SIAM International Conference on Data Mining| pages=261–269|year = 2018|last1 = Ma|first1 = Tengfei| last2=Xiao| first2=Cao| last3=Wang| first3=Fei| isbn=978-1-61197-532-1}}</ref> Additional use includes [[The University of Chicago]] creating a highly-detailed tutorial demonstrating how to use R using the repositories,<ref>{{cite web|url=http://cri.uchicago.edu/wp-content/uploads/2018/02/CRI_StatisticalModeling_Methods.pdf|title=Statistical Modeling of Clinical Data|website=Cri.uchicago.edu|accessdate=24 May 2018}}</ref> [[University of California Merced]],<ref>{{Cite book|title=A dynamic cloud computing platform for eHealth systems - IEEE Conference Publication|pages=435–438|doi=10.1109/HealthCom.2015.7454539|chapter=A dynamic cloud computing platform for eHealth systems|year=2015|last1=Bahrami|first1=Mehdi|last2=Singhal|first2=Mukesh|isbn=978-1-4673-8325-7}}</ref><ref>{{cite web|url=http://cloudlab.ucmerced.edu/mehdi-bahrami-publication|title=Publication - UC Merced Cloud Lab|website=Cloudlab.ucmerced.edu}}</ref> and The [[University of Tampere]], Finland.<ref>{{cite web|url=https://people.uta.fi/~kostas.stefanidis/docs/recsys17/lecture08_fairgrouprecs.pdf|title=Fairness in Group Recommendations in the Health Domain|website=People.uta.fi|accessdate=24 May 2018}}</ref><ref>{{cite web|url=https://devpost.com/software/mlarapp|title=MLARAPP|website=Devpost.com|accessdate=24 May 2018}}</ref> Additional resources include.<ref>{{cite web|url=https://github.com/illidanlab/T-LSTM/blob/master/main.py|title=illidanlab/T-LSTM|website=GitHub|accessdate=24 May 2018}}</ref><ref>{{Cite book|doi=10.1007/978-3-319-98812-2_11|title = Database and Expert Systems Applications|volume = 11030|pages = 147–155|series = Lecture Notes in Computer Science|year = 2018|last1 = Stratigi|first1 = Maria|last2 = Kondylakis|first2 = Haridimos|last3 = Stefanidis|first3 = Kostas|isbn = 978-3-319-98811-5 |hdl=10024/104308 }}</ref><ref>{{cite biorxiv |title= Teaching data science fundamentals through realistic synthetic clinical cardiovascular data |biorxiv=232611}}</ref><ref>{{Cite book |title= PRIIME: A generic framework for interactive personalized interesting pattern discovery - IEEE Conference Publication |pages= 606–615 |doi= 10.1109/BigData.2016.7840653 |chapter= PRIIME: A generic framework for interactive personalized interesting pattern discovery |year= 2016 |last1= Bhuiyan |first1= Mansurul A. |last2= Hasan |first2= Mohammad Al |isbn= 978-1-4673-9005-7 |arxiv= 1607.05749 }}</ref><ref>{{cite web |url= http://dmgroup.cs.iupui.edu/files/student_thesis/MansurulBhuiyan_thesis.pdf |title= GENERIC FRAMEWORKS FOR INTERACTIVE PERSONALIZED INTERESTING PATTERN DISCOVERY |website= Dmgroup.cs.iupui.edu |accessdate= 24 May 2018}}</ref><ref>{{cite web |url= https://www.linkedin.com/pulse/exploratory-statistical-analysis-emr-data-where-angels-rajeev-gangal/|title=Exploratory Statistical Analysis of EMR data Or Where Angels Fear to tread…|date=17 October 2015|website=Linkedin.com}}</ref><ref>{{cite web|url=http://acictworld.blogspot.com/2015/12/robot.html|title=Robot|date=31 December 2015|website=Acictworld.blogspot.com|accessdate=24 May 2018}}</ref><ref>{{cite web|url=http://repository.sustech.edu/bitstream/handle/123456789/15777/Obstacle%20Avoider%20Robotic%20Vehicle.pdf?sequence=1|format=PDF|title=Obstacle Avoider Robotic Vehicle |website=Repository.sustech.edu|accessdate=24 May 2018}}</ref><ref>{{cite journal|title=Predictive delimiter for multiple sensitive attribute publishing|first1=M.|last1=Nithya|first2=T.|last2=Sheela|date=4 January 2018|journal=Cluster Computing|volume=22|pages=12297–12304|doi=10.1007/s10586-017-1612-y}}</ref><ref>{{Cite book | chapter-url=https://ieeexplore.ieee.org/document/7544820/ | doi=10.1109/IACC.2016.31| chapter=Semantic Interoperability and Data Mapping in EHR Systems| title=2016 IEEE 6th International Conference on Advanced Computing (IACC)| pages=117–122| year=2016| last1=Janaswamy| first1=Sreya| last2=Kent| first2=Robert D.| isbn=978-1-4673-8286-1| url=https://scholar.uwindsor.ca/etd/5645}}</ref><ref>{{cite web | url=https://conferences.oreilly.com/strata/strata-ny/public/schedule/detail/68054 | title=Improving patient screening by applying predictive analytics to electronic medical records.: Big data conference & machine learning training &#124; Strata Data}}</ref><ref>https://www.linkedin.com/pulse/part-deux-exploratory-analysis-emr-data-rajeev-gangal/</ref><ref>https://repository.eafit.edu.co/bitstream/handle/10784/13027/ElkinAndrés_VillaSámchez_2018.pdf</ref><ref>http://insticc.org/node/TechnicalProgram/ict4awe/presentationDetails/77986</ref><ref>https://xuc.me/file/paper/ICDE19a.pdf</ref><ref>{{cite journal| pmc=6416981 | pmid=30871520 | doi=10.1186/s12911-019-0793-0 | volume=19 | issue=1 | title=The validity of synthetic clinical data: a validation study of a leading synthetic data generator (Synthea) using clinical quality measures | year=2019 | journal=BMC Med Inform Decis Mak | page=44 | last1 = Chen | first1 = J | last2 = Chun | first2 = D | last3 = Patel | first3 = M | last4 = Chiang | first4 = E | last5 = James | first5 = J}}</ref><ref>{{cite journal| pmc=6416981 | pmid=30871520 | doi=10.1186/s12911-019-0793-0 | volume=19 | issue=1 | title=The validity of synthetic clinical data: a validation study of a leading synthetic data generator (Synthea) using clinical quality measures | year=2019 | journal=BMC Med Inform Decis Mak | page=44 | last1 = Chen | first1 = J | last2 = Chun | first2 = D | last3 = Patel | first3 = M | last4 = Chiang | first4 = E | last5 = James | first5 = J}}</ref><ref>https://www.scribd.com/document/411608802/RobertoCarlosCavalcantieCavalcanteDissertacao2018-EMRBots-org</ref><ref>https://www.ijitee.org/wp-content/uploads/papers/v8i11/J99270881019.pdf</ref><ref>http://sutir.sut.ac.th:8080/sutir/bitstream/123456789/7846/2/Fulltext.pdf</ref><ref>https://sigmodrecord.org/publications/sigmodRecord/1909/pdfs/full-issue.pdf</ref><ref>http://uclab.khu.ac.kr/resources/publication/C_404.pdf</ref><ref>http://api.sunlab.org/static/media/1fF/ai2/5b6aef0d241ba60001bec1bf.pdf</ref><ref>https://www.sciencedirect.com/science/article/pii/S1532046419302564</ref>
In April 2018 [[Bioinformatics (journal)]] published a study that relied on EMRBots data to create a new R package denoted as "comoRbidity".<ref>{{cite journal |last1=Gutiérrez-Sacristán |first1=Alba |last2=Bravo |first2=Àlex |last3=Giannoula |first3=Alexia |last4=Mayer |first4=Miguel A |last5=Sanz |first5=Ferran |last6=Furlong |first6=Laura I |last7=Kelso |first7=Janet |title=comoRbidity: an R package for the systematic analysis of disease comorbidities |journal=Bioinformatics |date=15 September 2018 |volume=34 |issue=18 |pages=3228–3230 |doi=10.1093/bioinformatics/bty315 |pmid=29897411 |pmc=6137966 }}</ref> Co-authors on the study included scientists from [[Universitat Pompeu Fabra]] and [[Harvard University]]. The repositories have been used to accelerate research, e.g., researchers from [[Michigan State University]], [[IBM Research]], and [[Cornell University]] published a study in the Knowledge Discovery and Data Mining (KDD) conference.<ref>{{cite web |url= http://www.kdd.org/kdd2017/papers/view/patient-subtyping-via-time-aware-lstm-networks |title= Patient Subtyping via Time-Aware LSTM Networks |website= Kdd.org |accessdate= 24 May 2018}}</ref><ref>{{cite web |url= http://www.kdd.org |title=SIGKDD |website= Kdd.org |accessdate= 24 May 2018}}</ref><ref>http://biometrics.cse.msu.edu/Presentations/InciBaytas_PatientSubtypingViaTimeAwareLSTMNetworks_KDD_2017.pdf</ref><ref>http://biometrics.cse.msu.edu/Publications/Thesis/InciBaytas_ContributionsToMatchineLearningInBiomedicalInformation.pdf</ref> Their study describes a novel neural network that performs better than the widely used [[long short-term memory]] neural network developed by [[Sepp Hochreiter]] and [[Jürgen Schmidhuber]] in 1997.<ref>{{cite journal |title=Long short-term memory |journal=Neural Comput.|volume=9 |issue=8 |pages=1735–1780 |year=1997 |last1=Hochreiter | first1 = Sepp | last2=Schmidhuber| first2 = Jürgen |doi=10.1162/neco.1997.9.8.1735|pmid=9377276}}</ref> In May 2018 scientists from [[IBM Research]] and [[Cornell University]] have used the repositories to test a new deep architecture denoted as Health-ATM. To demonstrate superiority over traditional neural networks, they applied their architecture to a congestive heart failure use case.<ref>{{Cite book | doi=10.1137/1.9781611975321.30|chapter = Health-ATM: A Deep Architecture for Multifaceted Patient Health Record Representation and Risk Prediction|title = Proceedings of the 2018 SIAM International Conference on Data Mining| pages=261–269|year = 2018|last1 = Ma|first1 = Tengfei| last2=Xiao| first2=Cao| last3=Wang| first3=Fei| isbn=978-1-61197-532-1}}</ref> Additional use includes [[The University of Chicago]] creating a highly-detailed tutorial demonstrating how to use R using the repositories,<ref>{{cite web|url=http://cri.uchicago.edu/wp-content/uploads/2018/02/CRI_StatisticalModeling_Methods.pdf|title=Statistical Modeling of Clinical Data|website=Cri.uchicago.edu|accessdate=24 May 2018}}</ref> [[University of California Merced]],<ref>{{Cite book|title=A dynamic cloud computing platform for eHealth systems - IEEE Conference Publication|pages=435–438|doi=10.1109/HealthCom.2015.7454539|chapter=A dynamic cloud computing platform for eHealth systems|year=2015|last1=Bahrami|first1=Mehdi|last2=Singhal|first2=Mukesh|isbn=978-1-4673-8325-7}}</ref><ref>{{cite web|url=http://cloudlab.ucmerced.edu/mehdi-bahrami-publication|title=Publication - UC Merced Cloud Lab|website=Cloudlab.ucmerced.edu}}</ref> and The [[University of Tampere]], Finland.<ref>{{cite web|url=https://people.uta.fi/~kostas.stefanidis/docs/recsys17/lecture08_fairgrouprecs.pdf|title=Fairness in Group Recommendations in the Health Domain|website=People.uta.fi|accessdate=24 May 2018}}</ref><ref>{{cite web|url=https://devpost.com/software/mlarapp|title=MLARAPP|website=Devpost.com|accessdate=24 May 2018}}</ref> Additional resources include.<ref>{{cite web|url=https://github.com/illidanlab/T-LSTM/blob/master/main.py|title=illidanlab/T-LSTM|website=GitHub|accessdate=24 May 2018}}</ref><ref>{{Cite book|doi=10.1007/978-3-319-98812-2_11|title = Database and Expert Systems Applications|volume = 11030|pages = 147–155|series = Lecture Notes in Computer Science|year = 2018|last1 = Stratigi|first1 = Maria|last2 = Kondylakis|first2 = Haridimos|last3 = Stefanidis|first3 = Kostas|isbn = 978-3-319-98811-5 |hdl=10024/104308 }}</ref><ref>{{cite biorxiv |title= Teaching data science fundamentals through realistic synthetic clinical cardiovascular data |biorxiv=232611}}</ref><ref>{{Cite book |title= PRIIME: A generic framework for interactive personalized interesting pattern discovery - IEEE Conference Publication |pages= 606–615 |doi= 10.1109/BigData.2016.7840653 |chapter= PRIIME: A generic framework for interactive personalized interesting pattern discovery |year= 2016 |last1= Bhuiyan |first1= Mansurul A. |last2= Hasan |first2= Mohammad Al |isbn= 978-1-4673-9005-7 |arxiv= 1607.05749 }}</ref><ref>{{cite web |url= http://dmgroup.cs.iupui.edu/files/student_thesis/MansurulBhuiyan_thesis.pdf |title= GENERIC FRAMEWORKS FOR INTERACTIVE PERSONALIZED INTERESTING PATTERN DISCOVERY |website= Dmgroup.cs.iupui.edu |accessdate= 24 May 2018}}</ref><ref>{{cite web |url= https://www.linkedin.com/pulse/exploratory-statistical-analysis-emr-data-where-angels-rajeev-gangal/|title=Exploratory Statistical Analysis of EMR data Or Where Angels Fear to tread…|date=17 October 2015|website=Linkedin.com}}</ref><ref>{{cite web|url=http://acictworld.blogspot.com/2015/12/robot.html|title=Robot|date=31 December 2015|website=Acictworld.blogspot.com|accessdate=24 May 2018}}</ref><ref>{{cite web|url=http://repository.sustech.edu/bitstream/handle/123456789/15777/Obstacle%20Avoider%20Robotic%20Vehicle.pdf?sequence=1|format=PDF|title=Obstacle Avoider Robotic Vehicle |website=Repository.sustech.edu|accessdate=24 May 2018}}</ref><ref>{{cite journal|title=Predictive delimiter for multiple sensitive attribute publishing|first1=M.|last1=Nithya|first2=T.|last2=Sheela|date=4 January 2018|journal=Cluster Computing|volume=22|pages=12297–12304|doi=10.1007/s10586-017-1612-y}}</ref><ref>{{Cite book | chapter-url=https://ieeexplore.ieee.org/document/7544820/ | doi=10.1109/IACC.2016.31| chapter=Semantic Interoperability and Data Mapping in EHR Systems| title=2016 IEEE 6th International Conference on Advanced Computing (IACC)| pages=117–122| year=2016| last1=Janaswamy| first1=Sreya| last2=Kent| first2=Robert D.| isbn=978-1-4673-8286-1| url=https://scholar.uwindsor.ca/etd/5645}}</ref><ref>{{cite web | url=https://conferences.oreilly.com/strata/strata-ny/public/schedule/detail/68054 | title=Improving patient screening by applying predictive analytics to electronic medical records.: Big data conference & machine learning training &#124; Strata Data}}</ref><ref>{{Cite web|url=https://www.linkedin.com/pulse/part-deux-exploratory-analysis-emr-data-rajeev-gangal|title=Part Deux : Exploratory Analysis of EMR data|website=www.linkedin.com}}</ref><ref>https://repository.eafit.edu.co/bitstream/handle/10784/13027/ElkinAndrés_VillaSámchez_2018.pdf</ref><ref>{{Cite web|url=http://insticc.org/node/TechnicalProgram/ict4awe/presentationDetails/77986|title=Technical Program|website=insticc.org}}</ref><ref>https://xuc.me/file/paper/ICDE19a.pdf</ref><ref name="auto1">{{cite journal| pmc=6416981 | pmid=30871520 | doi=10.1186/s12911-019-0793-0 | volume=19 | issue=1 | title=The validity of synthetic clinical data: a validation study of a leading synthetic data generator (Synthea) using clinical quality measures | year=2019 | journal=BMC Med Inform Decis Mak | page=44 | last1 = Chen | first1 = J | last2 = Chun | first2 = D | last3 = Patel | first3 = M | last4 = Chiang | first4 = E | last5 = James | first5 = J}}</ref><ref name="auto1"/><ref>{{Cite web|url=https://pt.scribd.com/document/411608802/RobertoCarlosCavalcantieCavalcanteDissertacao2018-EMRBots-org|title=RobertoCarlosCavalcantieCavalcanteDissertacao2018 / EMRBots.org &#124; Prontuário Eletrônico &#124; Aprendizado de Máquina|website=Scribd}}</ref><ref>https://www.ijitee.org/wp-content/uploads/papers/v8i11/J99270881019.pdf</ref><ref>http://sutir.sut.ac.th:8080/sutir/bitstream/123456789/7846/2/Fulltext.pdf</ref><ref>https://sigmodrecord.org/publications/sigmodRecord/1909/pdfs/full-issue.pdf</ref><ref>http://uclab.khu.ac.kr/resources/publication/C_404.pdf</ref><ref>http://api.sunlab.org/static/media/1fF/ai2/5b6aef0d241ba60001bec1bf.pdf</ref><ref>{{Cite journal|url=http://www.sciencedirect.com/science/article/pii/S1532046419302564|title=Deep learning for electronic health records: A comparative review of multiple deep neural architectures|first1=Jose Roberto|last1=Ayala Solares|first2=Francesca Elisa|last2=Diletta Raimondi|first3=Yajie|last3=Zhu|first4=Fatemeh|last4=Rahimian|first5=Dexter|last5=Canoy|first6=Jenny|last6=Tran|first7=Ana Catarina|last7=Pinho Gomes|first8=Amir H.|last8=Payberah|first9=Mariagrazia|last9=Zottoli|first10=Milad|last10=Nazarzadeh|first11=Nathalie|last11=Conrad|first12=Kazem|last12=Rahimi|first13=Gholamreza|last13=Salimi-Khorshidi|date=January 1, 2020|journal=Journal of Biomedical Informatics|volume=101|pages=103337|via=ScienceDirect|doi=10.1016/j.jbi.2019.103337}}</ref>


In March 2019 the repositories were used to enhance "Computationally-Enabled Medicine", a course given by Harvard Medical School.<ref>https://github.com/kartoun/IBM-Harvard-Workshop/</ref> Further in March, scientists from multiple institutions, including [[Peking University]], [[University of Tokyo]], and [[Polytechnic University of Milan]] used the repositories to develop a new framework focused on medical information privacy<ref>https://h-suwa.github.io/percom2019/papers/p282-li.pdf</ref>
In March 2019 the repositories were used to enhance "Computationally-Enabled Medicine", a course given by Harvard Medical School.<ref>{{Cite web|url=https://github.com/kartoun/IBM-Harvard-Workshop|title=kartoun/IBM-Harvard-Workshop|date=August 18, 2019|via=GitHub}}</ref> Further in March, scientists from multiple institutions, including [[Peking University]], [[University of Tokyo]], and [[Polytechnic University of Milan]] used the repositories to develop a new framework focused on medical information privacy<ref>https://h-suwa.github.io/percom2019/papers/p282-li.pdf</ref>


==Use in hackathons==
==Use in hackathons==
Line 18: Line 18:
EMRBots were presented at [[HackPrinceton]] 2018 organized by [[Princeton University]].<ref>{{cite web | url=https://hackprinceton.com/hack/workshops/ | title=HackPrinceton Fall 2018 Workshops| date=2018-11-10}}</ref><ref>{{cite web | url=https://figshare.com/articles/ADVANCING_INFORMATICS_WITH_ELECTRONIC_MEDICAL_RECORDS_BOTS/7325903 | title=Advancing informatics with electronic medical records bots (HackPrinceton 2018)| date=2018-11-10|last1=Kartoun|first1= Uri}}</ref><ref>https://hackprinceton.com/hack/web-resources/</ref>
EMRBots were presented at [[HackPrinceton]] 2018 organized by [[Princeton University]].<ref>{{cite web | url=https://hackprinceton.com/hack/workshops/ | title=HackPrinceton Fall 2018 Workshops| date=2018-11-10}}</ref><ref>{{cite web | url=https://figshare.com/articles/ADVANCING_INFORMATICS_WITH_ELECTRONIC_MEDICAL_RECORDS_BOTS/7325903 | title=Advancing informatics with electronic medical records bots (HackPrinceton 2018)| date=2018-11-10|last1=Kartoun|first1= Uri}}</ref><ref>https://hackprinceton.com/hack/web-resources/</ref>


EMRBots were presented at TreeHacks 2019 organized by [[Stanford University]].<ref>https://live.treehacks.com/</ref>
EMRBots were presented at TreeHacks 2019 organized by [[Stanford University]].<ref>{{Cite web|url=https://live.treehacks.com/|title=TreeHacks 2020|website=live.treehacks.com}}</ref>


==Availability==
==Availability==
The repositories can be downloaded after registration.<ref>http://emrbots.org</ref>
The repositories can be downloaded after registration.<ref>{{Cite web|url=http://www.emrbots.org/|title=EMRBOTS.ORG|website=EMRBOTS.ORG}}</ref>


The repositories are available to download from [[Figshare]] without registration.<ref>{{cite web | url=https://figshare.com/articles/A_100-patient_database/7040039 | title=EMRBots: A 100-patient database| date=2018-09-03}}</ref><ref>{{cite web | url=https://figshare.com/articles/A_10_000-patient_database/7040060 | title=EMRBots: A 10,000-patient database| date=2018-09-03}}</ref><ref>{{cite web | url=https://figshare.com/articles/EMRBots_a_100_000-patient_database/7040198 | title=EMRBots: A 100,000-patient database| date=2018-09-03}}</ref>
The repositories are available to download from [[Figshare]] without registration.<ref>{{cite web | url=https://figshare.com/articles/A_100-patient_database/7040039 | title=EMRBots: A 100-patient database| date=2018-09-03}}</ref><ref>{{cite web | url=https://figshare.com/articles/A_10_000-patient_database/7040060 | title=EMRBots: A 10,000-patient database| date=2018-09-03}}</ref><ref>{{cite web | url=https://figshare.com/articles/EMRBots_a_100_000-patient_database/7040198 | title=EMRBots: A 100,000-patient database| date=2018-09-03}}</ref>
Line 27: Line 27:
Full source code for creating the repositories is available to download from [[Figshare]].<ref>{{cite web | url=https://figshare.com/articles/EMRBots_full_source_code/7040204 | title=EMRBots: Full source code| date=2018-09-03}}</ref>
Full source code for creating the repositories is available to download from [[Figshare]].<ref>{{cite web | url=https://figshare.com/articles/EMRBots_full_source_code/7040204 | title=EMRBots: Full source code| date=2018-09-03}}</ref>


All source code for EMRBots is available in [[Elsevier]]'s [[Software Impacts]] [[GitHub]] site.<ref>https://github.com/SoftwareImpacts/SIMPAC-2019-8</ref><ref>https://www.journals.elsevier.com/software-impacts/</ref>
All source code for EMRBots is available in [[Elsevier]]'s [[Software Impacts]] [[GitHub]] site.<ref>{{Cite web|url=https://github.com/SoftwareImpacts/SIMPAC-2019-8|title=SoftwareImpacts/SIMPAC-2019-8|date=November 20, 2019|via=GitHub}}</ref><ref>{{Cite web|url=https://www.journals.elsevier.com/software-impacts/|title=Software Impacts|via=www.journals.elsevier.com}}</ref>


==Northwell Health's EMRBot==
==Northwell Health's EMRBot==
In May 2018 [[Northwell Health]] funded a project denoted as EMRBot in the health system's third annual innovation challenge.<ref>{{cite web | url=https://www.healthcare-informatics.com/article/ehr/northwell-health-it-leaders-are-revamping-ehr-ai-nlp-and-voice-tools | title=At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools}}</ref><ref>{{cite web | url=https://www.prnewswire.com/news-releases/northwell-rewards-innovative-employee-projects-with-funding-300648947.html | title=Northwell rewards innovative employee projects with funding}}</ref><ref>{{cite web | url=https://theislandnow.com/manhasset-107/northwell-rewards-innovative-employee-projects-with-500k-funding/ | title=Northwell rewards innovative employee projects with $500K funding| date=2018-05-16}}</ref><ref>{{cite web | url=https://www.northwell.edu/about/news/press-releases/northwell-funds-innovative-employee-projects | title=Northwell funds innovative employee projects &#124; Northwell Health}}</ref>
In May 2018 [[Northwell Health]] funded a project denoted as EMRBot in the health system's third annual innovation challenge.<ref name="auto">{{cite web | url=https://www.healthcare-informatics.com/article/ehr/northwell-health-it-leaders-are-revamping-ehr-ai-nlp-and-voice-tools | title=At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools}}</ref><ref>{{cite web | url=https://www.prnewswire.com/news-releases/northwell-rewards-innovative-employee-projects-with-funding-300648947.html | title=Northwell rewards innovative employee projects with funding}}</ref><ref>{{cite web | url=https://theislandnow.com/manhasset-107/northwell-rewards-innovative-employee-projects-with-500k-funding/ | title=Northwell rewards innovative employee projects with $500K funding| date=2018-05-16}}</ref><ref>{{cite web | url=https://www.northwell.edu/about/news/press-releases/northwell-funds-innovative-employee-projects | title=Northwell funds innovative employee projects &#124; Northwell Health}}</ref>
<ref>{{cite web | url=https://www.beckershospitalreview.com/healthcare-information-technology/emr-chatbot-takes-2nd-place-in-northwell-health-s-2018-innovation-challenge.html | title=EMR chatbot takes 2nd place in Northwell Health's 2018 innovation challenge}}</ref><ref>{{cite web | url=https://www.healthcare-informatics.com/article/ehr/northwell-health-it-leaders-are-revamping-ehr-ai-nlp-and-voice-tools | title=At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools}}</ref><ref>{{cite web | url=https://libn.com/2018/05/16/advanced-test-talking-medical-records-win-northwell-funding/ | title=Advanced test, talking medical records win Northwell funding| date=2018-05-16}}</ref><ref>https://amp.fox5vegas.com/story/38196671/%7B%7BampLink%7D%7D</ref><ref>{{cite web | url=https://huntingtonnow.com/tag/emrbot/ | title=EMRBot Archives –}}</ref><ref>{{cite web | url=http://www.smartbrief.com/branded/94A57BF5-E8A1-4598-B740-5CB55226F136/06E53646-263A-4690-986A-E861ED3A9638 | title=Researchers look for ways to make EHRs easier to use}}</ref><ref>{{cite web | url=https://www.crainsnewyork.com/article/20180709/PULSE/180709943/nyc-doctors-in-small-practices-experience-less-burnout-study-finds | title=NYC doctors in small practices experience less burnout, study finds| date=2018-07-06}}</ref><ref>{{cite web | url=https://www.bioportfolio.com/news/article/3677635/At-Northwell-Health-IT-Leaders-are-Revamping-the-EHR-with-AI-NLP.html | title=At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools}}</ref><ref>{{cite web | url=http://www.diagnosticoweb.com.br/noticias/gestao/ceo-da-northwell-health-defende-aplicacao-do-modelo-shark-tank-para-estimular-inovacao-em-saude.html | title=CEO da Northwell Health defende aplicação do modelo Shark Tank para estimular inovação em saúde}}</ref><ref>https://www.northwell.edu/sites/northwell/files/New-Standard-volume-1-2018_0.pdf</ref> [[Northwell Health]]'s EMRBot, however, is neither related to Uri Kartoun's website (registered as a domain name in April 2015; www.emrbots.org) nor to any of its repositories or applications.
<ref>{{cite web | url=https://www.beckershospitalreview.com/healthcare-information-technology/emr-chatbot-takes-2nd-place-in-northwell-health-s-2018-innovation-challenge.html | title=EMR chatbot takes 2nd place in Northwell Health's 2018 innovation challenge}}</ref><ref name="auto"/><ref>{{cite web | url=https://libn.com/2018/05/16/advanced-test-talking-medical-records-win-northwell-funding/ | title=Advanced test, talking medical records win Northwell funding| date=2018-05-16}}</ref><ref>https://amp.fox5vegas.com/story/38196671/%7B%7BampLink%7D%7D</ref><ref>{{cite web | url=https://huntingtonnow.com/tag/emrbot/ | title=EMRBot Archives –}}</ref><ref>{{cite web | url=http://www.smartbrief.com/branded/94A57BF5-E8A1-4598-B740-5CB55226F136/06E53646-263A-4690-986A-E861ED3A9638 | title=Researchers look for ways to make EHRs easier to use}}</ref><ref>{{cite web | url=https://www.crainsnewyork.com/article/20180709/PULSE/180709943/nyc-doctors-in-small-practices-experience-less-burnout-study-finds | title=NYC doctors in small practices experience less burnout, study finds| date=2018-07-06}}</ref><ref>{{cite web | url=https://www.bioportfolio.com/news/article/3677635/At-Northwell-Health-IT-Leaders-are-Revamping-the-EHR-with-AI-NLP.html | title=At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools}}</ref><ref>{{cite web | url=http://www.diagnosticoweb.com.br/noticias/gestao/ceo-da-northwell-health-defende-aplicacao-do-modelo-shark-tank-para-estimular-inovacao-em-saude.html | title=CEO da Northwell Health defende aplicação do modelo Shark Tank para estimular inovação em saúde}}</ref><ref>https://www.northwell.edu/sites/northwell/files/New-Standard-volume-1-2018_0.pdf</ref> [[Northwell Health]]'s EMRBot, however, is neither related to Uri Kartoun's website (registered as a domain name in April 2015; www.emrbots.org) nor to any of its repositories or applications.


==Criticism==
==Criticism==
Line 39: Line 39:
[[CareCloud]]
[[CareCloud]]


[[MDClone]]<ref>https://www.reuters.com/article/tech-mdclone-fundraising/israeli-healthcare-data-engine-firm-mdclone-raises-26-mln-idUSL5N25I25X</ref>
[[MDClone]]<ref>{{Cite web|url=https://www.reuters.com/article/tech-mdclone-fundraising-idUSL5N25I25X|title=Israeli healthcare data engine firm MDClone raises $26 mln|date=August 22, 2019|via=www.reuters.com}}</ref>


[[SyntheticMass]]<ref>https://synthea.mitre.org/</ref>
[[SyntheticMass]]<ref>https://synthea.mitre.org/</ref>

Revision as of 17:50, 3 February 2020

Uri Kartoun presenting EMRBots at Stanford University, Feb. 2019.

EMRBots are experimental artificially generated electronic medical records (EMRs).[1][2] The aim of EMRBots is to allow non-commercial entities (such as universities) to use the artificial patient repositories to practice statistical and machine-learning algorithms. Commercial entities can also use the repositories for any purpose, as long as they do not create software products using the repositories.

A letter published in Communications of the ACM emphasizes the importance of using synthetic medical data, "... EMRBots can generate a synthetic patient population of any size, including demographics, admissions, comorbidities, and laboratory values. A synthetic patient has no confidentiality restrictions and thus can be used by anyone to practice machine learning algorithms."[3]

Background

EMRs contain sensitive personal information. For example, they may include details about infectious diseases, such as human immunodeficiency virus (HIV), or they may contain information about a mental disorder. They may also contain other sensitive information such as medical details related to fertility treatments. Because EMRs are subject to confidentiality requirements, accessing and analyzing EMR databases is a privilege given to only a small number of individuals. Individuals who work at institutions that do not have access to EMR systems have no opportunity to gain hands-on experience with this valuable resource. Simulated medical databases are currently available; however, they are difficult to configure and are limited in their resemblance to real clinical databases. Generating highly accessible repositories of artificial patient EMRs while relying only minimally on real patient data is expected to serve as a valuable resource to a broader audience of medical personnel, including those who reside in underdeveloped countries.

Academic use

In April 2018 Bioinformatics (journal) published a study that relied on EMRBots data to create a new R package denoted as "comoRbidity".[4] Co-authors on the study included scientists from Universitat Pompeu Fabra and Harvard University. The repositories have been used to accelerate research, e.g., researchers from Michigan State University, IBM Research, and Cornell University published a study in the Knowledge Discovery and Data Mining (KDD) conference.[5][6][7][8] Their study describes a novel neural network that performs better than the widely used long short-term memory neural network developed by Sepp Hochreiter and Jürgen Schmidhuber in 1997.[9] In May 2018 scientists from IBM Research and Cornell University have used the repositories to test a new deep architecture denoted as Health-ATM. To demonstrate superiority over traditional neural networks, they applied their architecture to a congestive heart failure use case.[10] Additional use includes The University of Chicago creating a highly-detailed tutorial demonstrating how to use R using the repositories,[11] University of California Merced,[12][13] and The University of Tampere, Finland.[14][15] Additional resources include.[16][17][18][19][20][21][22][23][24][25][26][27][28][29][30][31][31][32][33][34][35][36][37][38]

In March 2019 the repositories were used to enhance "Computationally-Enabled Medicine", a course given by Harvard Medical School.[39] Further in March, scientists from multiple institutions, including Peking University, University of Tokyo, and Polytechnic University of Milan used the repositories to develop a new framework focused on medical information privacy[40]

Use in hackathons

Researchers from Carnegie Mellon University used EMRBots data at the CMU HackAuton hackathon to create a prediction tool.[41] Additional uses are available.[42]

EMRBots were presented at HackPrinceton 2018 organized by Princeton University.[43][44][45]

EMRBots were presented at TreeHacks 2019 organized by Stanford University.[46]

Availability

The repositories can be downloaded after registration.[47]

The repositories are available to download from Figshare without registration.[48][49][50]

Full source code for creating the repositories is available to download from Figshare.[51]

All source code for EMRBots is available in Elsevier's Software Impacts GitHub site.[52][53]

Northwell Health's EMRBot

In May 2018 Northwell Health funded a project denoted as EMRBot in the health system's third annual innovation challenge.[54][55][56][57] [58][54][59][60][61][62][63][64][65][66] Northwell Health's EMRBot, however, is neither related to Uri Kartoun's website (registered as a domain name in April 2015; www.emrbots.org) nor to any of its repositories or applications.

Criticism

"[EMRBots] are ... pregenerated datasets of synthetic EHR with an insufficient explanation of how the datasets were generated. These datasets exhibit several inconsistencies between health problems, age, and gender."[67][68] An additional criticism is described in a thesis ("Realism in Synthetic Data Generation") granted by Massey University.[69]

Other Synthetic Medical Data Resources

CareCloud

MDClone[70]

SyntheticMass[71]

References

  1. ^ Kartoun, Uri (September 2019). "Advancing informatics with electronic medical records bots (EMRBots)". Software Impacts. 2: 100006. doi:10.1016/j.simpa.2019.100006.
  2. ^ Kartoun, Uri (2016). "A methodology to generate virtual patient repositories". arXiv:1608.00570 [cs.CY].
  3. ^ CACM Staff (1 January 2018). "A leap from artificial to intelligence". Communications of the ACM. 61 (1): 10–11. doi:10.1145/3168260.
  4. ^ Gutiérrez-Sacristán, Alba; Bravo, Àlex; Giannoula, Alexia; Mayer, Miguel A; Sanz, Ferran; Furlong, Laura I; Kelso, Janet (15 September 2018). "comoRbidity: an R package for the systematic analysis of disease comorbidities". Bioinformatics. 34 (18): 3228–3230. doi:10.1093/bioinformatics/bty315. PMC 6137966. PMID 29897411.
  5. ^ "Patient Subtyping via Time-Aware LSTM Networks". Kdd.org. Retrieved 24 May 2018.
  6. ^ "SIGKDD". Kdd.org. Retrieved 24 May 2018.
  7. ^ http://biometrics.cse.msu.edu/Presentations/InciBaytas_PatientSubtypingViaTimeAwareLSTMNetworks_KDD_2017.pdf
  8. ^ http://biometrics.cse.msu.edu/Publications/Thesis/InciBaytas_ContributionsToMatchineLearningInBiomedicalInformation.pdf
  9. ^ Hochreiter, Sepp; Schmidhuber, Jürgen (1997). "Long short-term memory". Neural Comput. 9 (8): 1735–1780. doi:10.1162/neco.1997.9.8.1735. PMID 9377276.
  10. ^ Ma, Tengfei; Xiao, Cao; Wang, Fei (2018). "Health-ATM: A Deep Architecture for Multifaceted Patient Health Record Representation and Risk Prediction". Proceedings of the 2018 SIAM International Conference on Data Mining. pp. 261–269. doi:10.1137/1.9781611975321.30. ISBN 978-1-61197-532-1.
  11. ^ "Statistical Modeling of Clinical Data" (PDF). Cri.uchicago.edu. Retrieved 24 May 2018.
  12. ^ Bahrami, Mehdi; Singhal, Mukesh (2015). "A dynamic cloud computing platform for eHealth systems". A dynamic cloud computing platform for eHealth systems - IEEE Conference Publication. pp. 435–438. doi:10.1109/HealthCom.2015.7454539. ISBN 978-1-4673-8325-7.
  13. ^ "Publication - UC Merced Cloud Lab". Cloudlab.ucmerced.edu.
  14. ^ "Fairness in Group Recommendations in the Health Domain" (PDF). People.uta.fi. Retrieved 24 May 2018.
  15. ^ "MLARAPP". Devpost.com. Retrieved 24 May 2018.
  16. ^ "illidanlab/T-LSTM". GitHub. Retrieved 24 May 2018.
  17. ^ Stratigi, Maria; Kondylakis, Haridimos; Stefanidis, Kostas (2018). Database and Expert Systems Applications. Lecture Notes in Computer Science. Vol. 11030. pp. 147–155. doi:10.1007/978-3-319-98812-2_11. hdl:10024/104308. ISBN 978-3-319-98811-5.
  18. ^ "Teaching data science fundamentals through realistic synthetic clinical cardiovascular data". bioRxiv 232611. {{cite bioRxiv}}: Check |biorxiv= value (help)
  19. ^ Bhuiyan, Mansurul A.; Hasan, Mohammad Al (2016). "PRIIME: A generic framework for interactive personalized interesting pattern discovery". PRIIME: A generic framework for interactive personalized interesting pattern discovery - IEEE Conference Publication. pp. 606–615. arXiv:1607.05749. doi:10.1109/BigData.2016.7840653. ISBN 978-1-4673-9005-7.
  20. ^ "GENERIC FRAMEWORKS FOR INTERACTIVE PERSONALIZED INTERESTING PATTERN DISCOVERY" (PDF). Dmgroup.cs.iupui.edu. Retrieved 24 May 2018.
  21. ^ "Exploratory Statistical Analysis of EMR data Or Where Angels Fear to tread…". Linkedin.com. 17 October 2015.
  22. ^ "Robot". Acictworld.blogspot.com. 31 December 2015. Retrieved 24 May 2018.
  23. ^ "Obstacle Avoider Robotic Vehicle" (PDF). Repository.sustech.edu. Retrieved 24 May 2018.
  24. ^ Nithya, M.; Sheela, T. (4 January 2018). "Predictive delimiter for multiple sensitive attribute publishing". Cluster Computing. 22: 12297–12304. doi:10.1007/s10586-017-1612-y.
  25. ^ Janaswamy, Sreya; Kent, Robert D. (2016). "Semantic Interoperability and Data Mapping in EHR Systems". 2016 IEEE 6th International Conference on Advanced Computing (IACC). pp. 117–122. doi:10.1109/IACC.2016.31. ISBN 978-1-4673-8286-1.
  26. ^ "Improving patient screening by applying predictive analytics to electronic medical records.: Big data conference & machine learning training | Strata Data".
  27. ^ "Part Deux : Exploratory Analysis of EMR data". www.linkedin.com.
  28. ^ https://repository.eafit.edu.co/bitstream/handle/10784/13027/ElkinAndrés_VillaSámchez_2018.pdf
  29. ^ "Technical Program". insticc.org.
  30. ^ https://xuc.me/file/paper/ICDE19a.pdf
  31. ^ a b Chen, J; Chun, D; Patel, M; Chiang, E; James, J (2019). "The validity of synthetic clinical data: a validation study of a leading synthetic data generator (Synthea) using clinical quality measures". BMC Med Inform Decis Mak. 19 (1): 44. doi:10.1186/s12911-019-0793-0. PMC 6416981. PMID 30871520.{{cite journal}}: CS1 maint: unflagged free DOI (link)
  32. ^ "RobertoCarlosCavalcantieCavalcanteDissertacao2018 / EMRBots.org | Prontuário Eletrônico | Aprendizado de Máquina". Scribd.
  33. ^ https://www.ijitee.org/wp-content/uploads/papers/v8i11/J99270881019.pdf
  34. ^ http://sutir.sut.ac.th:8080/sutir/bitstream/123456789/7846/2/Fulltext.pdf
  35. ^ https://sigmodrecord.org/publications/sigmodRecord/1909/pdfs/full-issue.pdf
  36. ^ http://uclab.khu.ac.kr/resources/publication/C_404.pdf
  37. ^ http://api.sunlab.org/static/media/1fF/ai2/5b6aef0d241ba60001bec1bf.pdf
  38. ^ Ayala Solares, Jose Roberto; Diletta Raimondi, Francesca Elisa; Zhu, Yajie; Rahimian, Fatemeh; Canoy, Dexter; Tran, Jenny; Pinho Gomes, Ana Catarina; Payberah, Amir H.; Zottoli, Mariagrazia; Nazarzadeh, Milad; Conrad, Nathalie; Rahimi, Kazem; Salimi-Khorshidi, Gholamreza (January 1, 2020). "Deep learning for electronic health records: A comparative review of multiple deep neural architectures". Journal of Biomedical Informatics. 101: 103337. doi:10.1016/j.jbi.2019.103337 – via ScienceDirect.
  39. ^ "kartoun/IBM-Harvard-Workshop". August 18, 2019 – via GitHub.
  40. ^ https://h-suwa.github.io/percom2019/papers/p282-li.pdf
  41. ^ Gebert, Theresa; Jiang, Shuli; Sheng, Jiaxian (2018). "Characterizing Allegheny County opioid overdoses with an interactive data explorer and synthetic prediction tool". arXiv:1804.08830 [stat.AP].
  42. ^ "GitHub - gyaneshanand/Rajasthan_Hackathon_5.0". 2018-07-26.
  43. ^ "HackPrinceton Fall 2018 Workshops". 2018-11-10.
  44. ^ Kartoun, Uri (2018-11-10). "Advancing informatics with electronic medical records bots (HackPrinceton 2018)".
  45. ^ https://hackprinceton.com/hack/web-resources/
  46. ^ "TreeHacks 2020". live.treehacks.com.
  47. ^ "EMRBOTS.ORG". EMRBOTS.ORG.
  48. ^ "EMRBots: A 100-patient database". 2018-09-03.
  49. ^ "EMRBots: A 10,000-patient database". 2018-09-03.
  50. ^ "EMRBots: A 100,000-patient database". 2018-09-03.
  51. ^ "EMRBots: Full source code". 2018-09-03.
  52. ^ "SoftwareImpacts/SIMPAC-2019-8". November 20, 2019 – via GitHub.
  53. ^ "Software Impacts" – via www.journals.elsevier.com.
  54. ^ a b "At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools".
  55. ^ "Northwell rewards innovative employee projects with funding".
  56. ^ "Northwell rewards innovative employee projects with $500K funding". 2018-05-16.
  57. ^ "Northwell funds innovative employee projects | Northwell Health".
  58. ^ "EMR chatbot takes 2nd place in Northwell Health's 2018 innovation challenge".
  59. ^ "Advanced test, talking medical records win Northwell funding". 2018-05-16.
  60. ^ https://amp.fox5vegas.com/story/38196671/%7B%7BampLink%7D%7D
  61. ^ "EMRBot Archives –".
  62. ^ "Researchers look for ways to make EHRs easier to use".
  63. ^ "NYC doctors in small practices experience less burnout, study finds". 2018-07-06.
  64. ^ "At Northwell Health, IT Leaders are Revamping the EHR with AI, NLP and Voice Tools".
  65. ^ "CEO da Northwell Health defende aplicação do modelo Shark Tank para estimular inovação em saúde".
  66. ^ https://www.northwell.edu/sites/northwell/files/New-Standard-volume-1-2018_0.pdf
  67. ^ Walonoski, J; et al. (2017). "Synthea: An approach, method, and software mechanism for generating synthetic patients and the synthetic electronic health care record". J Am Med Inform Assoc. 25 (3): 230–238. doi:10.1093/jamia/ocx079. PMID 29025144.
  68. ^ "Corrigendum". Journal of the American Medical Informatics Association. 25 (7): 921. 2017. doi:10.1093/jamia/ocx147. PMC 6016640. PMID 29253166.
  69. ^ "Realism in Synthetic Data Generation" (PDF). Mro.massey.ac.nz. Retrieved 24 May 2018.
  70. ^ "Israeli healthcare data engine firm MDClone raises $26 mln". August 22, 2019 – via www.reuters.com.
  71. ^ https://synthea.mitre.org/