GroupLens Research: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
No edit summary
Line 1: Line 1:
{{Infobox university
{{Infobox university
|name = GroupLens Research, College of Science and Engineering, University of Minnesota
|name = GroupLens Research, College of Science and Engineering, University of Minnesota
|image_name =
|image_name =
|image_size =
|image_size =
|established = 1993 by [[John Riedl]] and [[Paul Resnick]]
|established = 1992 by [[John Riedl]] and [[Paul Resnick]]
|type = Research Laboratory
|type = Research Laboratory
|calendar = Semester
|calendar = Semester
Line 14: Line 14:
|logo = [[:File:UMinnesotaWordmark.png|200px|University Wordmark]]
|logo = [[:File:UMinnesotaWordmark.png|200px|University Wordmark]]
}}
}}




'''GroupLens Research''' is a research lab in the Department of Computer Science and Engineering at the [[University of Minnesota|University of Minnesota, Twin Cities]] specializing in [[recommender system]]s, [[virtual community|online communities]], [[mobile computing|mobile]] and [[ubiquitous computing|ubiquitious]] technologies, [[digital library|digital libraries]] and local [[geographic information system]]s.
'''GroupLens Research''' is a research lab in the Department of Computer Science and Engineering at the [[University of Minnesota|University of Minnesota, Twin Cities]] specializing in [[recommender system]]s, [[virtual community|online communities]], [[mobile computing|mobile]] and [[ubiquitous computing|ubiquitious]] technologies, [[digital library|digital libraries]] and local [[geographic information system]]s.


==History==
==History==
In 1992, [[John Riedl]] and [[Paul Resnick]] attended the [[Computer Supported Cooperative Work|CSCW]] conference together. After they heard keynote speaker Shumpei Kumon talk about the information economy<ref>{{cite conference
In 1992, [[John Riedl]] and [[Paul Resnick]] attended the
[[Computer Supported Cooperative Work|CSCW]] conference together. After
they heard keynote speaker Shumpei Kumon talk about the information
economy<ref>
{{cite conference
| title = From wealth to wisdom: a change in the social paradigm
| title = From wealth to wisdom: a change in the social paradigm
| first = Shumpei
| first = Shumpei
Line 29: Line 31:
| publisher = ACM Press
| publisher = ACM Press
| page = 3
| page = 3
| isbn = 0897915429
| isbn = 0897915429
| doi = 10.1145/143457.371587
| doi = 10.1145/143457.371587
| accessdate = 2009-12-29
| accessdate = 2009-12-29
| quote =
| quote =
}}</ref>, they originated the idea for a [[collaborative filtering]]
}}</ref>, they originated the idea for a [[collaborative filtering]] system used for [[Usenet|Usenet news]] that they'd later call GroupLens. GroupLens allowed for postings to be rated on a scale. These ratings allowed for the system to predict how much an individual would like an article, based on their ratings of previous articles. A [[feasibility study|feasibility test]] was done between [[MIT]] and the University of Minnesota and the published results were part of the CSCW conference of 1994.<ref>{{cite conference
system used for [[Usenet|Usenet news]] that they'd later call
GroupLens. GroupLens collected ratings from Usenet readers, and used
those ratings to predict how much other readers would like each
article. GroupLens was one of the first automated collaborative
filtering systems, in which algorithms were used to automatically form
predictions based on historical patterns of ratings. The overall
system was called "GroupLens", and the servers that collected the
ratings and performed the computation were called the "Better Bit
Bureau". (This name was later dropped after a request from the
[[Better Business Bureau]].)

A [[feasibility study|feasibility test]] was done between [[MIT]] and
the University of Minnesota and a research paper was published including
the algorithm, the system design, and the results of the feasibility
study in the CSCW conference of 1994.<ref>
{{cite conference
| title = GroupLens: an open architecture for collaborative filtering of netnews
| title = GroupLens: an open architecture for collaborative filtering of netnews
| first = Paul
| first = Paul
| last = Resnick
| last = Resnick
| coauthors = et al
| coauthors = Neophytos Iacovou; Mitesh Suchak; Peter Bergstrom; John Riedl
| year = 1994
| year = 1994
| conference = Conference on Supporting Group Work
| conference = Computer Supported Cooperative Work
| booktitle = Proceedings of the 1994 international ACM conference on Supporting group work
| booktitle = Proceedings of the 1994 International ACM Conference on
Computer Supported Cooperative Work
| publisher = ACM Press
| publisher = ACM Press
| pages = 175&ndash;186
| pages = 175&ndash;186
| isbn = 0897916891
| isbn = 0897916891
| doi = 10.1145/192844.192905
| doi = 10.1145/192844.192905
| accessdate = 2009-12-29
| accessdate = 2009-12-30
| quote =
| quote =
}}
}}
</ref>
</ref>


In 1995, Riedl and Resnick invited [[Joseph Konstan]] to join the
In 1995, GroupLens Research expanded the team and hired [[Joseph Konstan]], Bradley Miller, and later others, to re-implement GroupLens over the internet with a centralized server, called the Better Bit Bureau.<ref>This name was later changed under threats from the [[Better Business Bureau]].</ref> Later, in the Spring of 1996, the first workshop on [[collaborative filtering]] was put together by Resnick and [[Hal Varian]] at the [[University of California, Berkeley]]. There, researchers from GroupLens and other projects that were studying similar systems came together to explore what each other were doing.
team, because of his expertise<ref>
{{cite journal
| url = http://bulletin.sigchi.org/2004/march/presidents-report-advancing-the-field/
| title = President's Report: Advancing the Field
| first = Joseph
| last = Konstan
| journal = SIGCHI Bulletin
| volume = 36
| issue = 2
| date = March/April 2004
| pages =
| accessdate = 2009-12-30
}}</ref>
in [[user interface]]s. The team
decided to create a higher-performance implementation of the
algorithms to support larger-scale deployments. In summer 1995 Riedl,
Resnick, and Konstan gathered [[Bradley Miller]], [[David Maltz]],
[[Jon Herlocker]], and [[Mark Claypool]] for "Hack Week" to create
the new implementation, and to plan the next round of experiments.


In the Spring of 1996, the first workshop on
*NetPerceptions: David Gardiner, [[Steven Snyder]], Brad Miller, John Riedl, Joseph Konstan
[[collaborative filtering]] was put together by Resnick and
*E-commerce awards<ref>{{Citation
[[Hal Varian]] at the [[University of California, Berkeley]].<ref>
{{cite web
| url = http://www2.sims.berkeley.edu/resources/collab/
| title = Collaborative Filtering
| date = March 16, 1996
| accessdate = 2009-12-30
}}</ref>
There, researchers from projects around the world
[[CHECKME: any from outside the US]]
that were studying similar systems came together to share ideas and
experience.

In the summer of 1996, [[David Gardiner]], a former Ph.D. student of
Riedl's, introduced Riedl to [[Steven Snyder]]. Snyder had been one
of the early employees at Microsoft, but had left Microsoft to come to
Minnesota to do a Ph.D. in Psychology. He realized the commercial
potential of collaborative filtering, and encouraged the team to
found a company in April 1996. By June, Gardiner, Snyder, Miller,
Riedl, and Konstan had incorporated their company, and by July they
had their first round of funding, from the [[Hummer-Winblad]] venture
capital company<ref>
{{cite web
| url = http://news.minnesota.publicradio.org/features/199912/01_newsroom_dotcom/images/dotcom.pdf
| title = Minnesota in the .Com Age
| publisher = Minnesota Public Radio
| year = 1999
| accessdate = 30 Dec 2009
| quote =
}}
</ref>.
Net Perceptions went on to be one of the leading companies in
personalization<ref>
{{Citation
| last = MIT News Office
| last = MIT News Office
| first =
| first =
Line 61: Line 132:
| url = http://web.mit.edu/newsoffice/1999/ecomm-0519.html
| url = http://web.mit.edu/newsoffice/1999/ecomm-0519.html
| accessdate = February 2008
| accessdate = February 2008
}}</ref>
}}
</ref><ref>
{{Citation
| last = Dragan
| first = Richard
| author-link =
| publication-date =
| month = January
| year = 2001
| title = Net Perceptions for E-commerce 6.0
| periodical = PC Magazine
| series =
| publication-place =
| place =
| publisher =
| volume =
| issue =
| pages =
| url = http://www.pcmag.com/article2/0,2704,107446,00.asp
| issn =
| doi =
| oclc =
| accessdate = January 2008
}}
</ref> during the Internet boom of the late 1990s, and stayed
in business until 2004. Based on their experience, Riedl and Konstan wrote a book about the lessons learned from deploying recommenders in practice<ref>
{{cite book
| title = Word of Mouse: The Marketing Power of Collaborative Filtering
| last1 = Riedl
| first1 = John
| last2 = Konstan
| first2 = Joseph
| last3 = Vrooman
| first3 = Eric
| month = August
| year = 2002
| isbn = 9780759527270
| quote =
}}
</ref>. Recommender systems have since become ubiquitous in the online world, with leading vendors such as Amazon and Netflix deploying highly sophisticated recommender systems<ref>
*{{cite journal
| url = http://online.wsj.com/article/SB121633741917263835.html?mod=googlenews_wsj
| title = Technology Gets Personal
| publisher = The Wall Street Journal
| date = July 18, 2008
| accessdate = 23 Dec 2009
| quote =
}}
</ref>. Netflix even offered a $1,000,000 prize for improvements in recommender technology<ref>{{cite journal
| url = http://www.nytimes.com/2006/10/02/technology/02iht-netflix.2998832.html?scp=1&sq=john%20riedl&st=cse
| title = Netflix offers cash for good suggestions
| publisher = The New York Times
| date = October 2, 2006
| accessdate = 23 Dec 2009
| quote =
}}</ref>.


*1996-2004. Commercialized tech from GroupLens


Meanwhile, research continued at the University of Minnesota where the the area of [[human–computer interaction]] became GroupLens Research, named after the first system of collaborative filtering by Riedl and Resnick. The department was given its first grant from the [[National Science Foundation]] to study algorithmic issues in collaborative filtering<ref>{{cite web
| url = http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=9734442
| title = CAREER: Algorithmic Issues in Collaborative Filtering
| date = May 8, 1998
| accessdate = December 14, 2009
}}</ref>.


Meanwhile, research continued at the University of Minnesota. When
When the [[EachMovie]] site closed in 1997{{fact}}, the researchers behind it gave out, for free, the anonymous rating data behind the site. The GroupLens researchers Brent Dahlen and John Herlocker took advantage of this dataset and had the MovieLens movie recommendation engine running within a few months. Since then, the site has had substantial success in attracting publicity and users and its data set has been used in numerous research papers.{{fact}}
the [[EachMovie]]<ref>

{{cite conference
{{Notice|Anything cool happen between 1997 and 2002?}}
| url = http://www.springerlink.com/content/tta6jqjb2gryxafu/

| title = An Adaptive Recommendation System with a Coordinator Agent
* The Science of the Sleeper<ref>{{cite journal
| first = Myungeun
| last = Lim
| coauthors = Juntae Kim
| year = 2001
| conference = Asia-Pacific Conference on Web Intelligence
| booktitle = Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
| series = Lecture Notes in Computer Science
| volume = 2198/2001
| publisher = Springer Berlin/Heidelberg
| pages = 438&ndash;442
| isbn = 9783540427308
| doi = 10.1007/3-540-45490-X_56
| accessdate = 2009-12-30
| quote =
}}
</ref>
site closed in 1997, the researchers behind it generously released
the anonymous rating data they had collected, for other researchers
to use. The GroupLens research team, led by Brent Dahlen and Jon
Herlocker, used this dataset to jumpstart a new movie recommendation
site, called MovieLens. They were able to get the first version of
the MovieLens movie recommendation site running within a few months.
Since 1997, MovieLens has been a very visible research recommender
site, including a detailed discussion in a New Yorker article by
[[Malcolm Gladwell]]<ref>
{{cite journal
| url = http://www.gladwell.com/1999/1999_10_04_a_sleeper.htm
| url = http://www.gladwell.com/1999/1999_10_04_a_sleeper.htm
| title = Annals of Marketing: The Science of the Sleeper: How the Information Age Could Blow Away the Blockbuster
| title = Annals of Marketing: The Science of the Sleeper: How the Information Age Could Blow Away the Blockbuster
| first = Malcom
| first = Malcolm
| last = Gladwell
| last = Gladwell
| journal = New Yorker
| journal = New Yorker
Line 87: Line 232:
| pages = 48&ndash;55
| pages = 48&ndash;55
| accessdate = 2009-12-29
| accessdate = 2009-12-29
}}</ref>
}}</ref>,
* ABC Nightline<ref>{{cite web
and a report in a full episode of ABC Nightline<ref>
{{cite web
| last=Krulwich
| last=Krulwich
| first=Robert
| first=Robert
Line 96: Line 242:
| url = http://tvnews.vanderbilt.edu/program.pl?ID=649063
| url = http://tvnews.vanderbilt.edu/program.pl?ID=649063
| accessdate = February 2008
| accessdate = February 2008
}}</ref>
}}</ref>.


Between 1997 and 2002 the group continued its research on
In 2002 the group expanded into [[social computing]] and [[virtual community|online communities]] with the addition of [[Loren Terveen]].
collaborative filtering, which became known in the community by the
* PHOAKS paper<ref>{{cite journal
more general term of [[recommender systems]]. In 2002 the group
expanded into [[social computing]] and [[virtual community|online
communities]] with the addition of [[Loren Terveen]], who was known
for his research on more social recommender systems such as PHOAKS
<ref>
*{{cite book
| url =
| title = From Usenet to CoWebs: Interacting with Social Information Spaces
| editor = Christopher Lueg and Danyel Fisher
| publisher = Springer
| year = 2003
| isbn = 978-1852335328
}}
</ref>
<ref>
{{cite journal
| author = Terveen, L., et al
| author = Terveen, L., et al
| title = PHOAKS: a system for sharing recommendations
| title = PHOAKS: a system for sharing recommendations
Line 111: Line 273:
| issn = 00010782
| issn = 00010782
| doi = 10.1145/245108.245122
| doi = 10.1145/245108.245122
}}</ref>.
}}</ref>, Loren was doing other types of recommendations -- more socially oriented


In order to broaden the set of research ideas and tools they used,
{{Notice|Anything cool happen between 2002 and 2009?}}
Riedl, Konstan, and Terveen invited colleagues in social psychology

([[Robert Kraut]] and [[Sara Kiesler]], of the
* The Value of Information (Mamun)<ref>{{cite conference
[[Carnegie Mellon Human Computer Interaction Institute]]), and
| url = http://www.grouplens.org/node/179
economic and social analysis ([[Paul Resnick]] and [[Yan Chen]] of the
| title = Getting to know you: learning new user preferences in recommender systems
[[University of Michigan School of Information]]) to collaborate. The
| first = Al M.
new, larger team adopted the name [[CommunityLab]], and looked
| last = Rashid
generally at the effects of technological interventions on the
performance of online communities. For instance, some of their
research explored technology for enriching conversation systems
<ref>
{{cite conference
| url = http://www.grouplens.org/node/107
| title = Talk amongst yourselves: inviting users to participate in online conversations
| first = F. Maxwell
| last = Harper
| coauthors = et al
| coauthors = et al
| year = 2002
| year = 2007
| conference = International Conference on Intelligent User Interfaces
| conference = International Conference on Intelligent User Interfaces
| conferenceurl = http://iuiconf.org/
| conferenceurl = http://iuiconf.org/
| booktitle = Proceedings of the 7th international conference on Intelligent user interfaces
| booktitle = Proceedings of the 12th international conference on Intelligent user interfaces
| publisher = ACM Press
| publisher = ACM Press
| pages = 127&ndash;134
| pages = 62&ndash;71
| isbn = 1581134592
| isbn = 1595934812
| doi = 10.1145/502716.502737
| doi = 10.1145/1216295.1216313
| accessdate = 2009-12-29
| accessdate = 2009-12-29
| quote =
| quote =
}}</ref>
}}</ref>,
while other research explored the personal, social, and economic
* Also Paul Resnick, and Yan Chen of UMich. More of an economics perspective. User Modelling (Max, 2004)<ref>{{cite conference
motivations for user ratings<ref>
{{cite conference
| url = http://www.grouplens.org/node/135
| url = http://www.grouplens.org/node/135
| title = An Economic Model of User Rating in an Online Recommender System
| title = An Economic Model of User Rating in an Online Recommender System
Line 151: Line 324:
| quote =
| quote =
}}</ref>
}}</ref>
<ref>
* Information Leakage paper (DanFr) in SIGIR<ref>{{cite conference
{{cite conference
| url = http://www.grouplens.org/node/124
| title = Motivating Participation by Displaying the Value of Contribution
| first = Al Mamunur
| last = Rashid
| coauthors = Ling, K.; Tassone, R.D.; Resnick, P.; Kraut, R.; Riedl, J.
| year = 2006
| conference = ACM SIGCHI Conference on Human Factors in Computing Systems
| booktitle = Proceedings of the 2006 CHI Conference
| series = Lecture Notes in Computer Science
| volume = 3538
| publisher = Springer
| pages = 955&ndash;958
| isbn = 1-59593-372-7
| doi = 10.1145/1124772.1124915
| accessdate = 2009-12-30
}}</ref>.

In 2008 GroupLens launched [http://cyclopath.org Cyclopath], a
computational geowiki for bicyclists within a city.
<ref>
*{{cite journal
| url = http://www.startribune.com/lifestyle/25620134.html
| title = Biking website pools cyclists' expertise
| publisher = Minneapolis Star Tribune (MN)
| date = July 18, 2008
| accessdate = January 4, 2010
}}
</ref>
<ref>
*{{cite journal
| title = Mapquest for the cycling set
| publisher = Saint Paul Pioneer Press (MN)
| date = July 19, 2008
}}
</ref>


==Contributions==

* '''The MovieLens recommender system:''' MovieLens is a non-commercial movie recommender system that has been running for over a decade now, with over 164,000 unique visitors to date, who have provided over 15 million movie ratings<ref>
{{cite conference
| url = http://www.grouplens.org/node/412
| title = Tagsplanations: Explaining Recommendations using Tags
| first = Jesse
| last = Vig
| coauthors = Shilad Sen; John Riedl
| year = 2009
| conference = International Conference on Intelligent User Interfaces
| conferenceurl = http://iuiconf.org/
| booktitle = Proceedings of the 13th international conference on Intelligent user interfaces
| publisher = ACM Press
| pages = 47&ndash;56
| isbn = 9781605581682
| doi = 10.1145/1502650.1502661
| accessdate = 2009-12-30
}}
</ref>.

* '''MovieLens ratings datasets:''' In the early days of recommender systems, research was slowed down by the lack of publicly available datasets. In response to requests from other researchers, GroupLens released [http://www.grouplens.org/node/73 three datasets]: the MovieLens 100,000 rating dataset, the MovieLens 1,000,000 rating dataset, and the MovieLens 10,000,000 rating dataset. These datasets became the standard datasets for recommender research,and have been used in over 300 papers by researchers around the world. The dataset is also being used for teaching about recommender technology<ref>{{cite book
| title = Programming Collective Intelligence: Building Smart Web 2.0 Applications
| last = Segaran
| first = Toby
| publisher = O'Reilly Media
| month = August
| year = 2007
| isbn = 9780596529321
| quote = Uses the MovieLens dataset
}}</ref>.


* '''MovieLens tagging dataset:''' GroupLens added tagging to MovieLens in 2006. Since then, users have provided over 85,000 applications of 14,000
unique tags to movies<ref>
{{cite conference
| url = http://www.grouplens.org/node/279
| title = Tagommenders: Connecting Users to Items through Tags
| first = Shilad
| last = Sen
| coauthors = Vig, J.; Riedl, J.
| year = 2009
| conference = International World Wide Web Conference
| conferenceurl = http://www2009.org/
| booktitle = Proceedings of the 18th international conference on World wide web
| publisher = ACM Press
| pages = 671&ndash;680
| isbn = 9781605584874
| doi = 10.1145/1526709.1526800
| accessdate = 2009-12-29
| quote =
}}
</ref>. The MovieLens 10,000,000 ratings dataset also includes a 100,000 tag applications dataset for researchers to use.

* '''Information leakage from recommender datasets:''' a paper in the ''information retrieval'' conference analyzed the privacy risks to users of having large recommender datasets released. The basic risk discovered is that an anonymized dataset might be combined with public information to identify a user. For instance, a user who has written about his preference for movies on online forums could be associated with a specific row in the MovieLens datasets. In some cases, these associations might leak information the user would prefer to keep private.<ref>
{{cite conference
| url = http://www.grouplens.org/node/118
| url = http://www.grouplens.org/node/118
| title = You are what you say: privacy risks of public mentions
| title = You are what you say: privacy risks of public mentions
Line 168: Line 435:
| quote =
| quote =
}}</ref>
}}</ref>
** Mentioned by Bruce Schneier<ref>{{cite web
<ref>{{cite web
| url = http://www.schneier.com/blog/archives/2006/08/privacy_risks_o_1.html
| url = http://www.schneier.com/blog/archives/2006/08/privacy_risks_o_1.html
| title = Schneier on Security: A blog covering security and security technology
| title = Schneier on Security: A blog covering security and security technology
Line 176: Line 443:
| year = 2006
| year = 2006
| accessdate = 2009-12-29
| accessdate = 2009-12-29
}}</ref>
* NYT article, John Riedl<ref>{{cite web
| url = http://www.nytimes.com/2006/10/02/technology/02netflix.html
| title = And if You Liked the Movie, a Netflix Contest May Reward You Handsomely
| last = Hafner
| first = Katie
| date = October 2,2006
| publisher = The New York Times
| accessdate = 2009-12-29
| quote = John Riedl, a professor of computer science at the University of Minnesota and a pioneer in the field of collaborative filtering, said that Netflix and Amazon now had the most advanced recommendation systems...
}}</ref>
* Collaborations with Bob Kraut and Sara Kiesler of CMU. Incorporated more social psychology work (ConversationLens)<ref>{{cite conference
| url = http://www.grouplens.org/node/107
| title = Talk amongst yourselves: inviting users to participate in online conversations
| first = F. Maxwell
| last = Harper
| coauthors = et al
| year = 2007
| conference = International Conference on Intelligent User Interfaces
| conferenceurl = http://iuiconf.org/
| booktitle = Proceedings of the 12th international conference on Intelligent user interfaces
| publisher = ACM Press
| pages = 62&ndash;71
| isbn = 1595934812
| doi = 10.1145/1216295.1216313
| accessdate = 2009-12-29
| quote =
}}</ref>
}}</ref>


GroupLens Research has since diversified from collaboriative filtering into the study of [[online communities]]{{fact}}, [[mobile computing|mobile]] and [[ubiquitous computing|ubiquitious]] technologies{{fact}}, [[digital libraries]]{{fact}}, and local [[geographic information systems]]{{fact}}.


* '''Wikipedia research:''' The study of value and vandalism in Wikipedia published in 2007<ref>{{cite conference
==Current Research Grants==
{| class="wikitable" style="text-align:center"; border="1"
! Grant Name!! Dates!! Professors Involved
|-
| Understanding Online Volunteer Communities: Toward Theory-Based Design || 2008-2013 || Joseph Konstan, John Riedl, Loren Terveen
|-
| Gender Identity and HIV Risk II || 2007-2012 || Joseph Konstan
|-
| Understanding and Supporting Online Question-Answering Sites || 2008-2011 || Joseph Konstan
|-
| Structural Factors to Lower Alcohol-Related HIV Risk || 2006-2011 || Joseph Konstan
|-
| Collaborative Research: Solving Critical Problems in Online Groups || 2007-2010 || John Riedl
|-
| Men's INTernet Study II (MINTS-II) for HIV Prevention || 2001-2009 || Joseph Konstan
|-
| Enhanced Digital Libraries through Recommendation: Exploring the use of citations, personal bibliographies, and metadata to synthesize library services for individuals || 2006-2009 || Joseph Konstan and John Riedl
|}

==Contributions==
* MovieLens - Members of GroupLens Research constructed and maintain [[MovieLens]], a non-commercial movie recommendation site that has hosted experiments in collaborative filtering algorithms{{fact}}, tagging interfaces{{fact}}, and other features since 1997. The datasets released by the MovieLens team have become a standard test set in the recommender systems research community
* Wikipedia - Since their initial study into value and vandalism in Wikipedia published in 2007<ref>{{cite conference
| url = http://www.grouplens.org/node/113
| url = http://www.grouplens.org/node/113
| title = Creating, Destroying, and Restoring Value in Wikipedia
| title = Creating, Destroying, and Restoring Value in Wikipedia
Line 244: Line 462:
| accessdate = 2009-12-29
| accessdate = 2009-12-29
| quote =
| quote =
}}</ref> described the concentration of contribution across Wikipedia editors. This paper was one of the first to focus on the length of time that a contribution survives within Wikipedia as a measure of its value. The paper also investigated the effects of vandalism on Wikipedia readers, by measuring the probability that a view of a page would capture that page in a vandalized state. GroupLens has also explored ways to help editors find pages that can effectively contribute to with the [[User:SuggestBot|SuggestBot]] robot recommender<ref>
}}</ref>, GroupLens has provided insight wiki-work recommendations{{fact}}, editing behaviors of Wikipedians{{fact}}, how the encyclopedia's library is growing{{fact}} and the informal peer review system that decides what content is accepted{{fact}}.
SuggestBot paper by Cosley
* Cyclopath - In 2008, GroupLens member Reid Preidhorsky launched Cyclopath, a computational geowiki for local bicyclists.
</ref>.
The group has also explored the evolution of the norms in Wikipedia that determine which articles are accepted or rejected, and the effect of changes in those norms on the [Long tail] of Wikipedia articles<ref>Tony's paper on Long Tail</ref>. GroupLens has also explored the functioning of the informal peer review system within Wikipedia to discover ways the decisions being made appear to be influenced inappropriately by ownership, and that experience does not seem to change editor performance very much<ref>Aaron's paper on peer review</ref><ref>Katie's paper on Born not made</ref>.GroupLens researchers have also explored visualizations of the edit history of Wikipedia articles<ref>Michael paper</ref>.


* '''Shilling recommender systems:''' GroupLens has explored ways that users of recommender systems can attempt to inappropriately influence the recommendations given to other users<ref>Lam Shilling paper</ref>. They call this behavior [[shill|shilling]], because of its relationship to the practice of hiring associates to pretend to be enthusiastic customers. They showed that some types of shilling are likely to be effective in practice. One concern about shilling is that the false predictions may change the reported opinions of later users, further corrupting the recommendations<ref>
==List of Publications==
# {{cite conference
{{cite journal
| url = http://www.nytimes.com/2003/05/22/technology/false-web-ratings-swing-opinion-study-says.html?scp=5&sq=john%20riedl&st=cse
| url = http://www.grouplens.org/node/279
| title = Tagommenders: Connecting Users to Items through Tags
| title = False Web Ratings Swing Opinion, Study Says
| first = Shilad
| date = May 22, 2003
| publisher = The New York Times
| last = Sen
| coauthors = Vig, J.; Riedl, J.
| accessdate = 23 Dec 2009
| year = 2009
| quote = Shilling
}}</ref>
| conference = International World Wide Web Conference
<ref>Cosley is seeing believing paper</ref>
| conferenceurl = http://www2009.org/

| booktitle = Proceedings of the 18th international conference on World wide web
* '''Cyclopath:''' Beginning in 2008 GroupLens launched Cyclopath, a computational geowiki for local bicyclists. Cyclopath has since been used by hundreds of cyclists within the Twin Cities<ref>Cyclopath paper that talks about usage</ref>. More recently, Cyclopath has been adopted by the Metropolitan Council of the Twin Cities to help plan the regional cycling system<ref>http://www.bikewalktwincities.org/projects/ntp-program-area/cycloplan</ref>.
| publisher = ACM Press
| pages = 671&ndash;680
| isbn = 9781605584874
| doi = 10.1145/1526709.1526800
| accessdate = 2009-12-29
| quote =
}}


== References ==
== References ==
Line 270: Line 484:


== External links ==
== External links ==
* [http://www.movielens.org/ The MovieLens Recommender System]
* [http://www.grouplens.org/ GroupLens Research]
* [http://www.grouplens.org/ GroupLens Research]

*{{cite book
*{{cite book
| url = http://www.amazon.com/reader/0198238207?_encoding=UTF8&ref_=sib_books_pg&qid=1261537446&query=GroupLens#reader_0198238207
| url = http://www.amazon.com/reader/0198238207?_encoding=UTF8&ref_=sib_books_pg&qid=1261537446&query=GroupLens#reader_0198238207
Line 281: Line 497:
| isbn = 9780198238201
| isbn = 9780198238201
| doi = 10.1093/0198238207.001.0001
| doi = 10.1093/0198238207.001.0001
| quote =
| quote = someone should check this out
}}
*{{cite book
| title = Smart Mobs: The Next Social Revolution
| last = Rheingold
| first = Howard
| publisher = Perseus Publishing
| year = 2002
| isbn = 9780738206080
| quote = Mentions GroupLens, though gives Paul Resnick all the credit
}}
*{{cite book
| title = Word of Mouse: The Marketing Power of Collaborative Filtering
| last1 = Riedl
| first1 = John
| last2 = Konstan
| first2 = Joseph
| month = August
| year = 2002
| isbn = 9780759527270
| quote =
}}
*{{cite journal
| url = http://www.nytimes.com/2003/05/22/technology/false-web-ratings-swing-opinion-study-says.html?scp=5&sq=john%20riedl&st=cse
| title = False Web Ratings Swing Opinion, Study Says
| date = May 22, 2003
| publisher = The New York Times
| accessdate = 23 Dec 2009
| quote = Shilling
}}
}}

*{{cite book
*{{cite book
| title = Breakthrough: Stories and Strategies of Radical Innovation
| title = Breakthrough: Stories and Strategies of Radical Innovation
Line 320: Line 509:
| month = October
| month = October
| year = 2004
| year = 2004
| isbn = 9780262195140
| isbn = 9780262195140
| quote = Mentions John Riedl and his research
| quote = Mentions John Riedl and his research
}}
*{{cite journal
| url = http://www.nytimes.com/2006/10/02/technology/02iht-netflix.2998832.html?scp=1&sq=john%20riedl&st=cse
| title = Netflix offers cash for good suggestions
| publisher = The New York Times
| date = October 2, 2006
| accessdate = 23 Dec 2009
| quote =
}}
*{{cite web
| url = http://minnesota.publicradio.org/display/web/2006/10/12/midmorning1/
| title = Google and YouTube
| date = October 12, 2006
| accessdate = 23 Dec 2009
| quote = Interview after Google bought Youtube
}}
}}
*{{cite web
*{{cite web
Line 343: Line 517:
| accessdate = 23 Dec 2009
| accessdate = 23 Dec 2009
| quote = About Riedl's invited talk at RecSys'06 in Bilbao
| quote = About Riedl's invited talk at RecSys'06 in Bilbao
}}
*{{cite book
| title = Programming Collective Intelligence: Building Smart Web 2.0 Applications
| last = Segaran
| first = Toby
| publisher = O'Reilly Media
| month = August
| year = 2007
| isbn = 9780596529321
| quote = Uses the MovieLens dataset
}}
}}
*{{cite web
*{{cite web
Line 358: Line 522:
| title = Recommendations 2.0 by John Riedl, Ph.D.
| title = Recommendations 2.0 by John Riedl, Ph.D.
| year = 2007
| year = 2007
| accessdate = 23 Dec 2009
| quote =
}}
*{{cite journal
| url = http://online.wsj.com/article/SB121633741917263835.html?mod=googlenews_wsj
| title = Technology Gets Personal
| publisher = The Wall Street Journal
| date = July 18, 2008
| accessdate = 23 Dec 2009
| accessdate = 23 Dec 2009
| quote =
| quote =
Line 383: Line 539:
| quote =
| quote =
}}
}}

* My Tivo thinks I'm Gay article
http://online.wsj.com/article_email/SB1038261936872356908.html

Revision as of 18:32, 4 January 2010

GroupLens Research, College of Science and Engineering, University of Minnesota
TypeResearch Laboratory
Established1992 by John Riedl and Paul Resnick
Academic staff
3
Postgraduates20
Location, ,
Websitewww.grouplens.org
200px|University Wordmark

GroupLens Research is a research lab in the Department of Computer Science and Engineering at the University of Minnesota, Twin Cities specializing in recommender systems, online communities, mobile and ubiquitious technologies, digital libraries and local geographic information systems.

History

In 1992, John Riedl and Paul Resnick attended the CSCW conference together. After they heard keynote speaker Shumpei Kumon talk about the information economy[1], they originated the idea for a collaborative filtering system used for Usenet news that they'd later call GroupLens. GroupLens collected ratings from Usenet readers, and used those ratings to predict how much other readers would like each article. GroupLens was one of the first automated collaborative filtering systems, in which algorithms were used to automatically form predictions based on historical patterns of ratings. The overall system was called "GroupLens", and the servers that collected the ratings and performed the computation were called the "Better Bit Bureau". (This name was later dropped after a request from the Better Business Bureau.)

A feasibility test was done between MIT and the University of Minnesota and a research paper was published including the algorithm, the system design, and the results of the feasibility study in the CSCW conference of 1994.[2]

In 1995, Riedl and Resnick invited Joseph Konstan to join the team, because of his expertise[3] in user interfaces. The team decided to create a higher-performance implementation of the algorithms to support larger-scale deployments. In summer 1995 Riedl, Resnick, and Konstan gathered Bradley Miller, David Maltz, Jon Herlocker, and Mark Claypool for "Hack Week" to create the new implementation, and to plan the next round of experiments.

In the Spring of 1996, the first workshop on collaborative filtering was put together by Resnick and Hal Varian at the University of California, Berkeley.[4] There, researchers from projects around the world CHECKME: any from outside the US that were studying similar systems came together to share ideas and experience.

In the summer of 1996, David Gardiner, a former Ph.D. student of Riedl's, introduced Riedl to Steven Snyder. Snyder had been one of the early employees at Microsoft, but had left Microsoft to come to Minnesota to do a Ph.D. in Psychology. He realized the commercial potential of collaborative filtering, and encouraged the team to found a company in April 1996. By June, Gardiner, Snyder, Miller, Riedl, and Konstan had incorporated their company, and by July they had their first round of funding, from the Hummer-Winblad venture capital company[5]. Net Perceptions went on to be one of the leading companies in personalization[6][7] during the Internet boom of the late 1990s, and stayed in business until 2004. Based on their experience, Riedl and Konstan wrote a book about the lessons learned from deploying recommenders in practice[8]. Recommender systems have since become ubiquitous in the online world, with leading vendors such as Amazon and Netflix deploying highly sophisticated recommender systems[9]. Netflix even offered a $1,000,000 prize for improvements in recommender technology[10].


Meanwhile, research continued at the University of Minnesota. When the EachMovie[11] site closed in 1997, the researchers behind it generously released the anonymous rating data they had collected, for other researchers to use. The GroupLens research team, led by Brent Dahlen and Jon Herlocker, used this dataset to jumpstart a new movie recommendation site, called MovieLens. They were able to get the first version of the MovieLens movie recommendation site running within a few months. Since 1997, MovieLens has been a very visible research recommender site, including a detailed discussion in a New Yorker article by Malcolm Gladwell[12], and a report in a full episode of ABC Nightline[13].

Between 1997 and 2002 the group continued its research on collaborative filtering, which became known in the community by the more general term of recommender systems. In 2002 the group expanded into social computing and online communities with the addition of Loren Terveen, who was known for his research on more social recommender systems such as PHOAKS [14] [15].

In order to broaden the set of research ideas and tools they used, Riedl, Konstan, and Terveen invited colleagues in social psychology (Robert Kraut and Sara Kiesler, of the Carnegie Mellon Human Computer Interaction Institute), and economic and social analysis (Paul Resnick and Yan Chen of the University of Michigan School of Information) to collaborate. The new, larger team adopted the name CommunityLab, and looked generally at the effects of technological interventions on the performance of online communities. For instance, some of their research explored technology for enriching conversation systems [16], while other research explored the personal, social, and economic motivations for user ratings[17] [18].

In 2008 GroupLens launched Cyclopath, a computational geowiki for bicyclists within a city. [19] [20]


Contributions

  • The MovieLens recommender system: MovieLens is a non-commercial movie recommender system that has been running for over a decade now, with over 164,000 unique visitors to date, who have provided over 15 million movie ratings[21].
  • MovieLens ratings datasets: In the early days of recommender systems, research was slowed down by the lack of publicly available datasets. In response to requests from other researchers, GroupLens released three datasets: the MovieLens 100,000 rating dataset, the MovieLens 1,000,000 rating dataset, and the MovieLens 10,000,000 rating dataset. These datasets became the standard datasets for recommender research,and have been used in over 300 papers by researchers around the world. The dataset is also being used for teaching about recommender technology[22].


  • MovieLens tagging dataset: GroupLens added tagging to MovieLens in 2006. Since then, users have provided over 85,000 applications of 14,000

unique tags to movies[23]. The MovieLens 10,000,000 ratings dataset also includes a 100,000 tag applications dataset for researchers to use.

  • Information leakage from recommender datasets: a paper in the information retrieval conference analyzed the privacy risks to users of having large recommender datasets released. The basic risk discovered is that an anonymized dataset might be combined with public information to identify a user. For instance, a user who has written about his preference for movies on online forums could be associated with a specific row in the MovieLens datasets. In some cases, these associations might leak information the user would prefer to keep private.[24]

[25]


  • Wikipedia research: The study of value and vandalism in Wikipedia published in 2007[26] described the concentration of contribution across Wikipedia editors. This paper was one of the first to focus on the length of time that a contribution survives within Wikipedia as a measure of its value. The paper also investigated the effects of vandalism on Wikipedia readers, by measuring the probability that a view of a page would capture that page in a vandalized state. GroupLens has also explored ways to help editors find pages that can effectively contribute to with the SuggestBot robot recommender[27].

The group has also explored the evolution of the norms in Wikipedia that determine which articles are accepted or rejected, and the effect of changes in those norms on the [Long tail] of Wikipedia articles[28]. GroupLens has also explored the functioning of the informal peer review system within Wikipedia to discover ways the decisions being made appear to be influenced inappropriately by ownership, and that experience does not seem to change editor performance very much[29][30].GroupLens researchers have also explored visualizations of the edit history of Wikipedia articles[31].

  • Shilling recommender systems: GroupLens has explored ways that users of recommender systems can attempt to inappropriately influence the recommendations given to other users[32]. They call this behavior shilling, because of its relationship to the practice of hiring associates to pretend to be enthusiastic customers. They showed that some types of shilling are likely to be effective in practice. One concern about shilling is that the false predictions may change the reported opinions of later users, further corrupting the recommendations[33]

[34]

  • Cyclopath: Beginning in 2008 GroupLens launched Cyclopath, a computational geowiki for local bicyclists. Cyclopath has since been used by hundreds of cyclists within the Twin Cities[35]. More recently, Cyclopath has been adopted by the Metropolitan Council of the Twin Cities to help plan the regional cycling system[36].

References

  1. ^ Kumon, Shumpei (1992). "From wealth to wisdom: a change in the social paradigm". Proceedings of the 1992 ACM conference on Computer-supported cooperative work. Computer Supported Cooperative Work. ACM Press. p. 3. doi:10.1145/143457.371587. ISBN 0897915429. {{cite conference}}: |access-date= requires |url= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help)
  2. ^ Resnick, Paul (1994). "GroupLens: an open architecture for collaborative filtering of netnews". Proceedings of the 1994 International ACM Conference on Computer Supported Cooperative Work. Computer Supported Cooperative Work. ACM Press. pp. 175–186. doi:10.1145/192844.192905. ISBN 0897916891. {{cite conference}}: |access-date= requires |url= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); line feed character in |booktitle= at position 56 (help)
  3. ^ Konstan, Joseph (March/April 2004). "President's Report: Advancing the Field". SIGCHI Bulletin. 36 (2). Retrieved 2009-12-30. {{cite journal}}: Check date values in: |date= (help)
  4. ^ "Collaborative Filtering". March 16, 1996. Retrieved 2009-12-30.
  5. ^ "Minnesota in the .Com Age" (PDF). Minnesota Public Radio. 1999. Retrieved 30 Dec 2009.
  6. ^ MIT News Office (May 19, 1999), Firms honored at e-commerce awards, MIT, retrieved February 2008 {{citation}}: Check date values in: |accessdate= (help)
  7. ^ Dragan, Richard (2001), "Net Perceptions for E-commerce 6.0", PC Magazine, retrieved January 2008 {{citation}}: Check date values in: |accessdate= (help); Unknown parameter |month= ignored (help)
  8. ^ Riedl, John; Konstan, Joseph; Vrooman, Eric (2002). Word of Mouse: The Marketing Power of Collaborative Filtering. ISBN 9780759527270. {{cite book}}: Unknown parameter |month= ignored (help)
  9. ^
  10. ^ "Netflix offers cash for good suggestions". The New York Times. October 2, 2006. Retrieved 23 Dec 2009. {{cite journal}}: Cite journal requires |journal= (help)
  11. ^ Lim, Myungeun (2001). "An Adaptive Recommendation System with a Coordinator Agent". Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development. Asia-Pacific Conference on Web Intelligence. Lecture Notes in Computer Science. Vol. 2198/2001. Springer Berlin/Heidelberg. pp. 438–442. doi:10.1007/3-540-45490-X_56. ISBN 9783540427308. Retrieved 2009-12-30. {{cite conference}}: Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help)
  12. ^ Gladwell, Malcolm (October 4, 1999). "Annals of Marketing: The Science of the Sleeper: How the Information Age Could Blow Away the Blockbuster". New Yorker. 75 (29): 48–55. Retrieved 2009-12-29.
  13. ^ Krulwich, Robert (December 10, 1999). "ABC Nightline: Soulmate". ABC. Retrieved February 2008. {{cite web}}: Check date values in: |accessdate= (help)
  14. ^
    • Christopher Lueg and Danyel Fisher, ed. (2003). From Usenet to CoWebs: Interacting with Social Information Spaces. Springer. ISBN 978-1852335328.
  15. ^ Terveen, L.; et al. (1997). "PHOAKS: a system for sharing recommendations". Communications of the ACM. 40 (3). ACM Press: 59–62. doi:10.1145/245108.245122. ISSN 0001-0782. {{cite journal}}: Explicit use of et al. in: |author= (help); Unknown parameter |month= ignored (help)
  16. ^ Harper, F. Maxwell (2007). "Talk amongst yourselves: inviting users to participate in online conversations". Proceedings of the 12th international conference on Intelligent user interfaces. International Conference on Intelligent User Interfaces. ACM Press. pp. 62–71. doi:10.1145/1216295.1216313. ISBN 1595934812. Retrieved 2009-12-29. {{cite conference}}: External link in |conferenceurl= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |conferenceurl= ignored (|conference-url= suggested) (help)
  17. ^ Harper, F. Maxwell (2005). "An Economic Model of User Rating in an Online Recommender System". User Modeling 2005 Proceedings. 10th International Conference on User Modeling. Lecture Notes in Computer Science. Vol. 3538. Springer. pp. 307–316. doi:10.1007/11527886_40. ISBN 9783540278856. Retrieved 2009-12-29. {{cite conference}}: External link in |conferenceurl= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |conferenceurl= ignored (|conference-url= suggested) (help)
  18. ^ Rashid, Al Mamunur (2006). "Motivating Participation by Displaying the Value of Contribution". Proceedings of the 2006 CHI Conference. ACM SIGCHI Conference on Human Factors in Computing Systems. Lecture Notes in Computer Science. Vol. 3538. Springer. pp. 955–958. doi:10.1145/1124772.1124915. ISBN 1-59593-372-7. Retrieved 2009-12-30. {{cite conference}}: Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help)
  19. ^
  20. ^
    • "Mapquest for the cycling set". Saint Paul Pioneer Press (MN). July 19, 2008. {{cite journal}}: Cite journal requires |journal= (help)
  21. ^ Vig, Jesse (2009). "Tagsplanations: Explaining Recommendations using Tags". Proceedings of the 13th international conference on Intelligent user interfaces. International Conference on Intelligent User Interfaces. ACM Press. pp. 47–56. doi:10.1145/1502650.1502661. ISBN 9781605581682. Retrieved 2009-12-30. {{cite conference}}: External link in |conferenceurl= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |conferenceurl= ignored (|conference-url= suggested) (help)
  22. ^ Segaran, Toby (2007). Programming Collective Intelligence: Building Smart Web 2.0 Applications. O'Reilly Media. ISBN 9780596529321. Uses the MovieLens dataset {{cite book}}: Unknown parameter |month= ignored (help)
  23. ^ Sen, Shilad (2009). "Tagommenders: Connecting Users to Items through Tags". Proceedings of the 18th international conference on World wide web. International World Wide Web Conference. ACM Press. pp. 671–680. doi:10.1145/1526709.1526800. ISBN 9781605584874. Retrieved 2009-12-29. {{cite conference}}: External link in |conferenceurl= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |conferenceurl= ignored (|conference-url= suggested) (help)
  24. ^ Frankowski, Dan (2006). "You are what you say: privacy risks of public mentions". Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. Annual ACM Conference on Research and Development in Information Retrieval. ACM Press. pp. 565–572. doi:10.1145/1148170.1148267. ISBN 1595933697. Retrieved 2009-12-29. {{cite conference}}: External link in |conferenceurl= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |conferenceurl= ignored (|conference-url= suggested) (help)
  25. ^ Schneier, Bruce (2006). "Schneier on Security: A blog covering security and security technology". Retrieved 2009-12-29. {{cite web}}: Unknown parameter |month= ignored (help)
  26. ^ Priedhorsky, Reid (2007). "Creating, Destroying, and Restoring Value in Wikipedia". Proceedings of the 2007 international ACM conference on Supporting group work. Conference on Supporting Group Work. ACM Press. pp. 259–268. doi:10.1145/1316624.1316663. ISBN 9781595938459. Retrieved 2009-12-29. {{cite conference}}: External link in |conferenceurl= (help); Unknown parameter |booktitle= ignored (|book-title= suggested) (help); Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |conferenceurl= ignored (|conference-url= suggested) (help)
  27. ^ SuggestBot paper by Cosley
  28. ^ Tony's paper on Long Tail
  29. ^ Aaron's paper on peer review
  30. ^ Katie's paper on Born not made
  31. ^ Michael paper
  32. ^ Lam Shilling paper
  33. ^ "False Web Ratings Swing Opinion, Study Says". The New York Times. May 22, 2003. Retrieved 23 Dec 2009. Shilling {{cite journal}}: Cite journal requires |journal= (help)
  34. ^ Cosley is seeing believing paper
  35. ^ Cyclopath paper that talks about usage
  36. ^ http://www.bikewalktwincities.org/projects/ntp-program-area/cycloplan

External links

  • My Tivo thinks I'm Gay article

http://online.wsj.com/article_email/SB1038261936872356908.html