From Wikipedia, the free encyclopedia
Jump to: navigation, search
Clair logo.jpg
Developer(s) CLAIR University of Michigan
Stable release 1.0.8 / August 1, 2009; 5 years ago (2009-08-01)
Development status Active
Written in Perl
Platform Cross-platform
Available in Perl
Type Natural Language Processing, Network Analysis, Information Retrieval
License GNU General Public License, Artistic License

Clairlib is a suite of open-source Perl modules developed and maintained by the Computational Linguistics And Information Retrieval (CLAIR) group at the University of Michigan. Clairlib is intended to simplify a number of generic tasks in natural language processing (NLP), information retrieval (IR), and network analysis (NA). The latest version of clairlib is 1.06 which was released on March 2009 and includes about 130 modules implementing a wide range of functionalities.


Clairlib is distributed in two forms: Clairlib-core, which has essential functionality and minimal dependence on external software, and Clairlib-ext, which has extended functionality that may be of interest to a smaller audience. Much can be done using Clairlib on its own. Some of the things that Clairlib can do are: Tokenization, Summarization, Document Clustering, Document Indexing, Web Graph Analysis, Network Generation, Power law distribution Analysis, Network Analysis, Random walks on graphs, Tf-idf, Perceptron learning and classification, and Phrase Based Retrieval and Fuzzy OR Queries.


External links[edit]