Victor Lavrenko is a lecturer at the School of Informatics at the University of Edinburgh, Scotland, UK. He received his Ph.D. in Computer Science from the University of Massachusetts Amherst in 2004. His dissertation focused on a generative framework for modeling relevance in Information Retrieval. In 2005 he joined the Center for Intelligent Information Retrieval at UMass as a post-doctoral research associate, working on statistical models for searching large semi-structured databases. From 2006 Victor worked as a language technology consultant for the Credit Suisse Group. Since 2000, he has served as a reviewer for SIGIR, CIKM, NAACL/HLT, IJCAI and NIPS conferences.
Victor's current research interests include formal models for searching text in multiple languages, annotating and retrieving images, and detecting and tracking novel events in the news.
Relevance.- A Generative View of Relevance.- Generative Density Allocation.- Retrieval Scenarios.- Conclusion.
A modern information retrieval system must have the capability to find, organize and present very different manifestations of information - such as text, pictures, videos or database records - any of which may be of relevance to the user. However, the concept of relevance, while seemingly intuitive, is actually hard to define, and it's even harder to model in a formal way.
Lavrenko does not attempt to bring forth a new definition of relevance, nor provide arguments as to why any particular definition might be theoretically superior or more complete. Instead, he takes a widely accepted, albeit somewhat conservative definition, makes several assumptions, and from them develops a new probabilistic model that explicitly captures that notion of relevance. With this book, he makes two major contributions to the field of information retrieval: first, a new way to look at topical relevance, complementing the two dominant models, i.e., the classical probabilistic model and the language modeling approach, and which explicitly combines documents, queries, and relevance in a single formalism; second, a new method for modeling exchangeable sequences of discrete random variables which does not make any structural assumptions about the data and which can also handle rare events.
Thus his book is of major interest to researchers and graduate students in information retrieval who specialize in relevance modeling, ranking algorithms, and language modeling.