Probabilistic Data Fusion on a Large Document Collection
David Lillis, Fergus Toolan, Rem Collier and John Dunnion
In Proceedings of the 17th Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2006), Belfast, Northern Ireland, 2006.
Data Fusion is the process of combining the output of a number of Information Retrieval algorithms into a single result set, to achieve greater retrieval performance. ProbFuse is a probabilistic data fusion algorithm that has been shown to outperform the CombMNZ algorithm in a number of previous experiments. This paper builds on that work and applies probFuse to the much larger Web Track document collection from the 2004 Text REtrieval Conference.
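
For readers unfamiliar with the CombMNZ baseline named above, the following is a minimal sketch of how such score-based fusion is commonly described: each document's normalised scores are summed across the input result sets, and the sum is multiplied by the number of result sets that returned the document. It is not code from the paper. The function names (`min_max_normalise`, `comb_mnz`), the use of min-max normalisation, and the tie-breaking by document identifier are assumptions made for illustration only.

```python
from collections import defaultdict


def min_max_normalise(scores):
    """Rescale one input system's scores to [0, 1] (assumed normalisation step)."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: treat every returned document as equally relevant.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def comb_mnz(result_sets):
    """Fuse several {doc_id: score} mappings into a single ranked list of doc ids.

    Hypothetical helper: sums normalised scores per document, then multiplies
    by the number of input result sets that returned that document.
    """
    summed = defaultdict(float)
    hits = defaultdict(int)
    for scores in result_sets:
        if not scores:
            continue  # an input system that returned nothing contributes nothing
        for doc, s in min_max_normalise(scores).items():
            summed[doc] += s
            hits[doc] += 1
    fused = {doc: summed[doc] * hits[doc] for doc in summed}
    # Highest fused score first; ties broken by document id (an assumption).
    return sorted(fused, key=lambda d: (-fused[d], d))
```

Given two toy result sets, `{"d1": 3.0, "d2": 2.0, "d3": 1.0}` and `{"d2": 0.9, "d4": 0.5, "d1": 0.1}`, the sketch ranks `d2` first because it scores well in both inputs, which is the behaviour the multiplier is intended to reward.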