Distilling Information Reliability and Source Trustworthiness from Digital Traces

Behzad Tabibian Isabel Valera Mehrdad Farajtabar
Le Song Bernhard Schölkopf Manuel Gomez-Rodriguez

Online knowledge repositories typically rely on their users or dedicated editors to evaluate the reliability of their contents. These explicit feedback mechanisms can be viewed as noisy measurements of both information reliability and information source trustworthiness. Can we leverage these noisy measurements, often biased, to distill a robust, unbiased and interpretable measure of both notions?

In this paper, we argue that the large volume of digital traces left by the users within knowledge repositories also reflect information reliability and source trustworthiness. In particular, we propose a temporal point process modeling framework which links the temporal behavior of the users to information reliability and source trustworthiness.

Furthermore, we develop an efficient convex optimization procedure to learn the parameters of the model from historical traces of the evaluations provided by these users. Experiments on real-world data gathered from Wikipedia and Stack Overflow show that our modeling framework accurately predicts evaluation events, provides an interpretable measure of information reliability and source trustworthiness, and yields interesting insights about real-world events.

Wikipedia Demo


Explore site parameters by typing domain name in following text box.


Explore parameters of the model trained on Wikipedia corpus by typing title of an entry on Wikipedia in following text box.