Bayesian Aggregation of Categorical Distributions with Applications in Crowdsourcing
Abstract: A key problem in crowdsourcing is the aggregation of judgments of proportions. For example, workers might be presented with a news article or an image, and be asked to identify the proportion of each topic, sentiment, object, or colour present in it. These varying judgments then need to be aggregated to form a consensus view of the document's or image's contents. Often, however, these judgments are skewed by workers who provide judgments randomly. Such spammers make the cost of acquiring judgments more expensive and degrade the accuracy of the aggregation. For such cases, we provide a new Bayesian framework for aggregating these responses (expressed in the form of categorical distributions) that for the first time accounts for spammers. We elicit 796 judgments about proportions of objects and coloursin images. Experimental results show comparable aggregation accuracy when 60% of the workers are spammers, as other state of the art approaches do when there are no spammers.
Authors: A. Augustin, M. Venanzi, A. Rogers, N.R. Jennings
Date: 2017-03-03
Venue: Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI)
#publications #research #machinelearning