人群分布不均、遮挡和背景干扰等问题使得人群计数成为了一项复杂且具有挑战性的任务。针对这些问题,提出了一种多尺度特征融合的位置关注网络(Position-Aware Network based on Multi-Scale Feature Fusion,MSFPANet)。首先,设计了一...人群分布不均、遮挡和背景干扰等问题使得人群计数成为了一项复杂且具有挑战性的任务。针对这些问题,提出了一种多尺度特征融合的位置关注网络(Position-Aware Network based on Multi-Scale Feature Fusion,MSFPANet)。首先,设计了一种多尺度特征融合模块,以在不同感受野下提取并融合人群密度图的多尺度特征,同时提取出前景信息,来应对人群计数中的遮挡和背景干扰问题;然后,通过位置注意力分配网络提高模型对人群区域的关注度,有效地应对人群分布不均的问题;最后,为了辅助模型训练,减小背景噪声带来的干扰,引入了一种结构交叉损失用于强化模型对人群结构的学习。实验结果表明:MSF-PANet在Shanghai Tech Part A、Shanghai Tech Part B、UCF-QNRF和UCF_CC_50上平均绝对误差分别为59.5、7.8、103、182.7,均方误差分别为96.7、13.6、177、237.7,验证了所提模块在提高人群计数准确率上的有效性。展开更多
A composite random variable is a product (or sum of products) of statistically distributed quantities. Such a variable can represent the solution to a multi-factor quantitative problem submitted to a large, diverse, i...A composite random variable is a product (or sum of products) of statistically distributed quantities. Such a variable can represent the solution to a multi-factor quantitative problem submitted to a large, diverse, independent, anonymous group of non-expert respondents (the “crowd”). The objective of this research is to examine the statistical distribution of solutions from a large crowd to a quantitative problem involving image analysis and object counting. Theoretical analysis by the author, covering a range of conditions and types of factor variables, predicts that composite random variables are distributed log-normally to an excellent approximation. If the factors in a problem are themselves distributed log-normally, then their product is rigorously log-normal. A crowdsourcing experiment devised by the author and implemented with the assistance of a BBC (British Broadcasting Corporation) television show, yielded a sample of approximately 2000 responses consistent with a log-normal distribution. The sample mean was within ~12% of the true count. However, a Monte Carlo simulation (MCS) of the experiment, employing either normal or log-normal random variables as factors to model the processes by which a crowd of 1 million might arrive at their estimates, resulted in a visually perfect log-normal distribution with a mean response within ~5% of the true count. The results of this research suggest that a well-modeled MCS, by simulating a sample of responses from a large, rational, and incentivized crowd, can provide a more accurate solution to a quantitative problem than might be attainable by direct sampling of a smaller crowd or an uninformed crowd, irrespective of size, that guesses randomly.展开更多
文摘人群分布不均、遮挡和背景干扰等问题使得人群计数成为了一项复杂且具有挑战性的任务。针对这些问题,提出了一种多尺度特征融合的位置关注网络(Position-Aware Network based on Multi-Scale Feature Fusion,MSFPANet)。首先,设计了一种多尺度特征融合模块,以在不同感受野下提取并融合人群密度图的多尺度特征,同时提取出前景信息,来应对人群计数中的遮挡和背景干扰问题;然后,通过位置注意力分配网络提高模型对人群区域的关注度,有效地应对人群分布不均的问题;最后,为了辅助模型训练,减小背景噪声带来的干扰,引入了一种结构交叉损失用于强化模型对人群结构的学习。实验结果表明:MSF-PANet在Shanghai Tech Part A、Shanghai Tech Part B、UCF-QNRF和UCF_CC_50上平均绝对误差分别为59.5、7.8、103、182.7,均方误差分别为96.7、13.6、177、237.7,验证了所提模块在提高人群计数准确率上的有效性。
文摘A composite random variable is a product (or sum of products) of statistically distributed quantities. Such a variable can represent the solution to a multi-factor quantitative problem submitted to a large, diverse, independent, anonymous group of non-expert respondents (the “crowd”). The objective of this research is to examine the statistical distribution of solutions from a large crowd to a quantitative problem involving image analysis and object counting. Theoretical analysis by the author, covering a range of conditions and types of factor variables, predicts that composite random variables are distributed log-normally to an excellent approximation. If the factors in a problem are themselves distributed log-normally, then their product is rigorously log-normal. A crowdsourcing experiment devised by the author and implemented with the assistance of a BBC (British Broadcasting Corporation) television show, yielded a sample of approximately 2000 responses consistent with a log-normal distribution. The sample mean was within ~12% of the true count. However, a Monte Carlo simulation (MCS) of the experiment, employing either normal or log-normal random variables as factors to model the processes by which a crowd of 1 million might arrive at their estimates, resulted in a visually perfect log-normal distribution with a mean response within ~5% of the true count. The results of this research suggest that a well-modeled MCS, by simulating a sample of responses from a large, rational, and incentivized crowd, can provide a more accurate solution to a quantitative problem than might be attainable by direct sampling of a smaller crowd or an uninformed crowd, irrespective of size, that guesses randomly.