A large number of the most-subscribed YouTube channels target children of a very young age. Hundreds of toddler-oriented channels on YouTube feature inoffensive, well-produced, and educational videos. Unfortunately, inappropriate content that targets this demographic is also common. YouTube's algorithmic recommendation system regrettably suggests inappropriate content because some of it mimics or is derived from otherwise appropriate content. Considering the risk for early childhood development, and an increasing trend in toddler's consumption of YouTube media, this is a worrisome problem. In this work, we build a classifier able to discern inappropriate content that targets toddlers on YouTube with 84.3% accuracy, and leverage it to perform a first-of-its-kind, large-scale, quantitative characterization that reveals some of the risks of YouTube media consumption by young children. Our analysis reveals that YouTube is still plagued by such disturbing videos and its currently deployed counter-measures are ineffective in terms of detecting them in a timely manner. Alarmingly, using our classifier we show that young children are not only able, but likely to encounter disturbing videos when they randomly browse the platform starting from benign videos.
翻译:大量最受欢迎的YouTube频道针对的是非常年幼的儿童。YouTube上数百个面向幼儿的频道不冒犯、制作精良和教育性视频。不幸的是,针对这一人口分布的不适当内容也很常见。YouTube的算法建议系统令人遗憾地暗示内容内容不适当,因为有些内容模仿或来自其他适当内容。考虑到幼儿发展的风险,以及幼儿对YouTube媒体的消费趋势日益上升,这是一个令人担忧的问题。在这项工作中,我们建立了一个分类器,能够以84.3%的准确率辨别YouTube上的目标对象的不适当内容,并利用它进行首类、大规模、定量的描述,揭示了儿童对YouTube媒体消费的一些风险。我们的分析显示,YouTube仍然受到这种令人不安的视频的困扰,而且它目前采用的反措施在及时发现这些视频方面是无效的。我们用我们的分类器显示,幼儿不仅能够,而且有可能在他们从良性视频随机浏览平台时看到令人不安的视频。