Prerecorded laughter accompanying dialog in comedy TV shows encourages the audience to laugh by clearly marking humorous moments in the show. We present an approach for automatically detecting humor in the Friends TV show using multimodal data. Our model is capable of recognizing whether an utterance is humorous or not and assess the intensity of it. We use the prerecorded laughter in the show as annotation as it marks humor and the length of the audience's laughter tells us how funny a given joke is. We evaluate the model on episodes the model has not been exposed to during the training phase. Our results show that the model is capable of correctly detecting whether an utterance is humorous 78% of the time and how long the audience's laughter reaction should last with a mean absolute error of 600 milliseconds.
翻译:在喜剧电视节目中,在喜剧电视节目中,预先录制的笑声会鼓励观众笑,在节目中明显地标出幽默的时刻。我们展示了一种方法,用多式联运数据自动检测Friends电视节目中的幽默感。我们的模型能够辨别一个发声是否幽默并评估其强度。我们用节目中预先录制的笑声作为注解,因为它标记了幽默感和观众的笑声长度告诉我们一个笑话是多么有趣。我们评估了模型在培训阶段没有暴露的模型。我们的结果显示,模型能够正确检测出一个发声是否幽默78%的时间,以及观众的笑声反应应持续多久,其平均误差为600毫秒。