We describe our straight-forward approach for Tasks 5 and 6 of 2021 Social Media Mining for Health Applications (SMM4H) shared tasks. Our system is based on fine-tuning Distill- BERT on each task, as well as first fine-tuning the model on the other task. We explore how much fine-tuning is necessary for accurately classifying tweets as containing self-reported COVID-19 symptoms (Task 5) or whether a tweet related to COVID-19 is self-reporting, non-personal reporting, or a literature/news mention of the virus (Task 6).
翻译:我们描述我们对2021年社会媒体采矿促进健康应用(SMM4H)第5和第6任务(SMM4H)的共同任务采取的直向办法,我们的系统基于对每项任务进行微调,以及首先对另一项任务的模式进行微调,我们探讨需要多少微调才能准确地将推特归类为含有自报COVID-19症状(任务5),或者与COVID-19有关的推文是自我报告、非个人报告,还是文献/新闻提到病毒(任务6)。