Subjective answer evaluation is a time-consuming and tedious task, and the quality of the evaluation is heavily influenced by a variety of subjective personal characteristics. Instead, machine evaluation can effectively assist educators in saving time while also ensuring that evaluations are fair and realistic. However, most existing methods using regular machine learning and natural language processing techniques are generally hampered by a lack of annotated answers and poor model interpretability, making them unsuitable for real-world use. To solve these challenges, we propose ProtSi Network, a unique semi-supervised architecture that for the first time uses few-shot learning to subjective answer evaluation. To evaluate students' answers by similarity prototypes, ProtSi Network simulates the natural process of evaluator scoring answers by combining Siamese Network which consists of BERT and encoder layers with Prototypical Network. We employed an unsupervised diverse paraphrasing model ProtAugment, in order to prevent overfitting for effective few-shot text classification. By integrating contrastive learning, the discriminative text issue can be mitigated. Experiments on the Kaggle Short Scoring Dataset demonstrate that the ProtSi Network outperforms the most recent baseline models in terms of accuracy and quadratic weighted kappa.
翻译:主观回答评估是一项耗时费时和繁琐的任务,评价的质量受到各种主观个人特征的严重影响。相反,机器评估可以有效地帮助教育者节省时间,同时确保评价是公平和现实的。然而,使用常规机器学习和自然语言处理技术的大多数现有方法通常由于缺乏附加说明的答案和模型解释能力差而受阻,使其不适于现实世界使用。为了应对这些挑战,我们提议了ProtSi网络,这是一个独特的半监督架构,首次对主观回答评估使用少发的学习。为了评估学生的答案,ProtSi网络通过将由BERT和编码器层组成的Simese网络与Protophical网络结合起来,模拟了评价员评分的自然过程。我们采用了一种未经监督的多种参数模型ProtAugment,以防止对短发文本进行有效的分类。通过结合对比性学习,可减少歧视性文本问题。在Kagglegle Shittle Scoration Dataset上实验了Krat-qraimational-si Net网络的最新基线术语。