The dataset was collected to examine and identify possible key topics within these texts. Data preparation such as data cleaning, transformation, tokenization, removal of stop words from both English and Filipino, and word stemming was employed in the dataset before feeding it to sentiment analysis and the LDA model. The topmost occurring word within the dataset is "development" and there are three (3) likely topics from the speeches of Philippine presidents: economic development, enhancement of public services, and addressing challenges. The dataset was able to provide valuable insights contained among official documents. While the study showed that presidents have used their annual address to express their visions for the country. It also presented that the presidents from 1935 to 2016 faced the same problems during their term. Future researchers may collect other speeches made by presidents during their term; combine them to the dataset used in this study to further investigate these important texts by subjecting them to the same methodology used in this study. The dataset may be requested from the authors and it is recommended for further analysis. For example, determine how the speeches of the president reflect the preamble or foundations of the Philippine constitution.
翻译:收集数据,如数据清理、转换、象征性化、删除英文和菲律宾文的停留词和断字等数据,在输入情绪分析和LDA模型之前,在数据集中使用了数据元数,数据集中最常出现的词是“发展”,菲律宾总统讲话中可能有三个(3)个主题:经济发展、加强公共服务和应对挑战。数据集能够提供正式文件中的宝贵见解。研究显示,1935年至2016年的总统用年度演讲来表达他们对国家的看法。还表明,1935年至2016年的总统在任期内面临同样的问题。未来的研究人员可以收集总统在任期内发表的其他演讲;将其与本研究中使用的数据集合并,以便进一步调查这些重要文本,采用的研究所用方法相同。可以要求作者提供数据集,并建议作进一步分析。例如,确定总统的演讲如何反映菲律宾宪法的序言或基础。