We present a methodology to support the analysis of culture from text such as news events and demonstrate its usefulness on categorizing news events from different categories (society, business, health, recreation, science, shopping, sports, arts, computers, games and home) across different geographical locations (different places in 117 countries). We group countries based on the culture that they follow and then filter the news events based on their content category. The news events are automatically labelled with the help of Hofstedes cultural dimensions. We present combinations of events across different categories and check the performances of different classification methods. We also presents experimental comparison of different number of features in order to find a suitable set to represent the culture.
翻译:我们提出一种方法,支持从新闻事件等文字分析文化,并展示其对于将不同地点(117个国家的不同地点)的不同类别(社会、商业、卫生、娱乐、科学、购物、体育、艺术、计算机、游戏和家)的新闻事件分类的有用性(社会、卫生、娱乐、科学、购物、体育、艺术、计算机、游戏和家),我们根据所遵循的文化将国家分组,然后根据其内容类别过滤新闻事件,新闻事件在Hofsstedes文化层面的帮助下自动贴上标签。我们介绍不同类别的事件组合,检查不同分类方法的表现。我们还对不同的特征进行实验性比较,以便找到代表文化的合适数据集。