Nowadays, foundation models become one of fundamental infrastructures in artificial intelligence, paving ways to the general intelligence. However, the reality presents two urgent challenges: existing foundation models are dominated by the English-language community; users are often given limited resources and thus cannot always use foundation models. To support the development of the Chinese-language community, we introduce an open-source project, called Fengshenbang, which leads by the research center for Cognitive Computing and Natural Language (CCNL). Our project has comprehensive capabilities, including large pre-trained models, user-friendly APIs, benchmarks, datasets, and others. We wrap all these in three sub-projects: the Fengshenbang Model, the Fengshen Framework, and the Fengshen Benchmark. An open-source roadmap, Fengshenbang, aims to re-evaluate the open-source community of Chinese pre-trained large-scale models, prompting the development of the entire Chinese large-scale model community. We also want to build a user-centered open-source ecosystem to allow individuals to access the desired models to match their computing resources. Furthermore, we invite companies, colleges, and research institutions to collaborate with us to build the large-scale open-source model-based ecosystem. We hope that this project will be the foundation of Chinese cognitive intelligence.
翻译:现今,基础模型成为了人工智能中基本的基础设施之一,为通用智能铺平了道路。然而,现实中存在两个紧急的挑战:现有的基础模型被英语社区所主导;用户常常受到资源的限制,因此不能总是使用基础模型。为支持中国语言社区的发展,我们推出一个名为冯神帮的开源项目,由认知计算和自然语言研究中心(CCNL)主导。我们的项目具有综合能力,包括大型预训练模型、用户友好的API、基准、数据集等等。我们将所有这些都包含在三个子项目中: 冯神帮模型、冯神框架和冯神基准。一个开源的路线图,冯神帮,旨在重新评估中文预训练大型模型的开源社区,促进整个中文大型模型社区的发展。我们还希望建立一个以用户为中心的开源生态系统,让个人能够访问所需的模型,以匹配他们的计算资源。此外,我们邀请公司、高校和研究机构与我们合作,共同建立基于大型开源模型的生态系统。我们希望这个项目将成为中国认知智能的基石。