The integration of artificial intelligence capabilities into modern software systems is increasingly being simplified through the use of cloud-based machine learning services and representational state transfer (REST) architectures. However, insufficient information about underlying model provenance and a lack of control over model evolution impede the wider adoption of these services in operational environments with strict security requirements. Furthermore, tools such as TensorFlow Serving allow models to be deployed as RESTful endpoints, but require error-prone transformations for PyTorch models, whose dynamic computational graphs contrast with the static computational graphs of TensorFlow. To enable rapid deployment of PyTorch models without intermediate transformations, we developed FlexServe, a simple library for deploying multi-model ensembles with flexible batching.
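The core idea, serving a multi-model ensemble behind a single REST endpoint with client-determined batch sizes, can be sketched as follows. This is a minimal illustration, not FlexServe's actual API: the handler, the `/predict` route, and the stand-in model functions are all hypothetical, and a real deployment would call trained PyTorch `nn.Module` instances under `torch.no_grad()` instead of the plain Python functions used here.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical stand-ins for PyTorch modules; a real service would load
# trained nn.Module objects and invoke them under torch.no_grad().
def model_a(batch):
    return [x * 2.0 for x in batch]

def model_b(batch):
    return [x + 1.0 for x in batch]

MODELS = [model_a, model_b]

def ensemble_predict(batch):
    """Run every model on the same batch and average their outputs."""
    outputs = [m(batch) for m in MODELS]
    return [sum(vals) / len(vals) for vals in zip(*outputs)]

class PredictHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Accept a JSON body like {"inputs": [1.0, 2.0]}; the batch size
        # is whatever the client sends (flexible batching).
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        preds = ensemble_predict(payload["inputs"])
        body = json.dumps({"predictions": preds}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):  # suppress per-request logging
        pass

def serve(port=8080):
    """Start the ensemble endpoint on a background thread."""
    server = HTTPServer(("127.0.0.1", port), PredictHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

Because each model consumes the raw request batch directly, no intermediate graph transformation (e.g. conversion to a static-graph format) is required, which is the property the abstract highlights for PyTorch deployment.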