AI引导数字内容生成的无线感知 (Guiding AI-Generated Digital Content with Wireless Perception) - 专知论文

会员服务 ·

0

AIGC · 模型生成 · AI · 骨架图 · 图像传输 ·

2023 年 3 月 26 日

Guiding AI-Generated Digital Content with Wireless Perception

翻译：AI引导数字内容生成的无线感知

Jiacheng Wang,Hongyang Du,Dusit Niyato,Zehui Xiong,Jiawen Kang,Shiwen Mao, Xuemin, Shen

Recent advances in artificial intelligence (AI), coupled with a surge in training data, have led to the widespread use of AI for digital content generation, with ChatGPT serving as a representative example. Despite the increased efficiency and diversity, the inherent instability of AI models poses a persistent challenge in guiding these models to produce the desired content for users. In this paper, we introduce an integration of wireless perception (WP) with AI-generated content (AIGC) and propose a unified WP-AIGC framework to improve the quality of digital content production. The framework employs a novel multi-scale perception technology to read user's posture, which is difficult to describe accurately in words, and transmits it to the AIGC model as skeleton images. Based on these images and user's service requirements, the AIGC model generates corresponding digital content. Since the production process imposes the user's posture as a constraint on the AIGC model, it makes the generated content more aligned with the user's requirements. Additionally, WP-AIGC can also accept user's feedback, allowing adjustment of computing resources at edge server to improve service quality. Experiments results verify the effectiveness of the WP-AIGC framework, highlighting its potential as a novel approach for guiding AI models in the accurate generation of digital content.

翻译：近年来，人工智能（AI）的快速发展，配合大量的训练数据，已经广泛应用于数字内容生成，以ChatGPT为代表。尽管具有更高效和更多样化的特点，但AI模型固有的不稳定性仍然是指导这些模型生成所需内容的持久挑战。在本文中，我们介绍了将无线感知（WP）与AI生成的内容（AIGC）相结合的方法，并提出了一个统一的WP-AIGC框架，以提高数字内容生成的质量。该框架采用了一种新颖的多尺度感知技术，读取用户的姿势并将其作为骨架图像传输给AIGC模型。根据这些图像和用户的服务要求，AIGC模型生成相应的数字内容。由于生成过程将用户的姿势作为AIGC模型的约束，使生成的内容更符合用户的要求。此外，WP-AIGC还可以接受用户的反馈，允许调整边缘服务器上的计算资源以改善服务质量。实验结果验证了WP-AIGC框架的有效性，突显其作为指导AI模型准确生成数字内容的新方法的潜力。

0

相关内容

AIGC

人工智能生成内容

【ACM UMAP 2022 】可复现推荐系统的语义感知内容表示，148页ppt

【ACM UMAP 2022 】可复现推荐系统的语义感知内容表示，148页ppt

专知会员服务

17+阅读 · 2022年7月6日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

专知会员服务

12+阅读 · 2022年3月24日

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

专知会员服务

22+阅读 · 2022年3月18日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

【CVPR2020】通过自适应GANs生成不同的图像，Diverse Image Generation via Self-Conditioned GANs

【CVPR2020】通过自适应GANs生成不同的图像，Diverse Image Generation via Self-Conditioned GANs

专知会员服务

34+阅读 · 2020年6月19日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

专知会员服务

208+阅读 · 2019年9月30日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

最新NLP论文阅读列表，包括对话、问答、摘要、翻译等（附资源）

最新NLP论文阅读列表，包括对话、问答、摘要、翻译等（附资源）

THU数据派

11+阅读 · 2019年3月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【论文推荐】最新六篇知识图谱相关论文—全局关系嵌入、时序关系提取、对抗学习、远距离关系、时序知识图谱

【论文推荐】最新六篇知识图谱相关论文—全局关系嵌入、时序关系提取、对抗学习、远距离关系、时序知识图谱

专知

23+阅读 · 2018年4月24日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

多尺度NED/DEM生成的数字综合理论和关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于多维信号星座图的高质量数字传输系统研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于波前编码的高质量近衍射极限光学遥感成像技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

多层QoS约束支持的遥感信息服务个性化搜索方法

国家自然科学基金

0+阅读 · 2012年12月31日

基于社交访问行为与传播特性的在线视频内容部署与传输方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

复杂无线环境下的主动跨层恶意节点定位算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

钝体尾流－压电薄膜涡激共振能量采集多场耦合机理的高频响TR-PIV实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

数字电视外辐射源雷达低空目标检测算法研究

国家自然科学基金

1+阅读 · 2011年12月31日

压缩采样框架下的自适应稀疏信号感知与重建

国家自然科学基金

0+阅读 · 2009年12月31日

无线网络中可扩展视频编码传输技术

国家自然科学基金

0+阅读 · 2009年12月31日

Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for Speech-to-Image Generation

Arxiv

0+阅读 · 2023年5月17日

CageViT: Convolutional Activation Guided Efficient Vision Transformer

Arxiv

0+阅读 · 2023年5月17日

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Arxiv

0+阅读 · 2023年5月15日

Detection and Mitigation of Byzantine Attacks in Distributed Training

Arxiv

0+阅读 · 2023年5月13日

Digital Forensics in the Age of Smart Environments: A Survey of Recent Advancements and Challenges

Arxiv

0+阅读 · 2023年5月12日

Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training

Arxiv

0+阅读 · 2023年5月12日

Regulating ChatGPT and other Large Generative AI Models

Arxiv

0+阅读 · 2023年5月12日

Multiverse at the Edge: Interacting Real World and Digital Twins for Wireless Beamforming

Arxiv

0+阅读 · 2023年5月10日

A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT

Arxiv

34+阅读 · 2023年3月7日

Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach

Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach

Arxiv

14+阅读 · 2019年12月27日

VIP会员

文章信息

相关主题

相关VIP内容

【ACM UMAP 2022 】可复现推荐系统的语义感知内容表示，148页ppt

【ACM UMAP 2022 】可复现推荐系统的语义感知内容表示，148页ppt

专知会员服务

17+阅读 · 2022年7月6日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

专知会员服务

12+阅读 · 2022年3月24日

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

专知会员服务

22+阅读 · 2022年3月18日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

【CVPR2020】通过自适应GANs生成不同的图像，Diverse Image Generation via Self-Conditioned GANs

【CVPR2020】通过自适应GANs生成不同的图像，Diverse Image Generation via Self-Conditioned GANs

专知会员服务

34+阅读 · 2020年6月19日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

专知会员服务

208+阅读 · 2019年9月30日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS2025】VideoLucy：用于长视频理解的深度记忆回溯机制

不确定环境下无人机与无人地面车辆编队的地下勘探规划算法 | 122页

【NTU博士论文】端到端鲁棒自动语音识别的最新进展

用于强化学习的扩散模型：基础、分类与发展

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

最新NLP论文阅读列表，包括对话、问答、摘要、翻译等（附资源）

最新NLP论文阅读列表，包括对话、问答、摘要、翻译等（附资源）

THU数据派

11+阅读 · 2019年3月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【论文推荐】最新六篇知识图谱相关论文—全局关系嵌入、时序关系提取、对抗学习、远距离关系、时序知识图谱

【论文推荐】最新六篇知识图谱相关论文—全局关系嵌入、时序关系提取、对抗学习、远距离关系、时序知识图谱

专知

23+阅读 · 2018年4月24日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

相关论文

Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for Speech-to-Image Generation

Arxiv

0+阅读 · 2023年5月17日

CageViT: Convolutional Activation Guided Efficient Vision Transformer

Arxiv

0+阅读 · 2023年5月17日

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Arxiv

0+阅读 · 2023年5月15日

Detection and Mitigation of Byzantine Attacks in Distributed Training

Arxiv

0+阅读 · 2023年5月13日

Digital Forensics in the Age of Smart Environments: A Survey of Recent Advancements and Challenges

Arxiv

0+阅读 · 2023年5月12日

Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training

Arxiv

0+阅读 · 2023年5月12日

Regulating ChatGPT and other Large Generative AI Models

Arxiv

0+阅读 · 2023年5月12日

Multiverse at the Edge: Interacting Real World and Digital Twins for Wireless Beamforming

Arxiv

0+阅读 · 2023年5月10日

A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT

Arxiv

34+阅读 · 2023年3月7日

Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach

Robust breast cancer detection in mammography and digital breast tomosynthesis using annotation-efficient deep learning approach

Arxiv

14+阅读 · 2019年12月27日

相关基金

多尺度NED/DEM生成的数字综合理论和关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于多维信号星座图的高质量数字传输系统研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于波前编码的高质量近衍射极限光学遥感成像技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

多层QoS约束支持的遥感信息服务个性化搜索方法

国家自然科学基金

0+阅读 · 2012年12月31日

基于社交访问行为与传播特性的在线视频内容部署与传输方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

复杂无线环境下的主动跨层恶意节点定位算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

钝体尾流－压电薄膜涡激共振能量采集多场耦合机理的高频响TR-PIV实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

数字电视外辐射源雷达低空目标检测算法研究

国家自然科学基金

1+阅读 · 2011年12月31日

压缩采样框架下的自适应稀疏信号感知与重建

国家自然科学基金

0+阅读 · 2009年12月31日

无线网络中可扩展视频编码传输技术

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员