Brain Captioning: Decoding human brain activity into images and text

May 19, 2023

Matteo Ferrante, Furkan Ozcelik, Tommaso Boccato, Rufin VanRullen, Nicola Toschi

Every day, the human brain processes an immense volume of visual information, relying on intricate neural mechanisms to perceive and interpret these stimuli. Recent breakthroughs in functional magnetic resonance imaging (fMRI) have enabled scientists to extract visual information from human brain activity patterns. In this study, we present a method for decoding brain activity into meaningful images and captions, focusing on brain captioning because it is more flexible than decoding brain activity into images. Our approach takes advantage of cutting-edge image captioning models and incorporates an image reconstruction pipeline that uses latent diffusion models and depth estimation. We use the Natural Scenes Dataset, a comprehensive fMRI dataset from eight subjects who viewed images from the COCO dataset. We employ the Generative Image-to-text Transformer (GIT) as our backbone for captioning and propose a new image reconstruction pipeline based on latent diffusion models. The method involves training regularized linear regression models between brain activity and extracted features. Additionally, we incorporate depth maps from the ControlNet model to further guide the reconstruction process. We evaluate our methods using quantitative metrics for both generated captions and images. Our brain captioning approach outperforms existing methods, while our image reconstruction pipeline generates plausible images with improved spatial relationships. In conclusion, we demonstrate significant progress in brain decoding, showcasing the potential of integrating vision and language to better understand human cognition. Our approach provides a flexible platform for future research, with potential applications in various fields, including neural art, style transfer, and portable devices.
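The abstract's step of "training regularized linear regression models between brain activity and extracted features" can be illustrated with a minimal sketch. It is not the authors' code: it assumes ridge (L2) regularization as one common choice, solves it in closed form with NumPy, and uses synthetic arrays in place of real fMRI responses and model features. The function names, array sizes, and the `alpha` value are all hypothetical.

```python
import numpy as np


def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression: W = (X^T X + alpha * I)^{-1} X^T Y.

    X: (n_samples, n_voxels) brain activity, assumed roughly zero-mean.
    Y: (n_samples, n_features) target features (e.g. captioning-model embeddings).
    No intercept is fitted in this sketch.
    """
    n_voxels = X.shape[1]
    gram = X.T @ X + alpha * np.eye(n_voxels)
    return np.linalg.solve(gram, X.T @ Y)


def predict(X, W):
    """Map brain activity to predicted features with the learned weights."""
    return X @ W


def mean_columnwise_correlation(A, B):
    """Average Pearson correlation between matching columns of A and B."""
    A_c = A - A.mean(axis=0)
    B_c = B - B.mean(axis=0)
    num = (A_c * B_c).sum(axis=0)
    den = np.sqrt((A_c ** 2).sum(axis=0) * (B_c ** 2).sum(axis=0))
    return float(np.mean(num / den))


# Synthetic stand-in data (hypothetical sizes, not taken from the paper).
rng = np.random.default_rng(0)
n_train, n_test, n_voxels, n_features = 200, 50, 30, 8
W_true = rng.normal(size=(n_voxels, n_features))
X_train = rng.normal(size=(n_train, n_voxels))
X_test = rng.normal(size=(n_test, n_voxels))
Y_train = X_train @ W_true + 0.1 * rng.normal(size=(n_train, n_features))
Y_test = X_test @ W_true + 0.1 * rng.normal(size=(n_test, n_features))

# Fit on the training split, then predict features for held-out activity.
W_hat = fit_ridge(X_train, Y_train, alpha=10.0)
Y_pred = predict(X_test, W_hat)
score = mean_columnwise_correlation(Y_pred, Y_test)
print(f"mean held-out correlation: {score:.3f}")
```

In a pipeline of the kind the abstract describes, the predicted features would then be passed to a pretrained captioning or image-generation model. In practice the regularization strength would typically be chosen by cross-validation, since real fMRI data has far more voxels than training samples.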
