Neural Encoding and Decoding With a Flow-Based Invertible Generative Model

Qiongyi Zhou; Changde Du; Dan Li; Haibao Wang; Jian K. Liu; Huiguang He

doi:10.1109/TCDS.2022.3176977

Qiongyi Zhou, Changde Du, Dan Li, Haibao Wang, Jian K. Liu, Huiguang He^*

^*Corresponding author for this work

Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

Recent studies on visual neural encoding and decoding have made significant progress, benefiting from the latest advances in deep neural networks having powerful representations. However, two challenges remain. First, the current decoding algorithms based on deep generative models always struggle with information losses, which may cause blurry reconstruction. Second, most studies model the neural encoding and decoding processes separately, neglecting the inherent dual relationship between the two tasks. In this article, we propose a novel neural encoding and decoding method with a two-stage flow-based invertible generative (FLIG) model to tackle the above issues. First, a convolutional autoencoder (CAE) is trained to bridge the stimuli space and the feature space. Second, an adversarial cross-modal normalizing flow is trained to build up a bijective transformation between image features and neural signals, with local and global constraints imposed on the latent space to render cross-modal alignment. The method eventually achieves bidirectional generation of visual stimuli and neural responses with a combination of the flow-based generator and the autoencoder. The FLIG model can minimize information losses and unify neural encoding and decoding into a single framework. Experimental results on different neural signals containing spike signals and functional magnetic resonance imaging demonstrate that our model achieves the best comprehensive performance among the comparison models.
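The bijective transformation mentioned in the abstract can be illustrated with a generic invertible layer. The sketch below is not the FLIG model: the abstract does not say which flow layers the authors use, so an affine coupling layer (a common normalizing-flow building block) is swapped in purely for illustration. The class name `AffineCoupling`, the tiny linear conditioner, and the helper `roundtrip_error` are all hypothetical. It only shows that such a layer maps a vector forward and recovers it exactly through the inverse, which is the property that lets one network serve both directions.

```python
import math
import random


class AffineCoupling:
    """A single affine coupling layer, invertible by construction.

    Illustrative only: this is a generic flow layer, not the architecture
    described in the paper.
    """

    def __init__(self, dim, seed=0):
        if dim < 2:
            raise ValueError("dim must be at least 2")
        rng = random.Random(seed)
        self.half = dim // 2
        rest = dim - self.half
        # Hypothetical conditioner: two small linear maps from the first
        # part of the vector to a log-scale and a shift for the second part.
        self.w_s = [[rng.gauss(0.0, 0.1) for _ in range(rest)]
                    for _ in range(self.half)]
        self.w_t = [[rng.gauss(0.0, 0.1) for _ in range(rest)]
                    for _ in range(self.half)]

    def _scale_shift(self, a):
        rest = len(self.w_s[0])
        # tanh keeps the log-scale bounded so exp() stays well behaved.
        log_s = [math.tanh(sum(a[i] * self.w_s[i][j] for i in range(self.half)))
                 for j in range(rest)]
        shift = [sum(a[i] * self.w_t[i][j] for i in range(self.half))
                 for j in range(rest)]
        return log_s, shift

    def forward(self, x):
        # The first part passes through unchanged; the second part is
        # scaled and shifted using values computed from the first part.
        a, b = list(x[:self.half]), list(x[self.half:])
        log_s, shift = self._scale_shift(a)
        return a + [bj * math.exp(s) + t for bj, s, t in zip(b, log_s, shift)]

    def inverse(self, y):
        # The unchanged first part lets us recompute the same scale and
        # shift, so the second part can be undone exactly.
        a, b = list(y[:self.half]), list(y[self.half:])
        log_s, shift = self._scale_shift(a)
        return a + [(bj - t) * math.exp(-s) for bj, s, t in zip(b, log_s, shift)]


def roundtrip_error(dim=6, seed=0):
    """Largest absolute difference between x and inverse(forward(x))."""
    layer = AffineCoupling(dim, seed)
    rng = random.Random(seed + 1)
    x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    x_rec = layer.inverse(layer.forward(x))
    return max(abs(u - v) for u, v in zip(x, x_rec))


if __name__ == "__main__":
    print("max round-trip error:", roundtrip_error())
```

Because the inverse reuses the untouched part of the input to recompute the scale and shift, no information is discarded in either direction; the round-trip error is limited only by floating-point precision.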

Original language	English
Pages (from-to)	724-736
Number of pages	13
Journal	IEEE Transactions on Cognitive and Developmental Systems
Volume	15
Issue number	2
Early online date	23 May 2022
DOIs	https://doi.org/10.1109/TCDS.2022.3176977
Publication status	Published - Jun 2023

Bibliographical note

Funding:
This work was supported in part by the National Natural Science Foundation of China under Grant 61976209 and Grant 61906188; in part by the CAS International Collaboration Key Project under Grant 173211KYSB20190024; and in part by the Strategic Priority Research Program of CAS under Grant XDB32040000

Keywords

Cross-modal generation
neural decoding
neural encoding
normalizing flow

Access to Document

10.1109/TCDS.2022.3176977

https://ieeexplore.ieee.org/document/9780264/

Cite this

@article{3da2a6fe7ef04d2a88314003e26e994d,

title = "Neural Encoding and Decoding With a Flow-Based Invertible Generative Model",

abstract = "Recent studies on visual neural encoding and decoding have made significant progress, benefiting from the latest advances in deep neural networks having powerful representations. However, two challenges remain. First, the current decoding algorithms based on deep generative models always struggle with information losses, which may cause blurry reconstruction. Second, most studies model the neural encoding and decoding processes separately, neglecting the inherent dual relationship between the two tasks. In this article, we propose a novel neural encoding and decoding method with a two-stage flow-based invertible generative (FLIG) model to tackle the above issues. First, a convolutional autoencoder (CAE) is trained to bridge the stimuli space and the feature space. Second, an adversarial cross-modal normalizing flow is trained to build up a bijective transformation between image features and neural signals, with local and global constraints imposed on the latent space to render cross-modal alignment. The method eventually achieves bidirectional generation of visual stimuli and neural responses with a combination of the flow-based generator and the autoencoder. The FLIG model can minimize information losses and unify neural encoding and decoding into a single framework. Experimental results on different neural signals containing spike signals and functional magnetic resonance imaging demonstrate that our model achieves the best comprehensive performance among the comparison models.",

keywords = "Cross-modal generation, neural decoding, neural encoding, normalizing flow",

author = "Qiongyi Zhou and Changde Du and Dan Li and Haibao Wang and Liu, {Jian K.} and Huiguang He",

note = "Funding: This work was supported in part by the National Natural Science Foundation of China under Grant 61976209 and Grant 61906188; in part by the CAS International Collaboration Key Project under Grant 173211KYSB20190024; and in part by the Strategic Priority Research Program of CAS under Grant XDB32040000",

year = "2023",

month = jun,

doi = "10.1109/TCDS.2022.3176977",

language = "English",

volume = "15",

pages = "724--736",

journal = "IEEE Transactions on Cognitive and Developmental Systems",

issn = "2379-8920",

publisher = "IEEE",

number = "2",

}

TY - JOUR

T1 - Neural Encoding and Decoding With a Flow-Based Invertible Generative Model

AU - Zhou, Qiongyi

AU - Du, Changde

AU - Li, Dan

AU - Wang, Haibao

AU - Liu, Jian K.

AU - He, Huiguang

N1 - Funding: This work was supported in part by the National Natural Science Foundation of China under Grant 61976209 and Grant 61906188; in part by the CAS International Collaboration Key Project under Grant 173211KYSB20190024; and in part by the Strategic Priority Research Program of CAS under Grant XDB32040000

PY - 2023/6

Y1 - 2023/6

N2 - Recent studies on visual neural encoding and decoding have made significant progress, benefiting from the latest advances in deep neural networks having powerful representations. However, two challenges remain. First, the current decoding algorithms based on deep generative models always struggle with information losses, which may cause blurry reconstruction. Second, most studies model the neural encoding and decoding processes separately, neglecting the inherent dual relationship between the two tasks. In this article, we propose a novel neural encoding and decoding method with a two-stage flow-based invertible generative (FLIG) model to tackle the above issues. First, a convolutional autoencoder (CAE) is trained to bridge the stimuli space and the feature space. Second, an adversarial cross-modal normalizing flow is trained to build up a bijective transformation between image features and neural signals, with local and global constraints imposed on the latent space to render cross-modal alignment. The method eventually achieves bidirectional generation of visual stimuli and neural responses with a combination of the flow-based generator and the autoencoder. The FLIG model can minimize information losses and unify neural encoding and decoding into a single framework. Experimental results on different neural signals containing spike signals and functional magnetic resonance imaging demonstrate that our model achieves the best comprehensive performance among the comparison models.

KW - Cross-modal generation

KW - neural decoding

KW - neural encoding

KW - normalizing flow

U2 - 10.1109/TCDS.2022.3176977

DO - 10.1109/TCDS.2022.3176977

M3 - Article

SN - 2379-8920

VL - 15

SP - 724

EP - 736

JO - IEEE Transactions on Cognitive and Developmental Systems

JF - IEEE Transactions on Cognitive and Developmental Systems

IS - 2

ER -
