GPT-2论文 - 专知

会员服务 ·

GPT-2

Context-Emotion Aware Therapeutic Dialogue Generation: A Multi-component Reinforcement Learning Approach to Language Models for Mental Health Support

Arxiv

0+阅读 · 11月14日

Dissecting the Ledger: Locating and Suppressing "Liar Circuits" in Financial Large Language Models

Arxiv

0+阅读 · 11月24日

Universal Neurons in GPT-2: Emergence, Persistence, and Functional Impact

Arxiv

0+阅读 · 11月9日

Weak-to-Strong Generalization Even in Random Feature Networks, Provably

Arxiv

0+阅读 · 11月9日

RETTA: Retrieval-Enhanced Test-Time Adaptation for Zero-Shot Video Captioning

Arxiv

0+阅读 · 10月28日

Memory Mosaics at scale

Arxiv

0+阅读 · 10月28日

A Stylometric Application of Large Language Models

Arxiv

0+阅读 · 10月24日

Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders

Arxiv

0+阅读 · 10月23日

Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT

Arxiv

0+阅读 · 10月9日

Biasless Language Models Learn Unnaturally: How LLMs Fail to Distinguish the Possible from the Impossible

Biasless Language Models Learn Unnaturally: How LLMs Fail to Distinguish the Possible from the Impossible

Arxiv

0+阅读 · 10月8日

Evaluating The Impact of Stimulus Quality in Investigations of LLM Language Performance

Arxiv

0+阅读 · 10月7日

Hierarchical Semantic Retrieval with Cobweb

Arxiv

0+阅读 · 10月2日

Krony-PT: GPT2 compressed with Kronecker Products

Arxiv

0+阅读 · 9月30日

Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

Arxiv

0+阅读 · 10月1日

Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

Arxiv

0+阅读 · 9月29日

参考链接

微信扫码咨询专知VIP会员