GPT-5 Papers - Zhuanzhi

GPT-5

Reasoning Models Ace the CFA Exams

Arxiv

0+ reads · December 9

COMPARE: Clinical Optimization with Modular Planning and Assessment via RAG-Enhanced AI-OCT: Superior Decision Support for Percutaneous Coronary Intervention Compared to ChatGPT-5 and Junior Operators

Arxiv

0+ reads · December 11

Solving a Research Problem in Mathematical Statistics with AI Assistance

Arxiv

0+ reads · December 10

Solving a Research Problem in Mathematical Statistics with AI Assistance

Arxiv

0+ reads · December 17

ChatGPT-5 in Secondary Education: A Mixed-Methods Analysis of Student Attitudes, AI Anxiety, and Hallucination-Aware Use

Arxiv

0+ reads · November 30

EdgeRunner 20B: Military Task Parity with GPT-5 while Running on the Edge

Arxiv

0+ reads · October 30

Validating Formal Specifications with LLM-generated Test Cases

Arxiv

0+ reads · October 27

Benchmarking GPT-5 for biomedical natural language processing

Arxiv

0+ reads · October 23

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Arxiv

0+ reads · October 20

The GPT-4o Shock Emotional Attachment to AI Models and Its Impact on Regulatory Acceptance: A Cross-Cultural Analysis of the Immediate Transition from GPT-4o to GPT-5

Arxiv

0+ reads · October 18

MAGPIE: A benchmark for Multi-AGent contextual PrIvacy Evaluation

Arxiv

0+ reads · October 16

Toward LLM-Supported Automated Assessment of Critical Thinking Subskills

Arxiv

0+ reads · October 14

DocReward: A Document Reward Model for Structuring and Stylizing

Arxiv

0+ reads · October 13

Holistic Evaluation of Multimodal LLMs on Spatial Intelligence

Arxiv

0+ reads · October 13

GPT-5 Model Corrected GPT-4V's Chart Reading Errors, Not Prompting

Arxiv

0+ reads · October 8
