Vision论文 - 专知

会员服务 ·

Vision

Measuring AI Progress in Drug Discovery: A Reproducible Leaderboard for the Tox21 Challenge

Arxiv

0+阅读 · 11月18日

ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference

Arxiv

0+阅读 · 11月17日

Sketch-guided Cage-based 3D Gaussian Splatting Deformation

Arxiv

0+阅读 · 12月1日

Privacy-preserving fall detection at the edge using Sony IMX636 event-based vision sensor and Intel Loihi 2 neuromorphic processor

Arxiv

0+阅读 · 11月27日

MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance

Arxiv

0+阅读 · 12月9日

Factual and Musical Evaluation Metrics for Music Language Models

Arxiv

0+阅读 · 11月2日

Hybrid Temporal-8-Bit Spike Coding for Spiking Neural Network Surrogate Training

Arxiv

0+阅读 · 12月3日

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

Arxiv

0+阅读 · 12月3日

Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval

Arxiv

0+阅读 · 11月26日

Inference-Time Scaling of Diffusion Models for Infrared Data Generation

Arxiv

0+阅读 · 11月10日

How Robot Dogs See the Unseeable

Arxiv

0+阅读 · 11月20日

Object-Centric Vision Token Pruning for Vision Language Models

Arxiv

0+阅读 · 11月25日

Harmonizing Generalization and Specialization: Uncertainty-Informed Collaborative Learning for Semi-supervised Medical Image Segmentation

Arxiv

0+阅读 · 12月15日

Physical ID-Transfer Attacks against Multi-Object Tracking via Adversarial Trajectory

Arxiv

0+阅读 · 12月1日

Distilling Expert Surgical Knowledge: How to train local surgical VLMs for anatomy explanation in Complete Mesocolic Excision

Arxiv

0+阅读 · 12月5日

参考链接

微信扫码咨询专知VIP会员