MOS Papers - 专知

MOS

ARECHO: Autoregressive Evaluation via Chain-Based Hypothesis Optimization for Speech Multi-Metric Estimation

arXiv

0+ reads · October 30

Multi-Objective Search: Algorithms, Applications, and Emerging Directions

arXiv

0+ reads · October 29

Decoding the Ear: A Framework for Objectifying Expressiveness from Human Preference Through Efficient Alignment

arXiv

0+ reads · October 23

Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric

arXiv

0+ reads · October 8

Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

arXiv

0+ reads · October 2

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling

arXiv

0+ reads · October 1

LMM4Gen3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs

arXiv

0+ reads · April 29

LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs

arXiv

0+ reads · May 5

OPMOS: Ordered Parallel Algorithm for Multi-Objective Shortest-Paths

arXiv

0+ reads · April 15

WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction

arXiv

0+ reads · June 6

MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning

arXiv

0+ reads · June 18

Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation

arXiv

0+ reads · April 1

Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization

arXiv

0+ reads · March 28

Scaling Rich Style-Prompted Text-to-Speech Datasets

arXiv

0+ reads · March 6

Audio Large Language Models Can Be Descriptive Speech Quality Evaluators

arXiv

0+ reads · March 12
