Abstract
Based on an in-depth study of the 3D audio industry standard "3D Audio Coding and Rendering" (Audio Vivid), this paper analyzes the end-to-end technical framework for 3D audio coding and rendering. It introduces key technologies including neural-network-based general-bitrate audio coding, metadata coding, loudspeaker rendering, and binaural rendering, and reports on the end-to-end technical trials of the Audio Vivid standard conducted during the Qatar World Cup, providing a technical reference for deploying the standard.
Authors
Zhou Yun (周芸)
Pang Chao (庞超)
Wang Zhe (王喆)
Guo Xiaoqiang (郭晓强)
Affiliations: Academy of Broadcasting Science, NRTA, Beijing 100866, China; China Media Group, Beijing 100020, China; Huawei Technology Co., Ltd., Beijing 100085, China
Source
Radio & TV Broadcast Engineering (《广播与电视技术》)
2023, No. 7, pp. 35-42 (8 pages)