Funding: The National Natural Science Foundation of China (No. 61571106, 61633013, 61673108, 81871444).
Abstract: To make full use of information from different representation subspaces, this study proposes a multi-head attention-based long short-term memory (LSTM) model for speech emotion recognition (SER). The proposed model uses frame-level features and takes the temporal information of emotional speech as the input of the LSTM layer. A multi-head time-dimension attention (MHTA) layer linearly projects the output of the LSTM layer into different subspaces to obtain reduced-dimension context vectors. To provide relatively important information from other dimensions, the output of MHTA, the output of feature-dimension attention, and the last time-step output of the LSTM are combined to form multiple context vectors as the input of the fully connected layer. To improve the performance of the multiple vectors, feature-dimension attention is applied to the all-time output of the first LSTM layer. The proposed model was evaluated on the eNTERFACE and GEMEP corpora. The results indicate that the proposed model outperforms LSTM by 14.6% and 10.5% on eNTERFACE and GEMEP, respectively, demonstrating its effectiveness in SER tasks.
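The time-dimension attention step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes dot-product scoring with the last time step as the query, and it replaces the learned linear projections into subspaces with plain slicing of the feature dimension. The function names (`time_attention`, `multi_head_time_attention`) are hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def time_attention(frames, query):
    """Attend over the time axis: weight each frame by its dot product with `query`."""
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frames]
    weights = softmax(scores)
    dim = len(frames[0])
    context = [sum(w * frame[j] for w, frame in zip(weights, frames))
               for j in range(dim)]
    return context, weights

def multi_head_time_attention(lstm_out, num_heads):
    """Return the concatenated per-head context vectors.

    ASSUMPTION: each head sees a contiguous slice of the feature dimension
    instead of a learned linear projection, and uses the last time step of
    its slice as the query.
    """
    dim = len(lstm_out[0])
    if dim % num_heads != 0:
        raise ValueError("feature dimension must be divisible by num_heads")
    head_dim = dim // num_heads
    contexts = []
    for h in range(num_heads):
        lo, hi = h * head_dim, (h + 1) * head_dim
        head_frames = [frame[lo:hi] for frame in lstm_out]
        context, _ = time_attention(head_frames, head_frames[-1])
        contexts.extend(context)
    return contexts

# Toy LSTM output: 3 time steps, 4 features, split into 2 heads.
lstm_out = [[1.0, 0.0, 0.0, 1.0],
            [0.0, 1.0, 1.0, 0.0],
            [1.0, 1.0, 0.0, 0.0]]
context_vector = multi_head_time_attention(lstm_out, num_heads=2)
```

In the model described above, such a vector would be combined with the feature-dimension attention output and the last time-step output before the fully connected layer; that combination is omitted here.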
Funding: Supported by the National Natural Science Foundation of China (50879085), the Program for New Century Excellent Talents in University (NCET-07-0778), and the Fundamental Research Funds for the Central Universities (2012QNA4020).
Abstract: This paper extends Mase's prediction model for multi-directional random wave transformation, which is based on an energy balance equation and accounts for wave shoaling, refraction, diffraction, reflection and breaking. The numerical model is improved by 1) introducing Wen's frequency spectrum and Mitsuyasu's directional function, which are more suitable for the coastal area of China; 2) considering energy dissipation caused by bottom friction, which ensures more accurate results for large-scale and shallow water areas; 3) taking into account a non-linear dispersion relation. The extended wave model is used to study the feasibility of constructing the Ai Hua yacht port in Qingdao, China, comparing two candidate port layouts. Wave fields inside the port are simulated for different incident wave directions, water levels and return periods, and two kinds of parameters are then calculated to evaluate the wave conditions for the two layouts. The analyses show that Layout I is better than Layout II. The results also show that the harbor will be calm for different wave directions under the design water level. Under the extreme water level, however, the wave conditions do not wholly meet the berthing requirements of a yacht port. For safety, the elevation of the breakwater might need to be increased to prevent wave overtopping under such a water level. The extended numerical simulation model may provide an effective approach to computing wave heights in a harbor.
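For context on the dispersion relation mentioned in item 3), the sketch below solves the standard linear dispersion relation, omega^2 = g k tanh(k h), for the wavenumber k by Newton iteration. This is only the textbook linear baseline: the abstract does not specify the form of the non-linear relation used in the extended model, so none is assumed here. The function name `wavenumber` and its parameters are hypothetical.

```python
import math

G = 9.81  # gravitational acceleration, m/s^2

def wavenumber(period_s, depth_m, tol=1e-12, max_iter=50):
    """Solve omega^2 = G * k * tanh(k * depth) for k (rad/m) by Newton's method.

    ASSUMPTION: linear wave theory; this is not the non-linear relation
    used in the paper's extended model.
    """
    omega = 2.0 * math.pi / period_s
    k = omega * omega / G  # deep-water wavenumber as the first guess
    for _ in range(max_iter):
        t = math.tanh(k * depth_m)
        f = G * k * t - omega * omega
        df = G * t + G * k * depth_m * (1.0 - t * t)
        step = f / df
        k -= step
        if abs(step) < tol * k:
            break
    return k

# Example: a 10 s wave in 5 m of water.
k = wavenumber(10.0, 5.0)
wavelength_m = 2.0 * math.pi / k
```

As depth decreases, k grows and the wavelength shortens for a fixed period, which is the effect that drives shoaling and refraction in models of this kind.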