A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize...A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients (GFCCs) extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio (SNR) environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech.展开更多
Purpose:There is a growing interest in speech intelligibility and audito ry perception of deaf children.The aim of the present study was to compare speech intelligibility and auditory perception of pre-school children...Purpose:There is a growing interest in speech intelligibility and audito ry perception of deaf children.The aim of the present study was to compare speech intelligibility and auditory perception of pre-school children with Hearing Aid(HA),Cochlear Implant(Cl),and Typical Hearing(TH).Methods:The research design was descriptive-analytic and comparative.The participants comprised 75 male pre-school children aged 4-6 years in the 2017-2018 from Tehran,Iran.The participants were divided into three groups,and each group consisted of 25 children.The first and second groups were respectively selected from pre-school children with HA and CI using the convenience sampling method,while the third group was selected from pre-school children with TH by random sampling method.All children completed Speech Intelligibility Rating and Catego ries of Auditory Performance Questionnaires.Results:The findings indicated that the mean scores of speech intelligibility and auditory perception of the group with TH were significantly higher than those of the other groups(P<0.0001).The mean scores of speech intelligibility in the group with CI did not significantly differ from those of the group with HA(P<0.38).Also,the mean scores of auditory perception in the group with CI were significantly higher than those of the group with HA(P<0.002).Conclusion:The results showed that auditory perception in children with CI was significantly higher than children with HA.This finding highlights the importance of cochlear implantation at a younger age and its significant impact on auditory perception in deaf children.展开更多
This study examines the effect of speech level on intelligibility in different reverberation conditions, and explores the potential of loudness-based reverberation parameters proposed by Lee et al. [J. Acoust. Soc. Am...This study examines the effect of speech level on intelligibility in different reverberation conditions, and explores the potential of loudness-based reverberation parameters proposed by Lee et al. [J. Acoust. Soc. Am., 131(2), 1194-1205 (2012)] to explain the effect of speech level on intelligibility in various reverberation conditions. Listening experiments were performed with three speech levels (LAeq of 55 dB, 65 dB and 75 dB) and three reverberation conditions (T20 of 1.0 s, 1.9 s and 4.0 s), and subjects listened to speech stimuli through headphones. Collected subjective data were compared with two conventional speech intelligibility parameters (Speech Intelligibility Index and Speech Transmission Index) and two loudness-based reverberation parameters (EDTN and TN). Results reveal that the effect of speech level on intelligibility changes with a room’s reverberation conditions, and that increased level results in reduced intelligibility in highly reverberant conditions. EDTN and TN explain this finding better than do STI and SII, because they consider many psychoacoustic phenomena important for the modeling of the effect of speech level varying with reverberation.展开更多
Speech intelligibility enhancement in noisy environments is still one of the major challenges for hearing impaired in everyday life.Recently,Machine-learning based approaches to speech enhancement have shown great pro...Speech intelligibility enhancement in noisy environments is still one of the major challenges for hearing impaired in everyday life.Recently,Machine-learning based approaches to speech enhancement have shown great promise for improving speech intelligibility.Two key issues of these approaches are acoustic features extracted from noisy signals and classifiers used for supervised learning.In this paper,features are focused.Multi-resolution power-normalized cepstral coefficients(MRPNCC)are proposed as a new feature to enhance the speech intelligibility for hearing impaired.The new feature is constructed by combining four cepstrum at different time–frequency(T–F)resolutions in order to capture both the local and contextual information.MRPNCC vectors and binary masking labels calculated by signals passed through gammatone filterbank are used to train support vector machine(SVM)classifier,which aim to identify the binary masking values of the T–F units in the enhancement stage.The enhanced speech is synthesized by using the estimated masking values and wiener filtered T–F unit.Objective experimental results demonstrate that the proposed feature is superior to other comparing features in terms of HIT-FA,STOI,HASPI and PESQ,and that the proposed algorithm not only improves speech intelligibility but also improves speech quality slightly.Subjective tests validate the effectiveness of the proposed algorithm for hearing impaired.展开更多
The present study examines accentedness, comprehensibility, and intelligibility of American English, Moroccan English, Turkmen English, and Chinese English, from the perspectives of three groups of listeners: native s...The present study examines accentedness, comprehensibility, and intelligibility of American English, Moroccan English, Turkmen English, and Chinese English, from the perspectives of three groups of listeners: native speakers of English and Chinese speakers of English with or without international experiences. Of the 145 listeners, 38 had face-to-face interviews. These listener groups were asked to listen to the recordings of the four English varieties and fill in a cloze test.Results indicated that the three listener groups differed significantly in rating comprehensibility of American English, Moroccan English and Chinese English but they did not give Turkmen English statistically different ratings;there were no significant differences in accentedness ratings except for Chinese English;and there were significant differences in the intelligibility scores of the four English varieties. In addition, with respect to seven linguistic variables-speed, clarity, intonation,smoothness and fluency, vocal intensity, pause, vocabulary and grammar, there were significant differences in three listener groups’ rating of six variables in American English except the one of“speed”. By contrast, Chinese English received significantly different ratings only in “proper speed”. No differences were found in the ratings for Moroccan English and Turkmen English.Finally, unlike Chinese listeners without international experiences, native listeners and Chinese listeners with international experiences shared some similarities in correlations between the ratings of accentedness and comprehensibility and those of linguistic variables on Moroccan English,Turkmen English, and Chinese English. However, the results for American English from Chinese listeners without international experiences and native listeners seemed to be more alike. Linguistic variables correlating with accentedness and comprehensibility of American English showed a mixed profile. Qualitative data provided more variant elaborations on the pronunciations and language uses of the speakers.展开更多
The aim of the study was to evaluate the alterations in speech intelligibility in a cleft palate patient, before and after extending and modifying the palatal contour of the existing prosthesis using a correctable wax...The aim of the study was to evaluate the alterations in speech intelligibility in a cleft palate patient, before and after extending and modifying the palatal contour of the existing prosthesis using a correctable wax recording. An eight-year-old girl studying in second grade with a velopharyngeal defect using an obturator reported to the outpatient clinic complaining of lack in clarity of speech. The existing prosthesis was lacking a speech bulb hence it was decided to add the speech bulb to the existing prosthesis and evaluate the speech. Even after the use of speech bulb it was observed that she was unable to pronounce the vowels and words like shoe, vision, cheer, etc. clearly. Hence, a palatography was done using a correctable wax technique and the existing prosthesis was altered accordingly. Great improvement in speech, mastication, and velopharyngeal function was achieved after the palatography alteration of the existing prosthesis.展开更多
In this paper I examine the following claims by William Eaton in his monograph Boyle on Fire: (i) that Boyle's religious convictions led him to believe that the world was not completely explicable, and this shows ...In this paper I examine the following claims by William Eaton in his monograph Boyle on Fire: (i) that Boyle's religious convictions led him to believe that the world was not completely explicable, and this shows that there is a shortcoming in the power of mechanical explanations; (ii) that mechanical explanations offer only sufficient, not necessary explanations, and this too was taken by Boyle to be a limit in the explanatory power of mechanical explanations; (iii) that the mature Boyle thought that there could be more intelligible explanatory models than mechanism; and (iv) that what Boyle says at any point in his career is incompatible with the statement of Maria Boas-Hall, i.e., that the mechanical hypothesis can explicate all natural phenomena. Since all four of these claims are part of Eaton's developmental argument, my rejection of them will not only show how the particular developmental story Eaton diagnoses is inaccurate, but will also explain what limits there actually are in Boyle's account of the intelligibility of mechanical explanations. My account will also show why important philosophers like Locke and Leibniz should be interested in Boyle's philosophical work.展开更多
Based on the curriculum proposed by the Chilean Ministry of Education,schools are offered textbooks and complementary guides for teachers as a tool for the learning process of the students.Bearing in mind that the Min...Based on the curriculum proposed by the Chilean Ministry of Education,schools are offered textbooks and complementary guides for teachers as a tool for the learning process of the students.Bearing in mind that the Ministry suggests a list of learning targets that must be accomplished at the end of the academic year,the author of said textbooks offers activities that are meant to fulfil these objectives.However,and considering that one of the goals of the Ministry is that 5th grade students,at the end of their learning process,communicate an intelligible message with a proper pronunciation,a lack of activities and methodologies focused on the improvement of these aspects can be noticed.Therefore,this proposal will present teachers with phonics as a methodology that may help students improve intelligibility and the recognition and production of segmental aspects of pronunciation.展开更多
The intelligibility of Thai speakers’English pronunciation from Chinese listeners’perspectives have not been analyzed deeply in current researches.As is known to all,the interconnection between China and Thailand is...The intelligibility of Thai speakers’English pronunciation from Chinese listeners’perspectives have not been analyzed deeply in current researches.As is known to all,the interconnection between China and Thailand is greatly developed with the sup⁃port of‘One Belt and One Road initiative’.However,it is inevitable for non-native English speakers to encounter some pronuncia⁃tion problems,which would affect them to communicate with each other.As for Chinese English learners,Thai accent to some ex⁃tent would be a great challenge during the conversation with Thai speakers.In order to promote a better communication,it is neces⁃sary to analyze the origin of Thai speakers’English pronunciation problems.Therefore,this research based on the previous studies to categorize the common pronunciation problems among Thai speakers.However,there are numerous English learners in Thailand that have various language proficiency.In this respect,two Thai speakers with different IELTS scores have been selected for com⁃parison,and six postgraduate students in the Education University of Hong Kong are invited to serve as a listener.As a result,this research will base on the listeners’feedback to highlight some suggestions to enhance the intelligibility of Thai speakers’pronun⁃ciation.展开更多
As the primary means of communication,speech is an essential aspect for humans to interact and build connections in the social world.Speech intelligibility is critical in social communication;unintelligibility may lea...As the primary means of communication,speech is an essential aspect for humans to interact and build connections in the social world.Speech intelligibility is critical in social communication;unintelligibility may lead to confusion,misunderstanding,and frustration.Many Chinese learners of English find it challenging to apply English into social interaction and reach mutual intelligibility with international communicators.This article analyzes the obstacles impeding Chinese EFL learners’speech intelligibility development,from the aspects of phonology(segmental and suprasegmental features)and pragmatics.Some strategies are proposed to help Chinese learners ameliorate phonology and pragmatics problems and improve speech intelligibility in English communication.展开更多
Pronunciation plays an important role in English language teaching. This essay reviews literature on English pronunciation teaching and English as a Lingua Franca. Through examining the priorities in English pronuncia...Pronunciation plays an important role in English language teaching. This essay reviews literature on English pronunciation teaching and English as a Lingua Franca. Through examining the priorities in English pronunciation teaching, teachers' and students' attitudes towards intelligibility and native-like proficiency, as well as the value of adhering to a language standard, the author provides rational reasons for choosing intelligibility rather than native-like proficiency as the priority of English pronunciation teaching.展开更多
As English serves as an international language(or Lingua Franca),increasing attention is paid to English pedagogy of English as International Language(EIL).This essay maintains that international mutual intelligibilit...As English serves as an international language(or Lingua Franca),increasing attention is paid to English pedagogy of English as International Language(EIL).This essay maintains that international mutual intelligibility is a more practical and achievable goal in English pronunciation pedagogy,compared with native-like proficiency and various of standard English.Meanwhile,Jenkins(2000)’s Lingua Franca Core(LFC),a Phonological Core for EIL,is briefly introduced.展开更多
Objective:To observe the efficacy of oral-facial acupressure combined with oral placement therapy(OPT)in improving articulation clarity in 120 children with spastic cerebral palsy,and to explore effective therapeutic ...Objective:To observe the efficacy of oral-facial acupressure combined with oral placement therapy(OPT)in improving articulation clarity in 120 children with spastic cerebral palsy,and to explore effective therapeutic solutions for speech disorders associated with spastic cerebral palsy.Methods:A total of 120 children with spastic cerebral palsy and speech disorders,meeting the inclusion criteria,were randomly assigned into two groups:60 cases in the treatment group and 60 cases in the control group.The treatment group received orofacial acupressure combined with OPT,while the control group received only OPT.The Oral Motor Function Assessment Scale(OMFAS),developed by the China Rehabilitation Research Centre(CRRC),was used to evaluate the treatment outcomes before and after the intervention.Results:After the treatment,both the treatment and control groups showed improved mobility of the mandible,lips,and tongue.However,the treatment group exhibited significantly better improvement than the control group,with the difference between the two groups being statistically significant(P<0.05).Conclusion:Oral-facial acupressure combined with OPT can effectively improve articulation clarity in children with spastic cerebral palsy.This combined therapy is recommended for clinical promotion and application.展开更多
DURING our discussion at workshops for writing“What Does ChatGPT Say:The DAO from Algorithmic Intelligence to Linguistic Intelligence”[1],we had expected the next milestone for Artificial Intelligence(AI)would be in...DURING our discussion at workshops for writing“What Does ChatGPT Say:The DAO from Algorithmic Intelligence to Linguistic Intelligence”[1],we had expected the next milestone for Artificial Intelligence(AI)would be in the direction of Imaginative Intelligence(II),i.e.,something similar to automatic wordsto-videos generation or intelligent digital movies/theater technology that could be used for conducting new“Artificiofactual Experiments”[2]to replace conventional“Counterfactual Experiments”in scientific research and technical development for both natural and social studies[2]-[6].Now we have OpenAI’s Sora,so soon,but this is not the final,actually far away,and it is just the beginning.展开更多
Colorectal cancer(CRC)is one of the most prevalent malignancies worldwide,being the third most commonly diagnosed malignancy and the second leading cause of cancer-related deaths globally.Despite the progress in scree...Colorectal cancer(CRC)is one of the most prevalent malignancies worldwide,being the third most commonly diagnosed malignancy and the second leading cause of cancer-related deaths globally.Despite the progress in screening,early diagnosis,and treatment,approximately 20%-25%of CRC patients still present with metastatic disease at the time of their initial diagnosis.Furthermore,the burden of disease is still expected to increase,especially in individuals younger than 50 years old,among whom early-onset CRC incidence has been increasing.Screening and early detection are pivotal to improve CRC-related outcomes.It is well established that CRC screening not only reduces incidence,but also decreases deaths from CRC.Diverse screening strategies have proven effective in decreasing both CRC incidence and mortality,though variations in efficacy have been reported across the literature.However,uncertainties persist regarding the optimal screening method,age intervals and periodicity.Moreover,adherence to CRC screening remains globally low.In recent years,emerging technologies,notably artificial intelligence,and non-invasive biomarkers,have been developed to overcome these barriers.However,controversy exists over the actual impact of some of the new discoveries on CRC-related outcomes and how to effectively integrate them into daily practice.In this review,we aim to cover the current evidence surrounding CRC screening.We will further critically assess novel approaches under investigation,in an effort to differentiate promising inno-vations from mere novelties.展开更多
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ...Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.展开更多
The release of the generative pre-trained transformer(GPT)series has brought artificial general intelligence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to defi...The release of the generative pre-trained transformer(GPT)series has brought artificial general intelligence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to define and evaluate AGI remain unclear.This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions(DEPSI).More specifically,we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system.The Tong test describes a value-and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI,allowing for infinite task generation.We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized,quantitative,and objective benchmarks and evaluation of AGI.展开更多
Artificial intelligence(AI)models have significantly impacted various areas of the atmospheric sciences,reshaping our approach to climate-related challenges.Amid this AI-driven transformation,the foundational role of ...Artificial intelligence(AI)models have significantly impacted various areas of the atmospheric sciences,reshaping our approach to climate-related challenges.Amid this AI-driven transformation,the foundational role of physics in climate science has occasionally been overlooked.Our perspective suggests that the future of climate modeling involves a synergistic partnership between AI and physics,rather than an“either/or”scenario.Scrutinizing controversies around current physical inconsistencies in large AI models,we stress the critical need for detailed dynamic diagnostics and physical constraints.Furthermore,we provide illustrative examples to guide future assessments and constraints for AI models.Regarding AI integration with numerical models,we argue that offline AI parameterization schemes may fall short of achieving global optimality,emphasizing the importance of constructing online schemes.Additionally,we highlight the significance of fostering a community culture and propose the OCR(Open,Comparable,Reproducible)principles.Through a better community culture and a deep integration of physics and AI,we contend that developing a learnable climate model,balancing AI and physics,is an achievable goal.展开更多
Artificial intelligence can be indirectly applied to the repair of peripheral nerve injury.Specifically,it can be used to analyze and process data regarding peripheral nerve injury and repair,while study findings on p...Artificial intelligence can be indirectly applied to the repair of peripheral nerve injury.Specifically,it can be used to analyze and process data regarding peripheral nerve injury and repair,while study findings on peripheral nerve injury and repair can provide valuable data to enrich artificial intelligence algorithms.To investigate advances in the use of artificial intelligence in the diagnosis,rehabilitation,and scientific examination of peripheral nerve injury,we used CiteSpace and VOSviewer software to analyze the relevant literature included in the Web of Science from 1994–2023.We identified the following research hotspots in peripheral nerve injury and repair:(1)diagnosis,classification,and prognostic assessment of peripheral nerve injury using neuroimaging and artificial intelligence techniques,such as corneal confocal microscopy and coherent anti-Stokes Raman spectroscopy;(2)motion control and rehabilitation following peripheral nerve injury using artificial neural networks and machine learning algorithms,such as wearable devices and assisted wheelchair systems;(3)improving the accuracy and effectiveness of peripheral nerve electrical stimulation therapy using artificial intelligence techniques combined with deep learning,such as implantable peripheral nerve interfaces;(4)the application of artificial intelligence technology to brain-machine interfaces for disabled patients and those with reduced mobility,enabling them to control devices such as networked hand prostheses;(5)artificial intelligence robots that can replace doctors in certain procedures during surgery or rehabilitation,thereby reducing surgical risk and complications,and facilitating postoperative recovery.Although artificial intelligence has shown many benefits and potential applications in peripheral nerve injury and repair,there are some limitations to this technology,such as the consequences of missing or imbalanced data,low data accuracy and reproducibility,and ethical issues(e.g.,privacy,data security,research transparency).Future research should address the issue of data collection,as large-scale,high-quality clinical datasets are required to establish effective artificial intelligence models.Multimodal data processing is also necessary,along with interdisciplinary collaboration,medical-industrial integration,and multicenter,large-sample clinical studies.展开更多
In recent years,the global surge of High-speed Railway(HSR)revolutionized ground transportation,providing secure,comfortable,and punctual services.The next-gen HSR,fueled by emerging services like video surveillance,e...In recent years,the global surge of High-speed Railway(HSR)revolutionized ground transportation,providing secure,comfortable,and punctual services.The next-gen HSR,fueled by emerging services like video surveillance,emergency communication,and real-time scheduling,demands advanced capabilities in real-time perception,automated driving,and digitized services,which accelerate the integration and application of Artificial Intelligence(AI)in the HSR system.This paper first provides a brief overview of AI,covering its origin,evolution,and breakthrough applications.A comprehensive review is then given regarding the most advanced AI technologies and applications in three macro application domains of the HSR system:mechanical manufacturing and electrical control,communication and signal control,and transportation management.The literature is categorized and compared across nine application directions labeled as intelligent manufacturing of trains and key components,forecast of railroad maintenance,optimization of energy consumption in railroads and trains,communication security,communication dependability,channel modeling and estimation,passenger scheduling,traffic flow forecasting,high-speed railway smart platform.Finally,challenges associated with the application of AI are discussed,offering insights for future research directions.展开更多
基金The National Natural Science Foundation of China (No.61231002,61273266,51075068,60872073,60975017, 61003131)the Ph.D.Programs Foundation of the Ministry of Education of China(No.20110092130004)+1 种基金the Science Foundation for Young Talents in the Educational Committee of Anhui Province(No. 2010SQRL018)the 211 Project of Anhui University(No.2009QN027B)
文摘A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients (GFCCs) extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio (SNR) environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech.
文摘Purpose:There is a growing interest in speech intelligibility and audito ry perception of deaf children.The aim of the present study was to compare speech intelligibility and auditory perception of pre-school children with Hearing Aid(HA),Cochlear Implant(Cl),and Typical Hearing(TH).Methods:The research design was descriptive-analytic and comparative.The participants comprised 75 male pre-school children aged 4-6 years in the 2017-2018 from Tehran,Iran.The participants were divided into three groups,and each group consisted of 25 children.The first and second groups were respectively selected from pre-school children with HA and CI using the convenience sampling method,while the third group was selected from pre-school children with TH by random sampling method.All children completed Speech Intelligibility Rating and Catego ries of Auditory Performance Questionnaires.Results:The findings indicated that the mean scores of speech intelligibility and auditory perception of the group with TH were significantly higher than those of the other groups(P<0.0001).The mean scores of speech intelligibility in the group with CI did not significantly differ from those of the group with HA(P<0.38).Also,the mean scores of auditory perception in the group with CI were significantly higher than those of the group with HA(P<0.002).Conclusion:The results showed that auditory perception in children with CI was significantly higher than children with HA.This finding highlights the importance of cochlear implantation at a younger age and its significant impact on auditory perception in deaf children.
文摘This study examines the effect of speech level on intelligibility in different reverberation conditions, and explores the potential of loudness-based reverberation parameters proposed by Lee et al. [J. Acoust. Soc. Am., 131(2), 1194-1205 (2012)] to explain the effect of speech level on intelligibility in various reverberation conditions. Listening experiments were performed with three speech levels (LAeq of 55 dB, 65 dB and 75 dB) and three reverberation conditions (T20 of 1.0 s, 1.9 s and 4.0 s), and subjects listened to speech stimuli through headphones. Collected subjective data were compared with two conventional speech intelligibility parameters (Speech Intelligibility Index and Speech Transmission Index) and two loudness-based reverberation parameters (EDTN and TN). Results reveal that the effect of speech level on intelligibility changes with a room’s reverberation conditions, and that increased level results in reduced intelligibility in highly reverberant conditions. EDTN and TN explain this finding better than do STI and SII, because they consider many psychoacoustic phenomena important for the modeling of the effect of speech level varying with reverberation.
基金supported by the National Natural Science Foundation of China(Nos.61902158,61673108)the Science and Technology Program of Nantong(JC2018129,MS12018082)Top-notch Academic Programs Project of Jiangsu Higher Education Institu-tions(PPZY2015B135).
文摘Speech intelligibility enhancement in noisy environments is still one of the major challenges for hearing impaired in everyday life.Recently,Machine-learning based approaches to speech enhancement have shown great promise for improving speech intelligibility.Two key issues of these approaches are acoustic features extracted from noisy signals and classifiers used for supervised learning.In this paper,features are focused.Multi-resolution power-normalized cepstral coefficients(MRPNCC)are proposed as a new feature to enhance the speech intelligibility for hearing impaired.The new feature is constructed by combining four cepstrum at different time–frequency(T–F)resolutions in order to capture both the local and contextual information.MRPNCC vectors and binary masking labels calculated by signals passed through gammatone filterbank are used to train support vector machine(SVM)classifier,which aim to identify the binary masking values of the T–F units in the enhancement stage.The enhanced speech is synthesized by using the estimated masking values and wiener filtered T–F unit.Objective experimental results demonstrate that the proposed feature is superior to other comparing features in terms of HIT-FA,STOI,HASPI and PESQ,and that the proposed algorithm not only improves speech intelligibility but also improves speech quality slightly.Subjective tests validate the effectiveness of the proposed algorithm for hearing impaired.
文摘The present study examines accentedness, comprehensibility, and intelligibility of American English, Moroccan English, Turkmen English, and Chinese English, from the perspectives of three groups of listeners: native speakers of English and Chinese speakers of English with or without international experiences. Of the 145 listeners, 38 had face-to-face interviews. These listener groups were asked to listen to the recordings of the four English varieties and fill in a cloze test.Results indicated that the three listener groups differed significantly in rating comprehensibility of American English, Moroccan English and Chinese English but they did not give Turkmen English statistically different ratings;there were no significant differences in accentedness ratings except for Chinese English;and there were significant differences in the intelligibility scores of the four English varieties. In addition, with respect to seven linguistic variables-speed, clarity, intonation,smoothness and fluency, vocal intensity, pause, vocabulary and grammar, there were significant differences in three listener groups’ rating of six variables in American English except the one of“speed”. By contrast, Chinese English received significantly different ratings only in “proper speed”. No differences were found in the ratings for Moroccan English and Turkmen English.Finally, unlike Chinese listeners without international experiences, native listeners and Chinese listeners with international experiences shared some similarities in correlations between the ratings of accentedness and comprehensibility and those of linguistic variables on Moroccan English,Turkmen English, and Chinese English. However, the results for American English from Chinese listeners without international experiences and native listeners seemed to be more alike. Linguistic variables correlating with accentedness and comprehensibility of American English showed a mixed profile. Qualitative data provided more variant elaborations on the pronunciations and language uses of the speakers.
文摘The aim of the study was to evaluate the alterations in speech intelligibility in a cleft palate patient, before and after extending and modifying the palatal contour of the existing prosthesis using a correctable wax recording. An eight-year-old girl studying in second grade with a velopharyngeal defect using an obturator reported to the outpatient clinic complaining of lack in clarity of speech. The existing prosthesis was lacking a speech bulb hence it was decided to add the speech bulb to the existing prosthesis and evaluate the speech. Even after the use of speech bulb it was observed that she was unable to pronounce the vowels and words like shoe, vision, cheer, etc. clearly. Hence, a palatography was done using a correctable wax technique and the existing prosthesis was altered accordingly. Great improvement in speech, mastication, and velopharyngeal function was achieved after the palatography alteration of the existing prosthesis.
文摘In this paper I examine the following claims by William Eaton in his monograph Boyle on Fire: (i) that Boyle's religious convictions led him to believe that the world was not completely explicable, and this shows that there is a shortcoming in the power of mechanical explanations; (ii) that mechanical explanations offer only sufficient, not necessary explanations, and this too was taken by Boyle to be a limit in the explanatory power of mechanical explanations; (iii) that the mature Boyle thought that there could be more intelligible explanatory models than mechanism; and (iv) that what Boyle says at any point in his career is incompatible with the statement of Maria Boas-Hall, i.e., that the mechanical hypothesis can explicate all natural phenomena. Since all four of these claims are part of Eaton's developmental argument, my rejection of them will not only show how the particular developmental story Eaton diagnoses is inaccurate, but will also explain what limits there actually are in Boyle's account of the intelligibility of mechanical explanations. My account will also show why important philosophers like Locke and Leibniz should be interested in Boyle's philosophical work.
文摘Based on the curriculum proposed by the Chilean Ministry of Education,schools are offered textbooks and complementary guides for teachers as a tool for the learning process of the students.Bearing in mind that the Ministry suggests a list of learning targets that must be accomplished at the end of the academic year,the author of said textbooks offers activities that are meant to fulfil these objectives.However,and considering that one of the goals of the Ministry is that 5th grade students,at the end of their learning process,communicate an intelligible message with a proper pronunciation,a lack of activities and methodologies focused on the improvement of these aspects can be noticed.Therefore,this proposal will present teachers with phonics as a methodology that may help students improve intelligibility and the recognition and production of segmental aspects of pronunciation.
文摘The intelligibility of Thai speakers’English pronunciation from Chinese listeners’perspectives have not been analyzed deeply in current researches.As is known to all,the interconnection between China and Thailand is greatly developed with the sup⁃port of‘One Belt and One Road initiative’.However,it is inevitable for non-native English speakers to encounter some pronuncia⁃tion problems,which would affect them to communicate with each other.As for Chinese English learners,Thai accent to some ex⁃tent would be a great challenge during the conversation with Thai speakers.In order to promote a better communication,it is neces⁃sary to analyze the origin of Thai speakers’English pronunciation problems.Therefore,this research based on the previous studies to categorize the common pronunciation problems among Thai speakers.However,there are numerous English learners in Thailand that have various language proficiency.In this respect,two Thai speakers with different IELTS scores have been selected for com⁃parison,and six postgraduate students in the Education University of Hong Kong are invited to serve as a listener.As a result,this research will base on the listeners’feedback to highlight some suggestions to enhance the intelligibility of Thai speakers’pronun⁃ciation.
文摘As the primary means of communication,speech is an essential aspect for humans to interact and build connections in the social world.Speech intelligibility is critical in social communication;unintelligibility may lead to confusion,misunderstanding,and frustration.Many Chinese learners of English find it challenging to apply English into social interaction and reach mutual intelligibility with international communicators.This article analyzes the obstacles impeding Chinese EFL learners’speech intelligibility development,from the aspects of phonology(segmental and suprasegmental features)and pragmatics.Some strategies are proposed to help Chinese learners ameliorate phonology and pragmatics problems and improve speech intelligibility in English communication.
文摘Pronunciation plays an important role in English language teaching. This essay reviews literature on English pronunciation teaching and English as a Lingua Franca. Through examining the priorities in English pronunciation teaching, teachers' and students' attitudes towards intelligibility and native-like proficiency, as well as the value of adhering to a language standard, the author provides rational reasons for choosing intelligibility rather than native-like proficiency as the priority of English pronunciation teaching.
文摘As English serves as an international language(or Lingua Franca),increasing attention is paid to English pedagogy of English as International Language(EIL).This essay maintains that international mutual intelligibility is a more practical and achievable goal in English pronunciation pedagogy,compared with native-like proficiency and various of standard English.Meanwhile,Jenkins(2000)’s Lingua Franca Core(LFC),a Phonological Core for EIL,is briefly introduced.
文摘Objective:To observe the efficacy of oral-facial acupressure combined with oral placement therapy(OPT)in improving articulation clarity in 120 children with spastic cerebral palsy,and to explore effective therapeutic solutions for speech disorders associated with spastic cerebral palsy.Methods:A total of 120 children with spastic cerebral palsy and speech disorders,meeting the inclusion criteria,were randomly assigned into two groups:60 cases in the treatment group and 60 cases in the control group.The treatment group received orofacial acupressure combined with OPT,while the control group received only OPT.The Oral Motor Function Assessment Scale(OMFAS),developed by the China Rehabilitation Research Centre(CRRC),was used to evaluate the treatment outcomes before and after the intervention.Results:After the treatment,both the treatment and control groups showed improved mobility of the mandible,lips,and tongue.However,the treatment group exhibited significantly better improvement than the control group,with the difference between the two groups being statistically significant(P<0.05).Conclusion:Oral-facial acupressure combined with OPT can effectively improve articulation clarity in children with spastic cerebral palsy.This combined therapy is recommended for clinical promotion and application.
基金the National Natural Science Foundation of China(62271485,61903363,U1811463,62103411,62203250)the Science and Technology Development Fund of Macao SAR(0093/2023/RIA2,0050/2020/A1)。
文摘DURING our discussion at workshops for writing“What Does ChatGPT Say:The DAO from Algorithmic Intelligence to Linguistic Intelligence”[1],we had expected the next milestone for Artificial Intelligence(AI)would be in the direction of Imaginative Intelligence(II),i.e.,something similar to automatic wordsto-videos generation or intelligent digital movies/theater technology that could be used for conducting new“Artificiofactual Experiments”[2]to replace conventional“Counterfactual Experiments”in scientific research and technical development for both natural and social studies[2]-[6].Now we have OpenAI’s Sora,so soon,but this is not the final,actually far away,and it is just the beginning.
文摘Colorectal cancer(CRC)is one of the most prevalent malignancies worldwide,being the third most commonly diagnosed malignancy and the second leading cause of cancer-related deaths globally.Despite the progress in screening,early diagnosis,and treatment,approximately 20%-25%of CRC patients still present with metastatic disease at the time of their initial diagnosis.Furthermore,the burden of disease is still expected to increase,especially in individuals younger than 50 years old,among whom early-onset CRC incidence has been increasing.Screening and early detection are pivotal to improve CRC-related outcomes.It is well established that CRC screening not only reduces incidence,but also decreases deaths from CRC.Diverse screening strategies have proven effective in decreasing both CRC incidence and mortality,though variations in efficacy have been reported across the literature.However,uncertainties persist regarding the optimal screening method,age intervals and periodicity.Moreover,adherence to CRC screening remains globally low.In recent years,emerging technologies,notably artificial intelligence,and non-invasive biomarkers,have been developed to overcome these barriers.However,controversy exists over the actual impact of some of the new discoveries on CRC-related outcomes and how to effectively integrate them into daily practice.In this review,we aim to cover the current evidence surrounding CRC screening.We will further critically assess novel approaches under investigation,in an effort to differentiate promising inno-vations from mere novelties.
基金supported in part by the National Natural Science Foundation of China(62222301, 62073085, 62073158, 61890930-5, 62021003)the National Key Research and Development Program of China (2021ZD0112302, 2021ZD0112301, 2018YFC1900800-5)Beijing Natural Science Foundation (JQ19013)。
文摘Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence.
基金supported by the National Key Research and Development Program of China (2022ZD0114900).
文摘The release of the generative pre-trained transformer(GPT)series has brought artificial general intelligence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to define and evaluate AGI remain unclear.This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions(DEPSI).More specifically,we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system.The Tong test describes a value-and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI,allowing for infinite task generation.We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized,quantitative,and objective benchmarks and evaluation of AGI.
基金supported by the National Natural Science Foundation of China(Grant Nos.42141019 and 42261144687)and STEP(Grant No.2019QZKK0102)supported by the Korea Environmental Industry&Technology Institute(KEITI)through the“Project for developing an observation-based GHG emissions geospatial information map”,funded by the Korea Ministry of Environment(MOE)(Grant No.RS-2023-00232066).
文摘Artificial intelligence(AI)models have significantly impacted various areas of the atmospheric sciences,reshaping our approach to climate-related challenges.Amid this AI-driven transformation,the foundational role of physics in climate science has occasionally been overlooked.Our perspective suggests that the future of climate modeling involves a synergistic partnership between AI and physics,rather than an“either/or”scenario.Scrutinizing controversies around current physical inconsistencies in large AI models,we stress the critical need for detailed dynamic diagnostics and physical constraints.Furthermore,we provide illustrative examples to guide future assessments and constraints for AI models.Regarding AI integration with numerical models,we argue that offline AI parameterization schemes may fall short of achieving global optimality,emphasizing the importance of constructing online schemes.Additionally,we highlight the significance of fostering a community culture and propose the OCR(Open,Comparable,Reproducible)principles.Through a better community culture and a deep integration of physics and AI,we contend that developing a learnable climate model,balancing AI and physics,is an achievable goal.
基金supported by the Capital’s Funds for Health Improvement and Research,No.2022-2-2072(to YG).
文摘Artificial intelligence can be indirectly applied to the repair of peripheral nerve injury.Specifically,it can be used to analyze and process data regarding peripheral nerve injury and repair,while study findings on peripheral nerve injury and repair can provide valuable data to enrich artificial intelligence algorithms.To investigate advances in the use of artificial intelligence in the diagnosis,rehabilitation,and scientific examination of peripheral nerve injury,we used CiteSpace and VOSviewer software to analyze the relevant literature included in the Web of Science from 1994–2023.We identified the following research hotspots in peripheral nerve injury and repair:(1)diagnosis,classification,and prognostic assessment of peripheral nerve injury using neuroimaging and artificial intelligence techniques,such as corneal confocal microscopy and coherent anti-Stokes Raman spectroscopy;(2)motion control and rehabilitation following peripheral nerve injury using artificial neural networks and machine learning algorithms,such as wearable devices and assisted wheelchair systems;(3)improving the accuracy and effectiveness of peripheral nerve electrical stimulation therapy using artificial intelligence techniques combined with deep learning,such as implantable peripheral nerve interfaces;(4)the application of artificial intelligence technology to brain-machine interfaces for disabled patients and those with reduced mobility,enabling them to control devices such as networked hand prostheses;(5)artificial intelligence robots that can replace doctors in certain procedures during surgery or rehabilitation,thereby reducing surgical risk and complications,and facilitating postoperative recovery.Although artificial intelligence has shown many benefits and potential applications in peripheral nerve injury and repair,there are some limitations to this technology,such as the consequences of missing or imbalanced data,low data accuracy and reproducibility,and ethical issues(e.g.,privacy,data security,research transparency).Future research should address the issue of data collection,as large-scale,high-quality clinical datasets are required to establish effective artificial intelligence models.Multimodal data processing is also necessary,along with interdisciplinary collaboration,medical-industrial integration,and multicenter,large-sample clinical studies.
基金supported by the National Natural Science Foundation of China(62172033).
文摘In recent years,the global surge of High-speed Railway(HSR)revolutionized ground transportation,providing secure,comfortable,and punctual services.The next-gen HSR,fueled by emerging services like video surveillance,emergency communication,and real-time scheduling,demands advanced capabilities in real-time perception,automated driving,and digitized services,which accelerate the integration and application of Artificial Intelligence(AI)in the HSR system.This paper first provides a brief overview of AI,covering its origin,evolution,and breakthrough applications.A comprehensive review is then given regarding the most advanced AI technologies and applications in three macro application domains of the HSR system:mechanical manufacturing and electrical control,communication and signal control,and transportation management.The literature is categorized and compared across nine application directions labeled as intelligent manufacturing of trains and key components,forecast of railroad maintenance,optimization of energy consumption in railroads and trains,communication security,communication dependability,channel modeling and estimation,passenger scheduling,traffic flow forecasting,high-speed railway smart platform.Finally,challenges associated with the application of AI are discussed,offering insights for future research directions.