期刊文献+
共找到541篇文章
< 1 2 28 >
每页显示 20 50 100
Linguistic Features of Oral English Corpora and Their Implications for English Teaching in Colleges and Universities
1
作者 Yanru Li Yu Bei 《Journal of Contemporary Educational Research》 2024年第2期208-212,共5页
The main goal of English teaching in colleges and universities is to cultivate students’ability to use the language,but many students are still unable to complete oral communication fluently after years of study.For ... The main goal of English teaching in colleges and universities is to cultivate students’ability to use the language,but many students are still unable to complete oral communication fluently after years of study.For this reason,teachers need to deeply analyze and study the linguistic features of oral English corpora and formulate reasonable teaching strategies to improve students’oral expression skills.This paper outlines the linguistic features of oral English corpora,comparatively analyzes the differences between oral English corpora and written English corpora,and explores effective teaching strategies,hoping to provide guidelines for relevant teachers. 展开更多
关键词 Oral English corpora Linguistic features Colleges and universities English teaching
下载PDF
Breathe New Life into English Teaching:A Book Review of Using Corpora in the Language Classroom 被引量:1
2
作者 唐磊 江晓敏 《海外英语》 2013年第13期93-94,共2页
The present article provides a critical review of Randi Reppen's impressing book Using Corpora in the Language Classroom.It's argued that Randi Reppen's book,despite a few slight flaws,has a strong practic... The present article provides a critical review of Randi Reppen's impressing book Using Corpora in the Language Classroom.It's argued that Randi Reppen's book,despite a few slight flaws,has a strong practical orientation and is a laudable effort to make English language teachers to realize the importance and practicality of bringing corpora into classroom in digital age.The book is particularly worthy of reading for those language teachers(especially beginner teachers) who want to breathe new life into their English teaching. 展开更多
关键词 English TEACHING BOOK review USING corpora in the
下载PDF
Classroom Application of Corpora for Extending Knowledge of English Lexis
3
作者 蔡蕾 《海外英语》 2018年第23期65-67,共3页
This paper examines the application of electronic corpora to English classroom of lexical learning.It starts with a literature review of basic issues on corpus linguistics and theories underlying lexical study,followe... This paper examines the application of electronic corpora to English classroom of lexical learning.It starts with a literature review of basic issues on corpus linguistics and theories underlying lexical study,followed by discussion on the specific lexical learning aspects in which a corpus might provide some insight.Based other research like data-driven learning(DDL)in this area,the paper goes further by exploring the possibility and ways of applying corpora in vocabulary learning classroom. 展开更多
关键词 corpora lexis LANGUAGE use COLLOCATION CONTEXT
下载PDF
Standard NER Tagging Scheme for Big Data Healthcare Analytics built on Unified Medical Corpora 被引量:1
4
作者 Sarah Shafqat Hammad Majeed +1 位作者 Qaisar Javaid Hafiz Farooq Ahmad 《Journal of Artificial Intelligence and Technology》 2022年第4期152-157,共6页
The motivation for this research comes from the gap found in discovering the common ground for medical context learning through analytics for different purposes of diagnosing,recommending,prescribing,or treating patie... The motivation for this research comes from the gap found in discovering the common ground for medical context learning through analytics for different purposes of diagnosing,recommending,prescribing,or treating patients for uniform phenotype features from patients’profile.The authors of this paper while searching for possible solutions for medical context learning found that unified corpora tagged with medical nomenclature was missing to train the analytics for medical context learning.Therefore,here we demonstrated a mechanism to come up with uniform NER(Named Entity Recognition)tagged medical corpora that is fed with 14407 endocrine patients’data set in Comma Separated Values(CSV)format diagnosed with diabetes mellitus and comorbidity diseases.The other corpus is of ICD-10-CM coding scheme in text format taken from www.icd10data.com.ICD-10-CM corpus is to be tagged for understanding the medical context with uniformity for which we are conducting different experiments using common natural language programming(NLP)techniques and frameworks like TensorFlow,Keras,Long Short-Term Memory(LSTM),and Bi-LSTM.In our preliminary experiments,albeit label sets in form of(instance,label)pair were tagged with Sequential()model formed on TensorFlow.Keras and Bi-LSTM NLP algorithms.The maximum accuracy achieved for model validation was 0.8846. 展开更多
关键词 big data endocrine diseases international diabetes federation healthcare analytics ICD-10 medical corpora NLP
下载PDF
A protective role of resveratrol against the effects of immobilization stress in corpora lutea of mice in early pregnancy
5
作者 Saif ULLAH Sheeraz MUSTAFA +6 位作者 Wael ENNAB Muhammad JAN Muhammad SHAFIQ Ngekure MXKAVITA LU Zeng-peng MAO Da-gan SHI Fang-xiong 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2020年第7期1857-1866,共10页
In the present study,we aimed to investigate a protective role for resveratrol against the effects of immobilization stress on corpora lutea(CL)of mice in early pregnancy.A total of 45 early-pregnant mice were divided... In the present study,we aimed to investigate a protective role for resveratrol against the effects of immobilization stress on corpora lutea(CL)of mice in early pregnancy.A total of 45 early-pregnant mice were divided into no immobilization stress(NIS)group,immobilization stress(IS)group,and immobilization and resveratrol treatment(IS+RES)group(n=15).Mice were immobilized in plastic tubes(50 mL)for 3 h per day during day 1 to 7 of pregnancy.In the IS+RES group,5 mg kg-'d-1 of resveratrol was administered just prior to application of stress.We analyzed apoptotic activity in CL by Western botting analysis(WB),transmission electron microscopy(TEM),and immunohistochemistry(IHC).Serum progesterone levels were examined with radioimmunoassay(RIA).IHC results showed that the intensity of positive staining for Bax was increased,and for BcI-2 was decreased in CL after IS,while resveratrol treatment reversed the positive staining for Bax and Bcl-2.WB revealed that immobilization stress up-regulated the expression of Bax and caspase-9,and down-regulated Bcl-2 expression,while resveratrol treatment attenuated the effects of immobilization stress on the expression of Bax,Bcl-2 and caspase-9.According to our TEM results,apoptosis as defined by chromatin condensation was found in CL after immobilization stress,while resveratrol inhibited the apoptosis.We also demonstrated that immobilization stress decreased progesterone concentrations and ovarian expression of StAR,while resveratrol restored the concentrations of progesterone and expression of StAR back to normal.These results indicated that immobilization stress induced luteal regression while resveratrol inhibited luteal regression,suggesting that resveratrol plays a protective role on corpora lutea of mice during early pregnancy. 展开更多
关键词 immobilization stress APOPTOSIS corpora lutea RESVERATROL pregnant mice
下载PDF
Contextual Text Mining Framework for Unstructured Textual Judicial Corpora through Ontologies
6
作者 Zubair Nabi Ramzan Talib +1 位作者 Muhammad Kashif Hanif Muhammad Awais 《Computer Systems Science & Engineering》 SCIE EI 2022年第12期1357-1374,共18页
Digitalization has changed the way of information processing, and newtechniques of legal data processing are evolving. Text mining helps to analyze andsearch different court cases available in the form of digital text... Digitalization has changed the way of information processing, and newtechniques of legal data processing are evolving. Text mining helps to analyze andsearch different court cases available in the form of digital text documents toextract case reasoning and related data. This sort of case processing helps professionals and researchers to refer the previous case with more accuracy in reducedtime. The rapid development of judicial ontologies seems to deliver interestingproblem solving to legal knowledge formalization. Mining context informationthrough ontologies from corpora is a challenging and interesting field. Thisresearch paper presents a three tier contextual text mining framework throughontologies for judicial corpora. This framework comprises on the judicial corpus,text mining processing resources and ontologies for mining contextual text fromcorpora to make text and data mining more reliable and fast. A top-down ontologyconstruction approach has been adopted in this paper. The judicial corpus hasbeen selected with a sufficient dataset to process and evaluate the results.The experimental results and evaluations show significant improvements incomparison with the available techniques. 展开更多
关键词 Natural language processing judicial corpora contextual text mining ontologies information extraction information retrieval
下载PDF
Expression of insulin-like growth factor-1 mRNA and protein level of corpora striata in ischemic side at the early stage of middle cerebral artery ischemia/reperfusion in rhesus monkeys
7
作者 Huanmin Gao Rui Zhang Yunliang Guo 《Neural Regeneration Research》 SCIE CAS CSCD 2006年第2期133-136,共4页
BACKGROUND: Insulin-like growth factor-I(IGF-1), as one of the important members of growth factor family, participants in the regulation of many physiological functions and behaviors, having very strong neuroprotectiv... BACKGROUND: Insulin-like growth factor-I(IGF-1), as one of the important members of growth factor family, participants in the regulation of many physiological functions and behaviors, having very strong neuroprotective effect. However, the expression of IGF-1 following cerebral ischemia/reperfusion is still disputed. OBJECTIVE: To observe the expression of IGF-1 and protein of corpora striata in ischemic side at the early stage of middle cerebral artery ischemia/reperfusion in rhesus monkey. DESIGN: A completely randomized grouping design, controlled animal experiment. SETTING: Institute of Cerebrovascular Disease, Affiliated Hospital of Medical College of Qingdao University. MATERIALS: ① Totally 17 rhesus monkeys , of either gender, aged 4 to 5 years, were enrolled . Seven rhesus monkeys observed with gene chip were randomly divided into 2 groups: sham operation group (n=3) and ischemia/reperfusion group (n=4). Ten rhesus monkeys observed with in situ hybridization and immunohistochemistry method were randomly divided into 2 groups: sham operation group (n=3)and ischemia/reperfusion group (n=7). Rhesus monkeys observed under microscope were divided into 2 groups: sham operation group (n=6) and ischamia/reperfusion group (n=11). ② Materials used in the experiment: cresyl violet (Sigma Company, America); immunohistochemical reagent kit ( Huamei Bio-engineering Company); In situ hybridization reagent kit (Boshide Bio-engineering Co.Ltd, Wuhan); 12 800 dots chip (Boxing Company, Shanghai). METHODS: This experiment was carried out at the Institute of Cerebrovascular Disease, Affiliated Hospital of Medical College of Qingdao University from January 2001 to December 2003. ① The onset area of middle cerebral artery was blocked for 2 hours, middle cerebral artery ischemia/reperfusion models were created. ② After ischemia/reperfusion for 24 hours, cerebral tissue sections of rhesus monkeys were prepared and stained with cresyl violet. Image analysis was performed with 500IW image analysis software. Morphological change of corpora striata of operative side was observed in the rhesus monkeys between two groups. Total RNA was extracted from cerebral tissue. ③ Detection of gene chip: Cy3-duTP and Cy5-duTP were used to respectively perform reverse transcription labeling. The sample was reversely transcribed into cDNA, then hybridized with cDNA of cerebral tissue. Genes with the separate absolute value of cy3 and cy5>800, cy3/cy5 > 2(high expression) or < 0.5 (low expression) were found out. Those were genes with differential expression. ④ The expressions of IGF-1 mRNA and protein level of corpora striata in ischemic side of rhesus monkeys were detected between sham operation group and ischemia/reperfusion group at 9 and 24 hours after ischemia/reperfusion with in situ hybridization method and immunohistochemical method. Brown granules were IGF-1 protein positive cells. ⑤ Analysis of variance was used in the difference comparison of measurement data among groups. MAIN OUTCOME MEASURES: ① Change of morphological structure of corpora striata at ischemic side in rhesus monkeys. ② Change of cerebral gene expression profiles at ischemia/reperfusion in rhesus monkeys between two groups. ③ Expression of IGF-1 mRNA and protein level of corpora striata at ischemia/reperfusion in rhesus monkeys between two groups. RESULTS: ① Pathological change : Obvious pathological change of cerebral infarction appeared in the ischemia and reperfusion group, while there was no such pathological change in the sham operation group. ② Change of gene expression profile : There were 4480 genes with difference expression in the ischemia/reperfusion group and sham-operation group, in which, 260 genes had high expression and their absolute value was over 800, and 63 genes had low expression. cy3/cy5 of IGF-1 was 0.379, being relative low expression. ③ IGF-1 mRNA and protein positive cell counts in corpora striata at cerebral ischemic side[IGF-1 mRNA:(9.72±1.18),(9.11±0.76),(14.77±0.60) counts/field;IGF-1 protein:(15.11±1.83),(15.39±0.78),(34.62±0.97)counts/field,P < 0.05-0.01]. CONCLUSION: IGF-1 mRNA and protein are lowly expressed in middle cerebral artery of rhesus monkeys at ischemia/reperfusion. 展开更多
关键词 IG Expression of insulin-like growth factor-1 mRNA and protein level of corpora striata in ischemic side at the early stage of middle cerebral artery ischemia/reperfusion in rhesus monkeys MRNA
下载PDF
A Novel Visualization Tool for Manual Annotation when Building Large Speech Corpora
8
作者 SHE Kun CHEN Shuzhen YANG Shen ZOU Lian 《Wuhan University Journal of Natural Sciences》 CAS 2006年第2期381-384,共4页
A novel visualized sound description, called sound dendrogram is proposed to make manual annotation easier when building large speech corpora. It is a lattice structure built from a group of “seed regions” and throu... A novel visualized sound description, called sound dendrogram is proposed to make manual annotation easier when building large speech corpora. It is a lattice structure built from a group of “seed regions” and through an iterative procedure of mergence. A simple but reliable extraction method of “seed regions” and advanced distance metric are adopted to construct the sound dendrogram, so that it can present speech’s structure character ranging from coarse to fine in a visualized way. Tests show that all phonemic boundaries are contained in the lattice structure of sound dendrogram and very easy to identify. Sound dendrogram can be a powerful assistant tool during the process of speech corpora’s manual annotation. 展开更多
关键词 可视化 人工注解 语音处理 计算机
下载PDF
The Youth’s View of Marriage in Chinese Mainstream Media Discourse-A Corpus-assisted Three-dimensional Discourse Analysis
9
作者 DING Shu-li LIN Ying 《Journal of Literature and Art Studies》 2024年第4期284-289,共6页
The rising of aging and the declining of birth rates have forced the public to focus on the youth’s view on marriage.Based on critical discourse analysis and combined with Fairclough’s three-dimensional discourse an... The rising of aging and the declining of birth rates have forced the public to focus on the youth’s view on marriage.Based on critical discourse analysis and combined with Fairclough’s three-dimensional discourse analysis model,this paper builds a“Chinese media News Report Corpus on the topic of‘marriage’”whose news are collected from China Daily.It is found that the discourses are neutral and objective with regard to the advantages and disadvantages of marriage,but in general,it is still a traditional view of marriage that is inevitable and closely related to fertility.Although this is controlled by the policies and the social reasons including declining fertility rate,it deviates from the current view of the youth towards marriage,resulting in many serious consequences such as young people’s rejection.In addition,this research found that male and female have great differences in their views on marriage,and men’s resistance to marriage is far greater than that of women,which is departure from the public’s cognition.The reasons behind this need to be explored in order to solve the marriage and love problems of young people in today’s era and realize the healthy development of young marriage. 展开更多
关键词 corpora Three-dimensional discourse analysis the youth’view of marriage
下载PDF
基于变形图匹配的知识图谱多跳问答
10
作者 李香粤 方全 +2 位作者 胡骏 钱胜胜 徐常胜 《北京航空航天大学学报》 EI CAS CSCD 北大核心 2024年第2期529-534,共6页
知识图谱问答(KGQA)是给定自然语言问题,对问题进行语义理解和解析,进而利用知识图谱进行查询、推理得出答案的过程。但知识图谱通常是不完整的,链接缺失给多跳问答带来许多挑战。许多方法在利用知识图谱嵌入时忽略了重要的路径信息来... 知识图谱问答(KGQA)是给定自然语言问题,对问题进行语义理解和解析,进而利用知识图谱进行查询、推理得出答案的过程。但知识图谱通常是不完整的,链接缺失给多跳问答带来许多挑战。许多方法在利用知识图谱嵌入时忽略了重要的路径信息来评估路径和多关系问题之间的相关性;且使用文本语料库也会限制文本增强模型的可扩展性。针对这些现有方法的缺陷,提出了基于变形图匹配的知识图谱问答(DGM-KGQA)模型,该模型同时利用问题和主题实体构建语义子图,与知识图谱的局部结构匹配并找到正确答案。在基准数据集MetaQA上的实验结果验证了DGM-KGQA的有效性,该模型在完整知识图谱上检索到的答案准确率分别比PullNet、EmbedKGQA增加了4.2%、0.8%;在完整度仅有一半的知识图谱上检索到的答案准确率分别比PullNet、EmbedKGQA增加了11.1%、0.5%。实验证明提出的变形图匹配模型能够有效地增强知识图谱的关联性及多跳问答的答案准确率。 展开更多
关键词 自然语言问题 链接缺失 文本语料库 多跳问答 变形图匹配
原文传递
Using parallel corpora in contrastive studies:Cross-linguistic contrast of future referring expressions in English and Norwegian 被引量:3
11
作者 Hilde Hasselgrd 《外语教学与研究》 CSSCI 北大核心 2012年第1期3-19,共17页
Multilingual corpora have well been recognised as a valuable resource in contrastive and translation studies.This article investigates the development and use of multilingual corpora with a focus on work done in Scand... Multilingual corpora have well been recognised as a valuable resource in contrastive and translation studies.This article investigates the development and use of multilingual corpora with a focus on work done in Scandinavia with the purpose of showing how parallel corpora can be useful within different fields of language descriptions:lexis,grammar and discourse.It also presents a case study that demonstrates how a parallel corpus can be used in comparing two seemingly equivalent future-referring expressions cross-linguistically,namely the English 'be going to' and the Norwegian 'kommer til '('come to'). 展开更多
关键词 parallel corpora future-referring expressions ENGLISH NORWEGIAN
原文传递
Expression and regulation of mRNAs for insulin-like growth factor-I receptor and LH receptor in corpora lutea 被引量:1
12
作者 罗文祥 祝诚 《Science China(Life Sciences)》 SCIE CAS 2000年第2期183-190,共8页
Relationship between insulin-like growth factor-l receptor (IGF-IR) and luteinizing hormone receptor (LHR) mRNA expression as well as their regulation was determined in rat corpora lutea (CL) . In the CL of estrous cy... Relationship between insulin-like growth factor-l receptor (IGF-IR) and luteinizing hormone receptor (LHR) mRNA expression as well as their regulation was determined in rat corpora lutea (CL) . In the CL of estrous cycle rat, LHR mRNA positive CL expressed high level of mRNA of IGF-IR. While the expression of LHR mRNA decreased on estrus, the CL still expressed relatively high level of IGF-IR mRNA. In pseudopregnant rat CL, the expression level of LHR mRNA was low on day 1, the most intense signals were detected on day 8, the signals of LHR mRNA became undetectable on day 14. In contrast to LHR expression, the high level of IGF-IR mRNA was observed in pseudopregnant CL of day 1, and thereafter its signals were detected from day 2 to day 14. Pregnant rat CL expressed both LHR and IGF-IR mRNAs. IGF-I stimulated LHR expression in CL. PGF2ainhibited expression of IGF-IR and LHR. PGE2 negated the inhibiting effects of PGF2α. These data suggest that IGF-I may be involved in regulating CL function, and 展开更多
关键词 IGF-I RECEPTOR LH RECEPTOR mRNA corpora lutea.
原文传递
In vitro Activation of Corpora Allata (CA) by the Allatotropic Factor From the Brains of Coccinella septempunctata
13
作者 关雪辰 欧阳迎春 王宗舜 《Chinese Science Bulletin》 SCIE EI CAS 1994年第6期509-513,共5页
1 Introduction Lady beetle (Coccinella septempunctata) is an important natural enemy of aphids.Our data show that JH produced in the corpora allata (CA) in the adult beetle playsa key role in regulating its reproducti... 1 Introduction Lady beetle (Coccinella septempunctata) is an important natural enemy of aphids.Our data show that JH produced in the corpora allata (CA) in the adult beetle playsa key role in regulating its reproduction. Recently, it is demonstrated that CA ininsects are target tissues of allatostatin and allatotropin. Juvenile hormone (JH) issynthesized and released by CA and plays a vital role in insect development, primarilyin the control of metamorphosis, sexual maturation and reproduction. The activity ofCA during reproduction could be modulated by stimulatory factors (Allatotropicfactor, ATF), inhibitory factors (Allatostatic factor AST) or both. 展开更多
关键词 corpora allata (CA) JUVENILE hormone (JH) allatotropic FACTOR (ATF) brain equivalent (BE).
下载PDF
基于语料库工具Wmatrix的商务语篇隐喻分析
14
作者 李晓冉 《语言与文化研究》 2024年第1期20-23,共4页
隐喻作为一种认知工具,广泛地存在于商务语篇中,用以解释复杂且抽象的商业现象。本研究以概念隐喻理论为理论框架,以具有语义域赋码功能的Wmatrix5.0为检索工具,并结合MIPVU隐喻识别方法,对2021年《经济学人》中357篇商务专栏报道进行... 隐喻作为一种认知工具,广泛地存在于商务语篇中,用以解释复杂且抽象的商业现象。本研究以概念隐喻理论为理论框架,以具有语义域赋码功能的Wmatrix5.0为检索工具,并结合MIPVU隐喻识别方法,对2021年《经济学人》中357篇商务专栏报道进行隐喻研究,分析商务经济话题中常见的概念隐喻现象。研究发现:商务语篇中最常用的五种隐喻分别为身体隐喻、战争隐喻、旅行隐喻、植物隐喻和建筑隐喻,它们又分别归属于结构隐喻、方位隐喻和本体隐喻。这表明,商务语篇使用的多是常规隐喻,是在人们普遍认知范围内进行的隐喻映射,意在用习以为常的概念来解释高度抽象且复杂的商业现象。 展开更多
关键词 隐喻 商务语篇 语料库 Wmatrix
原文传递
Generating Chinese named entity data from parallel corpora 被引量:1
15
作者 Ruiji FU Bing QIN Ting LIU 《Frontiers of Computer Science》 SCIE EI CSCD 2014年第4期629-641,共13页
关键词 命名实体识别 双语语料库 中国 平行 训练数据 识别系统 识别标记 标签
原文传递
Regeneration of rat corpora cavernosa tissue by transplantation of CD133+ cells derived from human bone marrow and placement of biodegradable gel sponge sheet
16
作者 Shogo Inoue Katsutoshi Miyamoto +5 位作者 Shunsuke Shinmei Koichi Shoji Jun Teishima Kazuhiro Sentani Wataru Yasui Akio Matsubara 《Asian Journal of Andrology》 SCIE CAS CSCD 2017年第2期203-207,共5页
目的是为通过人的骨头的移植改革阴茎海绵体织物开发一种更容易的技术导出髓的 CD133 + 细胞进一只老鼠阴茎海绵体缺点模型。我们切除了 2 公里千吗??
原文传递
Improved Ant Lion Optimizer with Deep Learning Driven Arabic Hate Speech Detection
17
作者 Abdelwahed Motwakel Badriyya B.Al-onazi +5 位作者 Jaber S.Alzahrani Sana Alazwari Mahmoud Othman Abu Sarwar Zamani Ishfaq Yaseen Amgad Atta Abdelmageed 《Computer Systems Science & Engineering》 SCIE EI 2023年第9期3321-3338,共18页
Arabic is the world’s first language,categorized by its rich and complicated grammatical formats.Furthermore,the Arabic morphology can be perplexing because nearly 10,000 roots and 900 patterns were the basis for ver... Arabic is the world’s first language,categorized by its rich and complicated grammatical formats.Furthermore,the Arabic morphology can be perplexing because nearly 10,000 roots and 900 patterns were the basis for verbs and nouns.The Arabic language consists of distinct variations utilized in a community and particular situations.Social media sites are a medium for expressing opinions and social phenomena like racism,hatred,offensive language,and all kinds of verbal violence.Such conduct does not impact particular nations,communities,or groups only,extending beyond such areas into people’s everyday lives.This study introduces an Improved Ant Lion Optimizer with Deep Learning Dirven Offensive and Hate Speech Detection(IALODL-OHSD)on Arabic Cross-Corpora.The presented IALODL-OHSD model mainly aims to detect and classify offensive/hate speech expressed on social media.In the IALODL-OHSD model,a threestage process is performed,namely pre-processing,word embedding,and classification.Primarily,data pre-processing is performed to transform the Arabic social media text into a useful format.In addition,the word2vec word embedding process is utilized to produce word embeddings.The attentionbased cascaded long short-term memory(ACLSTM)model is utilized for the classification process.Finally,the IALO algorithm is exploited as a hyperparameter optimizer to boost classifier results.To illustrate a brief result analysis of the IALODL-OHSD model,a detailed set of simulations were performed.The extensive comparison study portrayed the enhanced performance of the IALODL-OHSD model over other approaches. 展开更多
关键词 Hate speech offensive speech Arabic corpora natural language processing social networks
下载PDF
基于ERNIE预训练的中医临床病历分类
18
作者 程强 杜中敏 《南阳师范学院学报》 CAS 2023年第1期37-42,共6页
将中医临床病历分为五大类问题,利用Transformers的双向编码器,在训练文本分类器之前,用未标注的临床语料库来微调ERNIE(Traditional Chinese Medicine-ERNIE)模型,精炼出一个针对中医知识领域的TCM-ERNIE模型,该语料库只使用临床记录... 将中医临床病历分为五大类问题,利用Transformers的双向编码器,在训练文本分类器之前,用未标注的临床语料库来微调ERNIE(Traditional Chinese Medicine-ERNIE)模型,精炼出一个针对中医知识领域的TCM-ERNIE模型,该语料库只使用临床记录文本中的汉字作为输入,无须再进行预处理或特征提取.最后采用基准数据集来评估TCM-ERNIE模型和传统文本分类器,取得了89.39%±0.35%的分类精度,Macro F1为88.64%±0.40%,Micro F1为89.39%±0.35%,还采用可视化的方法来显示注意力权重,进一步揭示临床病历文本中的指标性症状. 展开更多
关键词 自然语言处理 临床记录分类 ERNIE 知识领域 中医语料库
下载PDF
Design of Hierarchical Classifier to Improve Speech Emotion Recognition
19
作者 P.Vasuki 《Computer Systems Science & Engineering》 SCIE EI 2023年第1期19-33,共15页
Automatic Speech Emotion Recognition(SER)is used to recognize emotion from speech automatically.Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influen... Automatic Speech Emotion Recognition(SER)is used to recognize emotion from speech automatically.Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influenced by the variations in gender,age,the cultural and acoustical background of the speaker.The acoustical resemblance between emotional expressions further increases the complexity of recognition.Many recent research works are concentrated to address these effects individually.Instead of addressing every influencing attribute individually,we would like to design a system,which reduces the effect that arises on any factor.We propose a two-level Hierarchical classifier named Interpreter of responses(IR).Thefirst level of IR has been realized using Support Vector Machine(SVM)and Gaussian Mixer Model(GMM)classifiers.In the second level of IR,a discriminative SVM classifier has been trained and tested with meta information offirst-level classifiers along with the input acoustical feature vector which is used in primary classifiers.To train the system with a corpus of versatile nature,an integrated emotion corpus has been composed using emotion samples of 5 speech corpora,namely;EMO-DB,IITKGP-SESC,SAVEE Corpus,Spanish emotion corpus,CMU's Woogle corpus.The hierarchical classifier has been trained and tested using MFCC and Low-Level Descriptors(LLD).The empirical analysis shows that the proposed classifier outperforms the traditional classifiers.The proposed ensemble design is very generic and can be adapted even when the number and nature of features change.Thefirst-level classifiers GMM or SVM may be replaced with any other learning algorithm. 展开更多
关键词 Speech emotion recognition hierarchical classifier design ENSEMBLE emotion speech corpora
下载PDF
中华学术外译背景下汉译英学术文本的词汇丰富度研究
20
作者 刘永厚 魏旖旎 《浙江外国语学院学报》 2023年第4期77-83,共7页
近年来,使用平行语料库进行汉译英研究已成为国内翻译学界的一大热点。本研究自建平行语料库,从词汇特征入手,对汉语学术著作《语言符号学》及其英译本,以及英语学术著作Handbook of Semiotics(以下简称Handbook)的词汇丰富度进行了比... 近年来,使用平行语料库进行汉译英研究已成为国内翻译学界的一大热点。本研究自建平行语料库,从词汇特征入手,对汉语学术著作《语言符号学》及其英译本,以及英语学术著作Handbook of Semiotics(以下简称Handbook)的词汇丰富度进行了比较研究。研究结果表明:1)《语言符号学》英译本的词汇多样性与Handbook比较接近,未呈现出词汇范围窄化的倾向;2)相对于原作,《语言符号学》英译本的词汇密度有所降低,连词、介词和代词存在扩增现象;3)《语言符号学》英译本的词汇复杂度低于Handbook,前者的阅读难度相对较低。 展开更多
关键词 中华学术外译 词汇丰富度 平行语料库 翻译简化
下载PDF
上一页 1 2 28 下一页 到第
使用帮助 返回顶部