Analyzing big data, especially medical data, helps to provide good health care to patients and face the risks of death. The COVID-19 pandemic has had a significant impact on public health worldwide, emphasizing the ne...Analyzing big data, especially medical data, helps to provide good health care to patients and face the risks of death. The COVID-19 pandemic has had a significant impact on public health worldwide, emphasizing the need for effective risk prediction models. Machine learning (ML) techniques have shown promise in analyzing complex data patterns and predicting disease outcomes. The accuracy of these techniques is greatly affected by changing their parameters. Hyperparameter optimization plays a crucial role in improving model performance. In this work, the Particle Swarm Optimization (PSO) algorithm was used to effectively search the hyperparameter space and improve the predictive power of the machine learning models by identifying the optimal hyperparameters that can provide the highest accuracy. A dataset with a variety of clinical and epidemiological characteristics linked to COVID-19 cases was used in this study. Various machine learning models, including Random Forests, Decision Trees, Support Vector Machines, and Neural Networks, were utilized to capture the complex relationships present in the data. To evaluate the predictive performance of the models, the accuracy metric was employed. The experimental findings showed that the suggested method of estimating COVID-19 risk is effective. When compared to baseline models, the optimized machine learning models performed better and produced better results.展开更多
The application of single-cell RNA sequencing(scRNA-seq)in biomedical research has advanced our understanding of the pathogenesis of disease and provided valuable insights into new diagnostic and therapeutic strategie...The application of single-cell RNA sequencing(scRNA-seq)in biomedical research has advanced our understanding of the pathogenesis of disease and provided valuable insights into new diagnostic and therapeutic strategies.With the expansion of capacity for high-throughput scRNA-seq,including clinical samples,the analysis of these huge volumes of data has become a daunting prospect for researchers entering this field.Here,we review the workflow for typical scRNA-seq data analysis,covering raw data processing and quality control,basic data analysis applicable for almost all scRNA-seq data sets,and advanced data analysis that should be tailored to specific scientific questions.While summarizing the current methods for each analysis step,we also provide an online repository of software and wrapped-up scripts to support the implementation.Recommendations and caveats are pointed out for some specific analysis tasks and approaches.We hope this resource will be helpful to researchers engaging with scRNA-seq,in particular for emerging clinical applications.展开更多
As COVID-19 poses a major threat to people’s health and economy,there is an urgent need for forecasting methodologies that can anticipate its trajectory efficiently.In non-stationary time series forecasting jobs,ther...As COVID-19 poses a major threat to people’s health and economy,there is an urgent need for forecasting methodologies that can anticipate its trajectory efficiently.In non-stationary time series forecasting jobs,there is frequently a hysteresis in the anticipated values relative to the real values.The multilayer deep-time convolutional network and a feature fusion network are combined in this paper’s proposal of an enhanced Multilayer Deep Time Convolutional Neural Network(MDTCNet)for COVID-19 prediction to address this problem.In particular,it is possible to record the deep features and temporal dependencies in uncertain time series,and the features may then be combined using a feature fusion network and a multilayer perceptron.Last but not least,the experimental verification is conducted on the prediction task of COVID-19 real daily confirmed cases in the world and the United States with uncertainty,realizing the short-term and long-term prediction of COVID-19 daily confirmed cases,and verifying the effectiveness and accuracy of the suggested prediction method,as well as reducing the hysteresis of the prediction results.展开更多
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep...Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.展开更多
Human living would be impossible without air quality. Consistent advancements in practically every aspect of contemporary human life have harmed air quality. Everyday industrial, transportation, and home activities tu...Human living would be impossible without air quality. Consistent advancements in practically every aspect of contemporary human life have harmed air quality. Everyday industrial, transportation, and home activities turn up dangerous contaminants in our surroundings. This study investigated two years’ worth of air quality and outlier detection data from two Indian cities. Studies on air pollution have used numerous types of methodologies, with various gases being seen as a vector whose components include gas concentration values for each observation per-formed. We use curves to represent the monthly average of daily gas emissions in our technique. The approach, which is based on functional depth, was used to find outliers in the city of Delhi and Kolkata’s gas emissions, and the outcomes were compared to those from the traditional method. In the evaluation and comparison of these models’ performances, the functional approach model studied well.展开更多
This article presents a comprehensive analysis of the current state of research on the English translation of Lu You’s poetry, utilizing a data sample comprising research papers published in the CNKI Full-text Databa...This article presents a comprehensive analysis of the current state of research on the English translation of Lu You’s poetry, utilizing a data sample comprising research papers published in the CNKI Full-text Database from 2001 to 2022. Employing rigorous longitudinal statistical methods, the study examines the progress achieved over the past two decades. Notably, domestic researchers have displayed considerable interest in the study of Lu You’s English translation works since 2001. The research on the English translation of Lu You’s poetry reveals a diverse range of perspectives, indicating a rich body of scholarship. However, several challenges persist, including insufficient research, limited translation coverage, and a noticeable focus on specific poems such as “Phoenix Hairpin” in the realm of English translation research. Consequently, there is ample room for improvement in the quality of research output on the English translation of Lu You’s poems, as well as its recognition within the academic community. Building on these findings, it is argued that future investigations pertaining to the English translation of Lu You’s poetry should transcend the boundaries of textual analysis and encompass broader theoretical perspectives and research methodologies. By undertaking this shift, scholars will develop a more profound comprehension of Lu You’s poetic works and make substantive contributions to the field of translation studies. Thus, this article aims to bridge the gap between past research endeavors and future possibilities, serving as a guide and inspiration for scholars to embark on a more nuanced and enriching exploration of Lu You’s poetry as well as other Chinese literature classics.展开更多
The scientists are dedicated to studying the detection of Alzheimer’s disease onset to find a cure, or at the very least, medication that can slow the progression of the disease. This article explores the effectivene...The scientists are dedicated to studying the detection of Alzheimer’s disease onset to find a cure, or at the very least, medication that can slow the progression of the disease. This article explores the effectiveness of longitudinal data analysis, artificial intelligence, and machine learning approaches based on magnetic resonance imaging and positron emission tomography neuroimaging modalities for progression estimation and the detection of Alzheimer’s disease onset. The significance of feature extraction in highly complex neuroimaging data, identification of vulnerable brain regions, and the determination of the threshold values for plaques, tangles, and neurodegeneration of these regions will extensively be evaluated. Developing automated methods to improve the aforementioned research areas would enable specialists to determine the progression of the disease and find the link between the biomarkers and more accurate detection of Alzheimer’s disease onset.展开更多
Standalone systems cannot handle the giant traffic loads generated by Twitter due to memory constraints.A parallel computational environment pro-vided by Apache Hadoop can distribute and process the data over differen...Standalone systems cannot handle the giant traffic loads generated by Twitter due to memory constraints.A parallel computational environment pro-vided by Apache Hadoop can distribute and process the data over different desti-nation systems.In this paper,the Hadoop cluster with four nodes integrated with RHadoop,Flume,and Hive is created to analyze the tweets gathered from the Twitter stream.Twitter stream data is collected relevant to an event/topic like IPL-2015,cricket,Royal Challengers Bangalore,Kohli,Modi,from May 24 to 30,2016 using Flume.Hive is used as a data warehouse to store the streamed tweets.Twitter analytics like maximum number of tweets by users,the average number of followers,and maximum number of friends are obtained using Hive.The network graph is constructed with the user’s unique screen name and men-tions using‘R’.A timeline graph of individual users is generated using‘R’.Also,the proposed solution analyses the emotions of cricket fans by classifying their Twitter messages into appropriate emotional categories using the optimized sup-port vector neural network(OSVNN)classification model.To attain better classi-fication accuracy,the performance of SVNN is enhanced using a chimp optimization algorithm(ChOA).Extracting the users’emotions toward an event is beneficial for prediction,but when coupled with visualizations,it becomes more powerful.Bar-chart and wordcloud are generated to visualize the emotional analysis results.展开更多
With the successful holding of Beijing Winter Olympic Games in 2022,the planning and cultural services of Shijingshan District have been reviewed,but the systematic planning theory has not been fully applied.Through t...With the successful holding of Beijing Winter Olympic Games in 2022,the planning and cultural services of Shijingshan District have been reviewed,but the systematic planning theory has not been fully applied.Through the big data research method,the location advantages and disadvantages of Shijingshan District were analyzed,and the distribution of its cultural facilities was defined.Feasible optimization schemes were proposed according to its advantages and disadvantages as well as the experience and conditions of the Winter Olympics.展开更多
Purpose:To explore the traditional Chinese medicine(TCM)regulates ferroptosis key genes in the occurrence and development of lupus nephritis(LN)based on biological information database.Patients and methods:Ferroptosis...Purpose:To explore the traditional Chinese medicine(TCM)regulates ferroptosis key genes in the occurrence and development of lupus nephritis(LN)based on biological information database.Patients and methods:Ferroptosis related genes were identified based on FerrDb database and literature retrieval.Used the OMIM,Gene Cards,Drug Bank to obtain the targets of LN.Cytoscapes 3.8.2 software and STRING database were used to analyze protein-protein interaction(PPI)network.Metacape software and Weishengxin were used to analyze the gene ontology(GO)classification and Kyoto encyclopedia of genes and genomes(KEGG)pathway enrichment analysis.UniProt Database and Traditional Chinese Systems Pharmacology Database and Analysis Platform analysis platform were used to obtain the data table of key TCM and related targets.Cytoscapes 3.8.2 software was used to analyze the PPI network.Results:A total of 401 ferroptosis-related genes,361 LN related genes and 21“Ferroptosis-LN”intersection genes were obtained.Ferroptosis in the occurrence and prognosis of LN mainly involved the inflammatory response,cell activation,positive regulation of chemokine production and it was mainly involved in necroptosis,inflammatory bowel disease,ferroptosis and other pathways.A total of 412 TCMs containing key genes of“Ferroptosis-LN”were acquired.The most key genes were contained in Mahuang,Gehua,Baiguo,Chuanniuxi,Jinyinhua.15 key genes of“TCM-LN”were obtained.5 ferroptosis-related key genes in LN regulated by TCM were obtained,which were IL1β,TLR4,IFNG,STAT3 and HMOX1.Conclusion:TCM,such as Mahuang,Gehua,Baiguo,Chuanniuxi,Jinyinhua,may affect the occurrence and development of LN through the key ferroptosis genes,such as IL1B,TLR4,IFNG,STAT3 and HMOX1.展开更多
Paperless reading has become a prevalent trend among global readers,leading to the accumulation of vast amounts of reading data on numerous book websites.This offers new perspectives for studying translated works.This...Paperless reading has become a prevalent trend among global readers,leading to the accumulation of vast amounts of reading data on numerous book websites.This offers new perspectives for studying translated works.This paper utilizes Python-based data processing technology to collect and analyze reader reviews of Romance of the Three Kingdoms on Amazon and Goodreads,presenting trends in review volume,word cloud maps,and readers’emotional attitudes in a quantitative manner.The findings indicate that overseas readers generally exhibit a positive emotional tendency towards Romance of the Three Kingdoms and recognize its cultural value.However,negative opinions do exist,focusing on aspects of the book’s quality,such as printing quality and proofreading.These results provide valuable insights for the foreign translation of canonical texts.展开更多
With the emphasis on using technological innovation and digital transformation to generate new development momentum,the Ministry of Education issued the“Code for the Construction of Digital Campuses in Colleges and U...With the emphasis on using technological innovation and digital transformation to generate new development momentum,the Ministry of Education issued the“Code for the Construction of Digital Campuses in Colleges and Universities”in March 2021,which clarified the talent training path for innovative exploration of education mode under the informatization condition.In the process of exploring new methods of talent training models in the digital age,this article uses employment big data to enter the classroom,triggering changes in classroom teaching interaction methods,and then accumulates summary records of a series of research experience,in the hope that more research can extend on this result.展开更多
RNA-sequencing(RNA-seq),based on next-generation sequencing technologies,has rapidly become a standard and popular technology for transcriptome analysis.However,serious challenges still exist in analyzing and interpre...RNA-sequencing(RNA-seq),based on next-generation sequencing technologies,has rapidly become a standard and popular technology for transcriptome analysis.However,serious challenges still exist in analyzing and interpreting the RNA-seq data.With the development of high-throughput sequencing technology,the sequencing depth of RNA-seq data increases explosively.The intricate biological process of transcriptome is more complicated and diversified beyond our imagination.Moreover,most of the remaining organisms still have no available reference genome or have only incomplete genome annotations.Therefore,a large number of bioinformatics methods for various transcriptomics studies are proposed to effectively settle these challenges.This review comprehensively summarizes the various studies in RNA-seq data analysis and their corresponding analysis methods,including genome annotation,quality control and pre-processing of reads,read alignment,transcriptome assembly,gene and isoform expression quantification,differential expression analysis,data visualization and other analyses.展开更多
Under industry 4.0, internet of things(IoT), especially radio frequency identification(RFID) technology, has been widely applied in manufacturing environment. This technology can bring convenience to production contro...Under industry 4.0, internet of things(IoT), especially radio frequency identification(RFID) technology, has been widely applied in manufacturing environment. This technology can bring convenience to production control and production transparency. Meanwhile, it generates increasing production data that are sometimes discrete, uncorrelated, and hard-to-use. Thus,an efficient analysis method is needed to utilize the invaluable data. This work provides an RFID-based production data analysis method for production control in Io T-enabled smart job-shops.The physical configuration and operation logic of Io T-enabled smart job-shop production are firstly described. Based on that,an RFID-based production data model is built to formalize and correlate the heterogeneous production data. Then, an eventdriven RFID-based production data analysis method is proposed to construct the RFID events and judge the process command execution. Furthermore, a near big data approach is used to excavate hidden information and knowledge from the historical production data. A demonstrative case is studied to verify the feasibility of the proposed model and methods. It is expected that our work will provide a different insight into the RFIDbased production data analysis.展开更多
This paper presents the development and application of a production data analysis software that can analyze and forecast the production performance and reservoir properties of shale gas wells.The theories used in the ...This paper presents the development and application of a production data analysis software that can analyze and forecast the production performance and reservoir properties of shale gas wells.The theories used in the study were based on the analytical and empirical approaches.Its reliability has been confirmed through comparisons with a commercial software.Using transient data relating to multi-stage hydraulic fractured horizontal wells,it was confirmed that the accuracy of the modified hyperbolic method showed an error of approximately 4%compared to the actual estimated ultimate recovery(EUR).On the basis of the developed model,reliable productivity forecasts have been obtained by analyzing field production data relating to wells in Canada.The EUR was computed as 9.6 Bcf using the modified hyperbolic method.Employing the Pow Law Exponential method,the EUR would be 9.4 Bcf.The models developed in this study will allow in the future integration of new analytical and empirical theories in a relatively readily than commercial models.展开更多
A new dynamic model identification method is developed for continuous-time series analysis and forward prediction applications. The quantum of data is defined over moving time intervals in sliding window coordinates f...A new dynamic model identification method is developed for continuous-time series analysis and forward prediction applications. The quantum of data is defined over moving time intervals in sliding window coordinates for compressing the size of stored data while retaining the resolution of information. Quantum vectors are introduced as the basis of a linear space for defining a Dynamic Quantum Operator (DQO) model of the system defined by its data stream. The transport of the quantum of compressed data is modeled between the time interval bins during the movement of the sliding time window. The DQO model is identified from the samples of the real-time flow of data over the sliding time window. A least-square-fit identification method is used for evaluating the parameters of the quantum operator model, utilizing the repeated use of the sampled data through a number of time steps. The method is tested to analyze, and forward-predict air temperature variations accessed from weather data as well as methane concentration variations obtained from measurements of an operating mine. The results show efficient forward prediction capabilities, surpassing those using neural networks and other methods for the same task.展开更多
The issue of privacy protection for mobile social networks is a frontier topic in the field of social network applications.The existing researches on user privacy protection in mobile social network mainly focus on pr...The issue of privacy protection for mobile social networks is a frontier topic in the field of social network applications.The existing researches on user privacy protection in mobile social network mainly focus on privacy preserving data publishing and access control.There is little research on the association of user privacy information,so it is not easy to design personalized privacy protection strategy,but also increase the complexity of user privacy settings.Therefore,this paper concentrates on the association of user privacy information taking big data analysis tools,so as to provide data support for personalized privacy protection strategy design.展开更多
A factor analysis was applied to soil geochemical data to define anomalies related to buried Pb-Zn mineralization.A favorable main factor with a strong association of the elements Zn,Cu and Pb,related to mineralizatio...A factor analysis was applied to soil geochemical data to define anomalies related to buried Pb-Zn mineralization.A favorable main factor with a strong association of the elements Zn,Cu and Pb,related to mineralization,was selected for interpretation.The median+2 MAD(median absolute deviation)method of exploratory data analysis(EDA)and C-A(concentration-area)fractal modeling were then applied to the Mahalanobis distance,as defined by Zn,Cu and Pb from the factor analysis to set the thresholds for defining multi-element anomalies.As a result,the median+2 MAD method more successfully identified the Pb-Zn mineralization than the C-A fractal model.The soil anomaly identified by the median+2 MAD method on the Mahalanobis distances defined by three principal elements(Zn,Cu and Pb)rather than thirteen elements(Co,Zn,Cu,V,Mo,Ni,Cr,Mn,Pb,Ba,Sr,Zr and Ti)was the more favorable reflection of the ore body.The identified soil geochemical anomalies were compared with the in situ economic Pb-Zn ore bodies for validation.The results showed that the median+2 MAD approach is capable of mapping both strong and weak geochemical anomalies related to buried Pb-Zn mineralization,which is therefore useful at the reconnaissance drilling stage.展开更多
文摘Analyzing big data, especially medical data, helps to provide good health care to patients and face the risks of death. The COVID-19 pandemic has had a significant impact on public health worldwide, emphasizing the need for effective risk prediction models. Machine learning (ML) techniques have shown promise in analyzing complex data patterns and predicting disease outcomes. The accuracy of these techniques is greatly affected by changing their parameters. Hyperparameter optimization plays a crucial role in improving model performance. In this work, the Particle Swarm Optimization (PSO) algorithm was used to effectively search the hyperparameter space and improve the predictive power of the machine learning models by identifying the optimal hyperparameters that can provide the highest accuracy. A dataset with a variety of clinical and epidemiological characteristics linked to COVID-19 cases was used in this study. Various machine learning models, including Random Forests, Decision Trees, Support Vector Machines, and Neural Networks, were utilized to capture the complex relationships present in the data. To evaluate the predictive performance of the models, the accuracy metric was employed. The experimental findings showed that the suggested method of estimating COVID-19 risk is effective. When compared to baseline models, the optimized machine learning models performed better and produced better results.
基金suppor ted by the National Key Research and Development Program of China (2022YFC2702502)the National Natural Science Foundation of China (32170742, 31970646, and 32060152)+7 种基金the Start Fund for Specially Appointed Professor of Jiangsu ProvinceHainan Province Science and Technology Special Fund (ZDYF2021SHFZ051)the Natural Science Foundation of Hainan Province (820MS053)the Start Fund for High-level Talents of Nanjing Medical University (NMUR2020009)the Marshal Initiative Funding of Hainan Medical University (JBGS202103)the Hainan Province Clinical Medical Center (QWYH202175)the Bioinformatics for Major Diseases Science Innovation Group of Hainan Medical Universitythe Shenzhen Science and Technology Program (JCYJ20210324140407021)
文摘The application of single-cell RNA sequencing(scRNA-seq)in biomedical research has advanced our understanding of the pathogenesis of disease and provided valuable insights into new diagnostic and therapeutic strategies.With the expansion of capacity for high-throughput scRNA-seq,including clinical samples,the analysis of these huge volumes of data has become a daunting prospect for researchers entering this field.Here,we review the workflow for typical scRNA-seq data analysis,covering raw data processing and quality control,basic data analysis applicable for almost all scRNA-seq data sets,and advanced data analysis that should be tailored to specific scientific questions.While summarizing the current methods for each analysis step,we also provide an online repository of software and wrapped-up scripts to support the implementation.Recommendations and caveats are pointed out for some specific analysis tasks and approaches.We hope this resource will be helpful to researchers engaging with scRNA-seq,in particular for emerging clinical applications.
基金supported by the major scientific and technological research project of Chongqing Education Commission(KJZD-M202000802)The first batch of Industrial and Informatization Key Special Fund Support Projects in Chongqing in 2022(2022000537).
文摘As COVID-19 poses a major threat to people’s health and economy,there is an urgent need for forecasting methodologies that can anticipate its trajectory efficiently.In non-stationary time series forecasting jobs,there is frequently a hysteresis in the anticipated values relative to the real values.The multilayer deep-time convolutional network and a feature fusion network are combined in this paper’s proposal of an enhanced Multilayer Deep Time Convolutional Neural Network(MDTCNet)for COVID-19 prediction to address this problem.In particular,it is possible to record the deep features and temporal dependencies in uncertain time series,and the features may then be combined using a feature fusion network and a multilayer perceptron.Last but not least,the experimental verification is conducted on the prediction task of COVID-19 real daily confirmed cases in the world and the United States with uncertainty,realizing the short-term and long-term prediction of COVID-19 daily confirmed cases,and verifying the effectiveness and accuracy of the suggested prediction method,as well as reducing the hysteresis of the prediction results.
文摘Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.
文摘Human living would be impossible without air quality. Consistent advancements in practically every aspect of contemporary human life have harmed air quality. Everyday industrial, transportation, and home activities turn up dangerous contaminants in our surroundings. This study investigated two years’ worth of air quality and outlier detection data from two Indian cities. Studies on air pollution have used numerous types of methodologies, with various gases being seen as a vector whose components include gas concentration values for each observation per-formed. We use curves to represent the monthly average of daily gas emissions in our technique. The approach, which is based on functional depth, was used to find outliers in the city of Delhi and Kolkata’s gas emissions, and the outcomes were compared to those from the traditional method. In the evaluation and comparison of these models’ performances, the functional approach model studied well.
文摘This article presents a comprehensive analysis of the current state of research on the English translation of Lu You’s poetry, utilizing a data sample comprising research papers published in the CNKI Full-text Database from 2001 to 2022. Employing rigorous longitudinal statistical methods, the study examines the progress achieved over the past two decades. Notably, domestic researchers have displayed considerable interest in the study of Lu You’s English translation works since 2001. The research on the English translation of Lu You’s poetry reveals a diverse range of perspectives, indicating a rich body of scholarship. However, several challenges persist, including insufficient research, limited translation coverage, and a noticeable focus on specific poems such as “Phoenix Hairpin” in the realm of English translation research. Consequently, there is ample room for improvement in the quality of research output on the English translation of Lu You’s poems, as well as its recognition within the academic community. Building on these findings, it is argued that future investigations pertaining to the English translation of Lu You’s poetry should transcend the boundaries of textual analysis and encompass broader theoretical perspectives and research methodologies. By undertaking this shift, scholars will develop a more profound comprehension of Lu You’s poetic works and make substantive contributions to the field of translation studies. Thus, this article aims to bridge the gap between past research endeavors and future possibilities, serving as a guide and inspiration for scholars to embark on a more nuanced and enriching exploration of Lu You’s poetry as well as other Chinese literature classics.
文摘The scientists are dedicated to studying the detection of Alzheimer’s disease onset to find a cure, or at the very least, medication that can slow the progression of the disease. This article explores the effectiveness of longitudinal data analysis, artificial intelligence, and machine learning approaches based on magnetic resonance imaging and positron emission tomography neuroimaging modalities for progression estimation and the detection of Alzheimer’s disease onset. The significance of feature extraction in highly complex neuroimaging data, identification of vulnerable brain regions, and the determination of the threshold values for plaques, tangles, and neurodegeneration of these regions will extensively be evaluated. Developing automated methods to improve the aforementioned research areas would enable specialists to determine the progression of the disease and find the link between the biomarkers and more accurate detection of Alzheimer’s disease onset.
文摘Standalone systems cannot handle the giant traffic loads generated by Twitter due to memory constraints.A parallel computational environment pro-vided by Apache Hadoop can distribute and process the data over different desti-nation systems.In this paper,the Hadoop cluster with four nodes integrated with RHadoop,Flume,and Hive is created to analyze the tweets gathered from the Twitter stream.Twitter stream data is collected relevant to an event/topic like IPL-2015,cricket,Royal Challengers Bangalore,Kohli,Modi,from May 24 to 30,2016 using Flume.Hive is used as a data warehouse to store the streamed tweets.Twitter analytics like maximum number of tweets by users,the average number of followers,and maximum number of friends are obtained using Hive.The network graph is constructed with the user’s unique screen name and men-tions using‘R’.A timeline graph of individual users is generated using‘R’.Also,the proposed solution analyses the emotions of cricket fans by classifying their Twitter messages into appropriate emotional categories using the optimized sup-port vector neural network(OSVNN)classification model.To attain better classi-fication accuracy,the performance of SVNN is enhanced using a chimp optimization algorithm(ChOA).Extracting the users’emotions toward an event is beneficial for prediction,but when coupled with visualizations,it becomes more powerful.Bar-chart and wordcloud are generated to visualize the emotional analysis results.
基金Sponsored by the General Project of Beijing Higher Education Association in 2022(MS2022414)Innovation and Entrepreneurship Training Planning Project for University Students in 2023。
文摘With the successful holding of Beijing Winter Olympic Games in 2022,the planning and cultural services of Shijingshan District have been reviewed,but the systematic planning theory has not been fully applied.Through the big data research method,the location advantages and disadvantages of Shijingshan District were analyzed,and the distribution of its cultural facilities was defined.Feasible optimization schemes were proposed according to its advantages and disadvantages as well as the experience and conditions of the Winter Olympics.
基金supported by National Natural Science Foundation of China[grant numbers 8216087881960866].
文摘Purpose:To explore the traditional Chinese medicine(TCM)regulates ferroptosis key genes in the occurrence and development of lupus nephritis(LN)based on biological information database.Patients and methods:Ferroptosis related genes were identified based on FerrDb database and literature retrieval.Used the OMIM,Gene Cards,Drug Bank to obtain the targets of LN.Cytoscapes 3.8.2 software and STRING database were used to analyze protein-protein interaction(PPI)network.Metacape software and Weishengxin were used to analyze the gene ontology(GO)classification and Kyoto encyclopedia of genes and genomes(KEGG)pathway enrichment analysis.UniProt Database and Traditional Chinese Systems Pharmacology Database and Analysis Platform analysis platform were used to obtain the data table of key TCM and related targets.Cytoscapes 3.8.2 software was used to analyze the PPI network.Results:A total of 401 ferroptosis-related genes,361 LN related genes and 21“Ferroptosis-LN”intersection genes were obtained.Ferroptosis in the occurrence and prognosis of LN mainly involved the inflammatory response,cell activation,positive regulation of chemokine production and it was mainly involved in necroptosis,inflammatory bowel disease,ferroptosis and other pathways.A total of 412 TCMs containing key genes of“Ferroptosis-LN”were acquired.The most key genes were contained in Mahuang,Gehua,Baiguo,Chuanniuxi,Jinyinhua.15 key genes of“TCM-LN”were obtained.5 ferroptosis-related key genes in LN regulated by TCM were obtained,which were IL1β,TLR4,IFNG,STAT3 and HMOX1.Conclusion:TCM,such as Mahuang,Gehua,Baiguo,Chuanniuxi,Jinyinhua,may affect the occurrence and development of LN through the key ferroptosis genes,such as IL1B,TLR4,IFNG,STAT3 and HMOX1.
基金funded by the Teacher Development Research Project of USST(Project Fund No.:CFTD2023YB21).
文摘Paperless reading has become a prevalent trend among global readers,leading to the accumulation of vast amounts of reading data on numerous book websites.This offers new perspectives for studying translated works.This paper utilizes Python-based data processing technology to collect and analyze reader reviews of Romance of the Three Kingdoms on Amazon and Goodreads,presenting trends in review volume,word cloud maps,and readers’emotional attitudes in a quantitative manner.The findings indicate that overseas readers generally exhibit a positive emotional tendency towards Romance of the Three Kingdoms and recognize its cultural value.However,negative opinions do exist,focusing on aspects of the book’s quality,such as printing quality and proofreading.These results provide valuable insights for the foreign translation of canonical texts.
基金2021 Jilin Province Higher Education Teaching Reform Research Topic“Research and Practice on the Integration of Employment Big Data Analysis into the Classroom for the Training of Professional Talents”(2021S006)。
文摘With the emphasis on using technological innovation and digital transformation to generate new development momentum,the Ministry of Education issued the“Code for the Construction of Digital Campuses in Colleges and Universities”in March 2021,which clarified the talent training path for innovative exploration of education mode under the informatization condition.In the process of exploring new methods of talent training models in the digital age,this article uses employment big data to enter the classroom,triggering changes in classroom teaching interaction methods,and then accumulates summary records of a series of research experience,in the hope that more research can extend on this result.
文摘RNA-sequencing(RNA-seq),based on next-generation sequencing technologies,has rapidly become a standard and popular technology for transcriptome analysis.However,serious challenges still exist in analyzing and interpreting the RNA-seq data.With the development of high-throughput sequencing technology,the sequencing depth of RNA-seq data increases explosively.The intricate biological process of transcriptome is more complicated and diversified beyond our imagination.Moreover,most of the remaining organisms still have no available reference genome or have only incomplete genome annotations.Therefore,a large number of bioinformatics methods for various transcriptomics studies are proposed to effectively settle these challenges.This review comprehensively summarizes the various studies in RNA-seq data analysis and their corresponding analysis methods,including genome annotation,quality control and pre-processing of reads,read alignment,transcriptome assembly,gene and isoform expression quantification,differential expression analysis,data visualization and other analyses.
基金supported by the National Natural Science Foundation of China(71571142,51275396)
文摘Under industry 4.0, internet of things(IoT), especially radio frequency identification(RFID) technology, has been widely applied in manufacturing environment. This technology can bring convenience to production control and production transparency. Meanwhile, it generates increasing production data that are sometimes discrete, uncorrelated, and hard-to-use. Thus,an efficient analysis method is needed to utilize the invaluable data. This work provides an RFID-based production data analysis method for production control in Io T-enabled smart job-shops.The physical configuration and operation logic of Io T-enabled smart job-shop production are firstly described. Based on that,an RFID-based production data model is built to formalize and correlate the heterogeneous production data. Then, an eventdriven RFID-based production data analysis method is proposed to construct the RFID events and judge the process command execution. Furthermore, a near big data approach is used to excavate hidden information and knowledge from the historical production data. A demonstrative case is studied to verify the feasibility of the proposed model and methods. It is expected that our work will provide a different insight into the RFIDbased production data analysis.
基金supported by the Energy Efficiency&Resources Core Technology Program of the Korea Institute of Energy Technology Evaluation and Planning(KETEP)granted financial resource from the Ministry of Trade,Industry&Energy,Republic of Korea(No.20172510102090).
文摘This paper presents the development and application of a production data analysis software that can analyze and forecast the production performance and reservoir properties of shale gas wells.The theories used in the study were based on the analytical and empirical approaches.Its reliability has been confirmed through comparisons with a commercial software.Using transient data relating to multi-stage hydraulic fractured horizontal wells,it was confirmed that the accuracy of the modified hyperbolic method showed an error of approximately 4%compared to the actual estimated ultimate recovery(EUR).On the basis of the developed model,reliable productivity forecasts have been obtained by analyzing field production data relating to wells in Canada.The EUR was computed as 9.6 Bcf using the modified hyperbolic method.Employing the Pow Law Exponential method,the EUR would be 9.4 Bcf.The models developed in this study will allow in the future integration of new analytical and empirical theories in a relatively readily than commercial models.
文摘A new dynamic model identification method is developed for continuous-time series analysis and forward prediction applications. The quantum of data is defined over moving time intervals in sliding window coordinates for compressing the size of stored data while retaining the resolution of information. Quantum vectors are introduced as the basis of a linear space for defining a Dynamic Quantum Operator (DQO) model of the system defined by its data stream. The transport of the quantum of compressed data is modeled between the time interval bins during the movement of the sliding time window. The DQO model is identified from the samples of the real-time flow of data over the sliding time window. A least-square-fit identification method is used for evaluating the parameters of the quantum operator model, utilizing the repeated use of the sampled data through a number of time steps. The method is tested to analyze, and forward-predict air temperature variations accessed from weather data as well as methane concentration variations obtained from measurements of an operating mine. The results show efficient forward prediction capabilities, surpassing those using neural networks and other methods for the same task.
基金We thank the anonymous reviewers and editors for their very constructive comments.the National Social Science Foundation Project of China under Grant 16BTQ085.
文摘The issue of privacy protection for mobile social networks is a frontier topic in the field of social network applications.The existing researches on user privacy protection in mobile social network mainly focus on privacy preserving data publishing and access control.There is little research on the association of user privacy information,so it is not easy to design personalized privacy protection strategy,but also increase the complexity of user privacy settings.Therefore,this paper concentrates on the association of user privacy information taking big data analysis tools,so as to provide data support for personalized privacy protection strategy design.
文摘A factor analysis was applied to soil geochemical data to define anomalies related to buried Pb-Zn mineralization.A favorable main factor with a strong association of the elements Zn,Cu and Pb,related to mineralization,was selected for interpretation.The median+2 MAD(median absolute deviation)method of exploratory data analysis(EDA)and C-A(concentration-area)fractal modeling were then applied to the Mahalanobis distance,as defined by Zn,Cu and Pb from the factor analysis to set the thresholds for defining multi-element anomalies.As a result,the median+2 MAD method more successfully identified the Pb-Zn mineralization than the C-A fractal model.The soil anomaly identified by the median+2 MAD method on the Mahalanobis distances defined by three principal elements(Zn,Cu and Pb)rather than thirteen elements(Co,Zn,Cu,V,Mo,Ni,Cr,Mn,Pb,Ba,Sr,Zr and Ti)was the more favorable reflection of the ore body.The identified soil geochemical anomalies were compared with the in situ economic Pb-Zn ore bodies for validation.The results showed that the median+2 MAD approach is capable of mapping both strong and weak geochemical anomalies related to buried Pb-Zn mineralization,which is therefore useful at the reconnaissance drilling stage.