Log anomaly detection is an important paradigm for system troubleshooting.Existing log anomaly detection based on Long Short-Term Memory(LSTM)networks is time-consuming to handle long sequences.Transformer model is in...Log anomaly detection is an important paradigm for system troubleshooting.Existing log anomaly detection based on Long Short-Term Memory(LSTM)networks is time-consuming to handle long sequences.Transformer model is introduced to promote efficiency.However,most existing Transformer-based log anomaly detection methods convert unstructured log messages into structured templates by log parsing,which introduces parsing errors.They only extract simple semantic feature,which ignores other features,and are generally supervised,relying on the amount of labeled data.To overcome the limitations of existing methods,this paper proposes a novel unsupervised log anomaly detection method based on multi-feature(UMFLog).UMFLog includes two sub-models to consider two kinds of features:semantic feature and statistical feature,respectively.UMFLog applies the log original content with detailed parameters instead of templates or template IDs to avoid log parsing errors.In the first sub-model,UMFLog uses Bidirectional Encoder Representations from Transformers(BERT)instead of random initialization to extract effective semantic feature,and an unsupervised hypersphere-based Transformer model to learn compact log sequence representations and obtain anomaly candidates.In the second sub-model,UMFLog exploits a statistical feature-based Variational Autoencoder(VAE)about word occurrence times to identify the final anomaly from anomaly candidates.Extensive experiments and evaluations are conducted on three real public log datasets.The results show that UMFLog significantly improves F1-scores compared to the state-of-the-art(SOTA)methods because of the multi-feature.展开更多
Sensors produce a large amount of multivariate time series data to record the states of Internet of Things(IoT)systems.Multivariate time series timestamp anomaly detection(TSAD)can identify timestamps of attacks and m...Sensors produce a large amount of multivariate time series data to record the states of Internet of Things(IoT)systems.Multivariate time series timestamp anomaly detection(TSAD)can identify timestamps of attacks and malfunctions.However,it is necessary to determine which sensor or indicator is abnormal to facilitate a more detailed diagnosis,a process referred to as fine-grained anomaly detection(FGAD).Although further FGAD can be extended based on TSAD methods,existing works do not provide a quantitative evaluation,and the performance is unknown.Therefore,to tackle the FGAD problem,this paper first verifies that the TSAD methods achieve low performance when applied to the FGAD task directly because of the excessive fusion of features and the ignoring of the relationship’s dynamic changes between indicators.Accordingly,this paper proposes a mul-tivariate time series fine-grained anomaly detection(MFGAD)framework.To avoid excessive fusion of features,MFGAD constructs two sub-models to independently identify the abnormal timestamp and abnormal indicator instead of a single model and then combines the two kinds of abnormal results to detect the fine-grained anomaly.Based on this framework,an algorithm based on Graph Attention Neural Network(GAT)and Attention Convolutional Long-Short Term Memory(A-ConvLSTM)is proposed,in which GAT learns temporal features of multiple indicators to detect abnormal timestamps and A-ConvLSTM captures the dynamic relationship between indicators to identify abnormal indicators.Extensive simulations on a real-world dataset demonstrate that the proposed algorithm can achieve a higher F1 score and hit rate than the extension of existing TSAD methods with the benefit of two independent sub-models for timestamp and indicator detection.展开更多
Diagnostic methods for urothelial carcinomas(UCs)are often invasive or have suboptimal accuracy.Methylation of exfoliated cell DNA or cell-free DNA in urine has shown great promise in the diagnosis of UCs.However,most...Diagnostic methods for urothelial carcinomas(UCs)are often invasive or have suboptimal accuracy.Methylation of exfoliated cell DNA or cell-free DNA in urine has shown great promise in the diagnosis of UCs.However,most current studies have focused on bladder cancer(BCa),and only a few high-plex methylated DNA panels based on large-volume urine have been reported to exhibit both high sensitivity and specificity.[1,2]The purpose of this study was to identify universal biomarkers for BCa and upper tract urothelial carcinoma(UTUC)using a small volume of urine.We developed a dual-target diagnostic panel comprising the novel marker AL021918.2 and the well-known BCa biomarker Vimentin(VIM).This panel can accurately detect UCs using only 1.8 mL of full-voided urine.展开更多
With the popularity of deep learning tools in image decomposition and natural language processing,how to support and store a large number of parameters required by deep learning algorithms has become an urgent problem...With the popularity of deep learning tools in image decomposition and natural language processing,how to support and store a large number of parameters required by deep learning algorithms has become an urgent problem to be solved.These parameters are huge and can be as many as millions.At present,a feasible direction is to use the sparse representation technique to compress the parameter matrix to achieve the purpose of reducing parameters and reducing the storage pressure.These methods include matrix decomposition and tensor decomposition.To let vector take advance of the compressing performance of matrix decomposition and tensor decomposition,we use reshaping and unfolding to let vector be the input and output of Tensor-Factorized Neural Networks.We analyze how reshaping can get the best compress ratio.According to the relationship between the shape of tensor and the number of parameters,we get a lower bound of the number of parameters.We take some data sets to verify the lower bound.展开更多
System logs record detailed information about system operation and areimportant for analyzing the system's operational status and performance. Rapidand accurate detection of system anomalies is of great significan...System logs record detailed information about system operation and areimportant for analyzing the system's operational status and performance. Rapidand accurate detection of system anomalies is of great significance to ensure system stability. However, large-scale distributed systems are becoming more andmore complex, and the number of system logs gradually increases, which bringschallenges to analyze system logs. Some recent studies show that logs can beunstable due to the evolution of log statements and noise introduced by log collection and parsing. Moreover, deep learning-based detection methods take a longtime to train models. Therefore, to reduce the computational cost and avoid loginstability we propose a new Word2Vec-based log unsupervised anomaly detection method (LogUAD). LogUAD does not require a log parsing step and takesoriginal log messages as input to avoid the noise. LogUAD uses Word2Vec togenerate word vectors and generates weighted log sequence feature vectors withTF-IDF to handle the evolution of log statements. At last, a computationally effi-cient unsupervised clustering is exploited to detect the anomaly. We conductedextensive experiments on the public dataset from Blue Gene/L (BGL). Experimental results show that the F1-score of LogUAD can be improved by 67.25%compared to LogCluster.展开更多
Energy efficiency is an important criterion for routing algorithms in the wireless sensor network. Cooperative routing can reduce energy consumption effectively stemming from its diversity gain advantage. To solve the...Energy efficiency is an important criterion for routing algorithms in the wireless sensor network. Cooperative routing can reduce energy consumption effectively stemming from its diversity gain advantage. To solve the energy consumption problem and maximize the network lifetime, this paper proposes a Virtual Multiple Input Multiple Output based Cooperative Routing algorithm(VMIMOCR). VMIMOCR chooses cooperative relay nodes based on Virtual Multiple Input Multiple Output Model, and balances energy consumption by reasonable power allocation among transmitters, and decides the forwarding path finally. The experimental results show that VMIMOCR can improve network lifetime from 37% to 348% in the medium node density, compared with existing routing algorithms.展开更多
As the development of smart grid and energy internet, this leads to a significantincrease in the amount of data transmitted in real time. Due to the mismatch withcommunication networks that were not designed to carry ...As the development of smart grid and energy internet, this leads to a significantincrease in the amount of data transmitted in real time. Due to the mismatch withcommunication networks that were not designed to carry high-speed and real time data,data losses and data quality degradation may happen constantly. For this problem,according to the strong spatial and temporal correlation of electricity data which isgenerated by human’s actions and feelings, we build a low-rank electricity data matrixwhere the row is time and the column is user. Inspired by matrix decomposition, we dividethe low-rank electricity data matrix into the multiply of two small matrices and use theknown data to approximate the low-rank electricity data matrix and recover the missedelectrical data. Based on the real electricity data, we analyze the low-rankness of theelectricity data matrix and perform the Matrix Decomposition-based method on the realdata. The experimental results verify the efficiency and efficiency of the proposed scheme.展开更多
Esophageal cancer(EC)is one of the most common cancers with high morbidity and mortality rates.EC includes two histological subtypes,namely esophageal squamous cell carcinoma(ESCC)and esophageal adenocarcinoma(EAC).ES...Esophageal cancer(EC)is one of the most common cancers with high morbidity and mortality rates.EC includes two histological subtypes,namely esophageal squamous cell carcinoma(ESCC)and esophageal adenocarcinoma(EAC).ESCC primarily occurs in East Asia,whereas EAC occurs in Western countries.The currently available treatment strategies for EC include surgery,chemotherapy,radiation therapy,molecular targeted therapy,and combinations thereof.However,the prognosis remains poor,and the overall five-year survival rate is very low.Therefore,achieving the goal of effective treatment remains challenging.In this review,we discuss the latest developments in chemotherapy and molecular targeted therapy for EC,and comprehensively analyze the application prospects and existing problems of immunotherapy.Collectively,this review aims to provide a better understanding of the currently available drugs through in-depth analysis,promote the development of new therapeutic agents,and eventually improve the treatment outcomes of patients with EC.展开更多
基金supported in part by the National Natural Science Foundation of China under Grant 62272062the Scientific Research Fund of Hunan Provincial Transportation Department(No.202143)the Open Fund ofKey Laboratory of Safety Control of Bridge Engineering,Ministry of Education(Changsha University of Science Technology)under Grant 21KB07.
文摘Log anomaly detection is an important paradigm for system troubleshooting.Existing log anomaly detection based on Long Short-Term Memory(LSTM)networks is time-consuming to handle long sequences.Transformer model is introduced to promote efficiency.However,most existing Transformer-based log anomaly detection methods convert unstructured log messages into structured templates by log parsing,which introduces parsing errors.They only extract simple semantic feature,which ignores other features,and are generally supervised,relying on the amount of labeled data.To overcome the limitations of existing methods,this paper proposes a novel unsupervised log anomaly detection method based on multi-feature(UMFLog).UMFLog includes two sub-models to consider two kinds of features:semantic feature and statistical feature,respectively.UMFLog applies the log original content with detailed parameters instead of templates or template IDs to avoid log parsing errors.In the first sub-model,UMFLog uses Bidirectional Encoder Representations from Transformers(BERT)instead of random initialization to extract effective semantic feature,and an unsupervised hypersphere-based Transformer model to learn compact log sequence representations and obtain anomaly candidates.In the second sub-model,UMFLog exploits a statistical feature-based Variational Autoencoder(VAE)about word occurrence times to identify the final anomaly from anomaly candidates.Extensive experiments and evaluations are conducted on three real public log datasets.The results show that UMFLog significantly improves F1-scores compared to the state-of-the-art(SOTA)methods because of the multi-feature.
基金supported in part by the National Natural Science Foundation of China under Grant 62272062the Researchers Supporting Project number.(RSP2023R102)King Saud University+5 种基金Riyadh,Saudi Arabia,the Open Research Fund of the Hunan Provincial Key Laboratory of Network Investigational Technology under Grant 2018WLZC003the National Science Foundation of Hunan Province under Grant 2020JJ2029the Hunan Provincial Key Research and Development Program under Grant 2022GK2019the Science Fund for Creative Research Groups of Hunan Province under Grant 2020JJ1006the Scientific Research Fund of Hunan Provincial Transportation Department under Grant 202143the Open Fund of Key Laboratory of Safety Control of Bridge Engineering,Ministry of Education(Changsha University of Science Technology)under Grant 21KB07.
文摘Sensors produce a large amount of multivariate time series data to record the states of Internet of Things(IoT)systems.Multivariate time series timestamp anomaly detection(TSAD)can identify timestamps of attacks and malfunctions.However,it is necessary to determine which sensor or indicator is abnormal to facilitate a more detailed diagnosis,a process referred to as fine-grained anomaly detection(FGAD).Although further FGAD can be extended based on TSAD methods,existing works do not provide a quantitative evaluation,and the performance is unknown.Therefore,to tackle the FGAD problem,this paper first verifies that the TSAD methods achieve low performance when applied to the FGAD task directly because of the excessive fusion of features and the ignoring of the relationship’s dynamic changes between indicators.Accordingly,this paper proposes a mul-tivariate time series fine-grained anomaly detection(MFGAD)framework.To avoid excessive fusion of features,MFGAD constructs two sub-models to independently identify the abnormal timestamp and abnormal indicator instead of a single model and then combines the two kinds of abnormal results to detect the fine-grained anomaly.Based on this framework,an algorithm based on Graph Attention Neural Network(GAT)and Attention Convolutional Long-Short Term Memory(A-ConvLSTM)is proposed,in which GAT learns temporal features of multiple indicators to detect abnormal timestamps and A-ConvLSTM captures the dynamic relationship between indicators to identify abnormal indicators.Extensive simulations on a real-world dataset demonstrate that the proposed algorithm can achieve a higher F1 score and hit rate than the extension of existing TSAD methods with the benefit of two independent sub-models for timestamp and indicator detection.
基金supported by grants from the National Natural Science Foundation of China(Nos.82273350,81972380,81872083,and 81772703)the Natural Science Foundation of Beijing(No.7222190)+1 种基金the Beijing Municipal Science&Technology Commission Project(No.Z201100005420029)National High Level Hospital Clinical Research Funding(Scientific Research Seed Fund of Peking University First Hospital,No.2022SF18)
文摘Diagnostic methods for urothelial carcinomas(UCs)are often invasive or have suboptimal accuracy.Methylation of exfoliated cell DNA or cell-free DNA in urine has shown great promise in the diagnosis of UCs.However,most current studies have focused on bladder cancer(BCa),and only a few high-plex methylated DNA panels based on large-volume urine have been reported to exhibit both high sensitivity and specificity.[1,2]The purpose of this study was to identify universal biomarkers for BCa and upper tract urothelial carcinoma(UTUC)using a small volume of urine.We developed a dual-target diagnostic panel comprising the novel marker AL021918.2 and the well-known BCa biomarker Vimentin(VIM).This panel can accurately detect UCs using only 1.8 mL of full-voided urine.
基金This work was supported by National Natural Science Foundation of China(Nos.61802030,61572184)the Science and Technology Projects of Hunan Province(No.2016JC2075)the International Cooperative Project for“Double First-Class”,CSUST(No.2018IC24).
文摘With the popularity of deep learning tools in image decomposition and natural language processing,how to support and store a large number of parameters required by deep learning algorithms has become an urgent problem to be solved.These parameters are huge and can be as many as millions.At present,a feasible direction is to use the sparse representation technique to compress the parameter matrix to achieve the purpose of reducing parameters and reducing the storage pressure.These methods include matrix decomposition and tensor decomposition.To let vector take advance of the compressing performance of matrix decomposition and tensor decomposition,we use reshaping and unfolding to let vector be the input and output of Tensor-Factorized Neural Networks.We analyze how reshaping can get the best compress ratio.According to the relationship between the shape of tensor and the number of parameters,we get a lower bound of the number of parameters.We take some data sets to verify the lower bound.
基金funded by the Researchers Supporting Project No.(RSP.2021/102)King Saud University,Riyadh,Saudi ArabiaThis work was supported in part by the National Natural Science Foundation of China under Grant 61802030+2 种基金Natural Science Foundation of Hunan Province under Grant 2020JJ5602the Research Foundation of Education Bureau of Hunan Province under Grant 19B005the International Cooperative Project for“Double First-Class”,CSUST under Grant 2018IC24.
文摘System logs record detailed information about system operation and areimportant for analyzing the system's operational status and performance. Rapidand accurate detection of system anomalies is of great significance to ensure system stability. However, large-scale distributed systems are becoming more andmore complex, and the number of system logs gradually increases, which bringschallenges to analyze system logs. Some recent studies show that logs can beunstable due to the evolution of log statements and noise introduced by log collection and parsing. Moreover, deep learning-based detection methods take a longtime to train models. Therefore, to reduce the computational cost and avoid loginstability we propose a new Word2Vec-based log unsupervised anomaly detection method (LogUAD). LogUAD does not require a log parsing step and takesoriginal log messages as input to avoid the noise. LogUAD uses Word2Vec togenerate word vectors and generates weighted log sequence feature vectors withTF-IDF to handle the evolution of log statements. At last, a computationally effi-cient unsupervised clustering is exploited to detect the anomaly. We conductedextensive experiments on the public dataset from Blue Gene/L (BGL). Experimental results show that the F1-score of LogUAD can be improved by 67.25%compared to LogCluster.
基金supported by the National Basic Research Program of China (973 program) (Grant No.2012CB315805)the National Natural Science Foundation of China (Grant No.61472130 and 61572184)
文摘Energy efficiency is an important criterion for routing algorithms in the wireless sensor network. Cooperative routing can reduce energy consumption effectively stemming from its diversity gain advantage. To solve the energy consumption problem and maximize the network lifetime, this paper proposes a Virtual Multiple Input Multiple Output based Cooperative Routing algorithm(VMIMOCR). VMIMOCR chooses cooperative relay nodes based on Virtual Multiple Input Multiple Output Model, and balances energy consumption by reasonable power allocation among transmitters, and decides the forwarding path finally. The experimental results show that VMIMOCR can improve network lifetime from 37% to 348% in the medium node density, compared with existing routing algorithms.
文摘As the development of smart grid and energy internet, this leads to a significantincrease in the amount of data transmitted in real time. Due to the mismatch withcommunication networks that were not designed to carry high-speed and real time data,data losses and data quality degradation may happen constantly. For this problem,according to the strong spatial and temporal correlation of electricity data which isgenerated by human’s actions and feelings, we build a low-rank electricity data matrixwhere the row is time and the column is user. Inspired by matrix decomposition, we dividethe low-rank electricity data matrix into the multiply of two small matrices and use theknown data to approximate the low-rank electricity data matrix and recover the missedelectrical data. Based on the real electricity data, we analyze the low-rankness of theelectricity data matrix and perform the Matrix Decomposition-based method on the realdata. The experimental results verify the efficiency and efficiency of the proposed scheme.
基金supported by the National Key Research and Development Program of China(No.2016YFA0201504)the Drug Innovation Major Project(No.2018ZX09711001-009-003,China)CAMS Innovation Fund for Medical Sciences(No.2016-I2M3-013,China)。
文摘Esophageal cancer(EC)is one of the most common cancers with high morbidity and mortality rates.EC includes two histological subtypes,namely esophageal squamous cell carcinoma(ESCC)and esophageal adenocarcinoma(EAC).ESCC primarily occurs in East Asia,whereas EAC occurs in Western countries.The currently available treatment strategies for EC include surgery,chemotherapy,radiation therapy,molecular targeted therapy,and combinations thereof.However,the prognosis remains poor,and the overall five-year survival rate is very low.Therefore,achieving the goal of effective treatment remains challenging.In this review,we discuss the latest developments in chemotherapy and molecular targeted therapy for EC,and comprehensively analyze the application prospects and existing problems of immunotherapy.Collectively,this review aims to provide a better understanding of the currently available drugs through in-depth analysis,promote the development of new therapeutic agents,and eventually improve the treatment outcomes of patients with EC.