In content-based image retrieval(CBIR),primitive image signatures are critical because they represent the visual characteristics.Image signatures,which are algorithmically descriptive and accurately recognized visual ...In content-based image retrieval(CBIR),primitive image signatures are critical because they represent the visual characteristics.Image signatures,which are algorithmically descriptive and accurately recognized visual components,are used to appropriately index and retrieve comparable results.To differentiate an image in the category of qualifying contender,feature vectors must have image information's like colour,objects,shape,spatial viewpoints.Previous methods such as sketch-based image retrieval by salient contour(SBIR)and greedy learning of deep Boltzmann machine(GDBM)used spatial information to distinguish between image categories.This requires interest points and also feature analysis emerged image detection problems.Thus,a proposed model to overcome this issue and predict the repeating pattern as well as series of pixels that conclude similarity has been necessary.In this study,a technique called CBIR-similarity measure via artificial neural network interpolation(CBIR-SMANN)has been presented.By collecting datasets,the images are resized then subject to Gaussian filtering in the pre-processing stage,then by permitting them to the Hessian detector,the interesting points are gathered.Based on Skewness,mean,kurtosis and standard deviation features were extracted then given to ANN for interpolation.Interpolated results are stored in a database for retrieval.In the testing stage,the query image was inputted that is subjected to pre-processing,and feature extraction was then fed to the similarity measurement function.Thus,ANN helps to get similar images from the database.CBIR-SMANN have been implemented in the python tool and then evaluated for its performance.Results show that CBIR-SMANN exhibited a high recall value of 78%with a minimum retrieval time of 980 ms.This showed the supremacy of the proposed model was comparatively greater than the previous ones.展开更多
Objective:Nature theory of Chinese medicine (CM) is the core basic theory of Traditional Chinese Medicine (TCM), in which cold-hot nature is the focus of research. Studies have found that CM ingredients are the materi...Objective:Nature theory of Chinese medicine (CM) is the core basic theory of Traditional Chinese Medicine (TCM), in which cold-hot nature is the focus of research. Studies have found that CM ingredients are the material basis for the production of medicine natures. Therefore, it is speculated that CMs with similar composition of substances should have similar medicinal nature. Modern work studies cold-hot medicine of CMs with chemical fingerprinting technology because the chemical fingerprint data of CM can reflect the whole composition of CM ingredients. Methods:To verify the hypothesis above, in this work, we study quantifying the similarity of CM ingredients to fingerprint similarity, and explore the relationship between the composition of CMs and cold-hot nature. Firstly, we utilize ultraviolet (UV) spectrum technology to analyze 61 CMs, which have clear cold-hot nature (including 30 ‘cold’ CMs and 31 ‘hot’ CMs). Secondly, with the constructed fingerprint database of CMs, a distance metric learning algorithm is studied to metric the similarity of UV fingerprints. Finally, a retrieval scheme is proposed to build a predictive identification model to identify cold-hot nature of CMs. Results:By means of numerous experiment analyses, ultraviolet spectrum data of petroleum ether solvent can better represent CMs to distinguish between cold and hot natures. Comparing with existing classical models, the proposed identification scheme has better predictive performance. Conclusion:The experimental results prove our inference that CMs with similar composition of substances should have similar medicinal nature. The proposed prediction model is proved to be effective and feasible.展开更多
Similarity measure construction has been proposed as fault detection of flight test method in order to obtain the primary control surface stuck and the combination stuck of primary control.Similarity measures were obt...Similarity measure construction has been proposed as fault detection of flight test method in order to obtain the primary control surface stuck and the combination stuck of primary control.Similarity measures were obtained through analyzing the certainty and uncertainty of fuzzy membership functions,which were designed based on well-known Hamming distance.It was applied to the fault detection of primary control surface stuck of uninhabited aerial vehicle(UAV).At post-failure control surface,if the UAV is controllable and trimmable using other control surfaces,the UAV is able to fly or return to the safety region through reconfiguration of flight control system.To detect the fault,similarity measure computations were carried out.This result could be applicable with the real-time parameter estimation method.By monitoring the value of coefficients due to the control surface deviation,it becomes aware that the control surface fault occurs or not.The control surface stuck position and value were separated by comparing the trim value with the reference value.This is the advantage of increasing in reliability without adding sensors or with additional low cost.展开更多
Collaborative filtering (CF) has been widely applied to recommender systems, since it can assist users to discover their favorite items. Similarity measurement that measures the similarity between two users or items...Collaborative filtering (CF) has been widely applied to recommender systems, since it can assist users to discover their favorite items. Similarity measurement that measures the similarity between two users or items is critical to CF. However, traditional similarity measurement approaches for memory-based CF can be strongly improved. In this paper, we propose a novel similarity measurement, named Jaccard Uniform Operator Distance (JacUOD), to effectively measure the similarity. Our JacUOD approach aims at unifying similarity comparison for vectors in different multidimensional vector spaces. Compared with traditional similarity measurement approaches, JacUOD properly handles dimension-number difference for different vector spaces. We conduct experiments based on the well-known MovieLens datasets, and take user-based CF as an example to show the effectiveness of our approach. The experimental results show that our JacUOD approach achieves better prediction accuracy than traditional similarity measurement approaches.展开更多
Purpose–The similarity measurement of time series is an important research in time series detection,which is a basic work of time series clustering,anomaly discovery,prediction and many other data mining problems.The...Purpose–The similarity measurement of time series is an important research in time series detection,which is a basic work of time series clustering,anomaly discovery,prediction and many other data mining problems.The purpose of this paper is to design a new similarity measurement algorithm to improve the performance of the original similarity measurement algorithm.The subsequence morphological information is taken into account by the proposed algorithm,and time series is represented by a pattern,so the similarity measurement algorithm is more accurate.Design/methodology/approach–Following some previous researches on similarity measurement,an improved method is presented.This new method combines morphological representation and dynamic time warping(DTW)technique to measure the similarities of time series.After the segmentation of time series data into segments,three parameter values of median,point number and slope are introduced into the improved distance measurement formula.The effectiveness of the morphological weighted DTW algorithm(MW-DTW)is demonstrated by the example of momentum wheel data of an aircraft attitude control system.Findings–The improved method is insensitive to the distortion and expansion of time axis and can be used to detect the morphological changes of time series data.Simulation results confirm that this method proposed in this paper has a high accuracy of similarity measurement.Practical implications–This improved method has been used to solve the problem of similarity measurement in time series,which is widely emerged in different fields of science and engineering,such as the field of control,measurement,monitoring,process signal processing and economic analysis.Originality/value–In the similarity measurement of time series,the distance between sequences is often used as the only detection index.The results of similarity measurement should not be affected by the longitudinal or transverse stretching and translation changes of the sequence,so it is necessary to incorporate themorphological changes of the sequence into similarity measurement.The MW-DTW is more suitable for the actual situation.At the same time,the MW-DTW algorithm reduces the computational complexity by transforming the computational object to subsequences.展开更多
Fidelity plays an important role in quantum information processing,which provides a basic scale for comparing two quantum states.At present,one of the most commonly used fidelities is Uhlmann-Jozsa(U-J)fidelity.Howeve...Fidelity plays an important role in quantum information processing,which provides a basic scale for comparing two quantum states.At present,one of the most commonly used fidelities is Uhlmann-Jozsa(U-J)fidelity.However,U-J fidelity needs to calculate the square root of the matrix,which is not trivial in the case of large or infinite density matrices.Moreover,U-J fidelity is a measure of overlap,which has limitations in some cases and cannot reflect the similarity between quantum states well.Therefore,a novel quantum fidelity measure called quantum Tanimoto coefficient(QTC)fidelity is proposed in this paper.Unlike other existing fidelities,QTC fidelity not only considers the overlap between quantum states,but also takes into account the separation between quantum states for the first time,which leads to a better performance of measure.Specifically,we discuss the properties of the proposed QTC fidelity.QTC fidelity is compared with some existing fidelities through specific examples,which reflects the effectiveness and advantages of QTC fidelity.In addition,based on the QTC fidelity,three discrimination coefficients d_(1)^(QTC),d_(2)^(QTC),and d_^(3)^(QTC)are defined to measure the difference between quantum states.It is proved that the discrimination coefficient d_(3)^(QTC)is a true metric.Finally,we apply the proposed QTC fidelity-based discrimination coefficients to measure the entanglement of quantum states to show their practicability.展开更多
Content-based medical image retrieval(CBMIR)is a technique for retrieving medical images based on automatically derived image features.There are many applications of CBMIR,such as teaching,research,diagnosis and elect...Content-based medical image retrieval(CBMIR)is a technique for retrieving medical images based on automatically derived image features.There are many applications of CBMIR,such as teaching,research,diagnosis and electronic patient records.Several methods are applied to enhance the retrieval performance of CBMIR systems.Developing new and effective similarity measure and features fusion methods are two of the most powerful and effective strategies for improving these systems.This study proposes the relative difference-based similarity measure(RDBSM)for CBMIR.The new measure was first used in the similarity calculation stage for the CBMIR using an unweighted fusion method of traditional color and texture features.Furthermore,the study also proposes a weighted fusion method for medical image features extracted using pre-trained convolutional neural networks(CNNs)models.Our proposed RDBSM has outperformed the standard well-known similarity and distance measures using two popular medical image datasets,Kvasir and PH2,in terms of recall and precision retrieval measures.The effectiveness and quality of our proposed similarity measure are also proved using a significant test and statistical confidence bound.展开更多
Ontologies have been used for several years in life sciences to formally represent concepts and reason about knowledge bases in domains such as the semantic web, information retrieval and artificial intelligence. The ...Ontologies have been used for several years in life sciences to formally represent concepts and reason about knowledge bases in domains such as the semantic web, information retrieval and artificial intelligence. The exploration of these domains for the correspondence of semantic content requires calculation of the measure of semantic similarity between concepts. Semantic similarity is a measure on a set of documents, based on the similarity of their meanings, which refers to the similarity between two concepts belonging to one or more ontologies. The similarity between concepts is also a quantitative measure of information, calculated based on the properties of concepts and their relationships. This study proposes a method for finding similarity between concepts in two different ontologies based on feature, information content and structure. More specifically, this means proposing a hybrid method using two existing measures to find the similarity between two concepts from different ontologies based on information content and the set of common superconcepts, which represents the set of common parent concepts. We simulated our method on datasets. The results show that our measure provides similarity values that are better than those reported in the literature.展开更多
Pattern matching method is one of the classic classifications of existing online portfolio selection strategies. This article aims to study the key aspects of this method—measurement of similarity and selection of si...Pattern matching method is one of the classic classifications of existing online portfolio selection strategies. This article aims to study the key aspects of this method—measurement of similarity and selection of similarity sets, and proposes a Portfolio Selection Method based on Pattern Matching with Dual Information of Direction and Distance (PMDI). By studying different combination methods of indicators such as Euclidean distance, Chebyshev distance, and correlation coefficient, important information such as direction and distance in stock historical price information is extracted, thereby filtering out the similarity set required for pattern matching based investment portfolio selection algorithms. A large number of experiments conducted on two datasets of real stock markets have shown that PMDI outperforms other algorithms in balancing income and risk. Therefore, it is suitable for the financial environment in the real world.展开更多
A method that combines category-based and keyword-based concepts for a better information retrieval system is introduced. To improve document clustering, a document similarity measure based on cosine vector and keywor...A method that combines category-based and keyword-based concepts for a better information retrieval system is introduced. To improve document clustering, a document similarity measure based on cosine vector and keywords frequency in documents is proposed, but also with an input ontology. The ontology is domain specific and includes a list of keywords organized by degree of importance to the categories of the ontology, and by means of semantic knowledge, the ontology can improve the effects of document similarity measure and feedback of information retrieval systems. Two approaches to evaluating the performance of this similarity measure and the comparison with standard cosine vector similarity measure are also described.展开更多
Aiming at the problem that the data-driven automatic correlation methods which are difficult to adapt to the automatic correlation of oil-bearing strata with large changes in lateral sedimentary facies and strata thic...Aiming at the problem that the data-driven automatic correlation methods which are difficult to adapt to the automatic correlation of oil-bearing strata with large changes in lateral sedimentary facies and strata thickness,an intelligent automatic correlation method of oil-bearing strata based on pattern constraints is formed.We propose to introduce knowledge-driven in automatic correlation of oil-bearing strata,constraining the correlation process by stratigraphic sedimentary patterns and improving the similarity measuring machine and conditional constraint dynamic time warping algorithm to automate the correlation of marker layers and the interfaces of each stratum.The application in Shishen 100 block in the Shinan Oilfield of the Bohai Bay Basin shows that the coincidence rate of the marker layers identified by this method is over 95.00%,and the average coincidence rate of identified oil-bearing strata reaches 90.02% compared to artificial correlation results,which is about 17 percentage points higher than that of the existing automatic correlation methods.The accuracy of the automatic correlation of oil-bearing strata has been effectively improved.展开更多
This paper presents a fuzzy logic approach to efficiently perform unsupervised character classification for improvement in robustness, correctness and speed of a character recognition system. The characters are first ...This paper presents a fuzzy logic approach to efficiently perform unsupervised character classification for improvement in robustness, correctness and speed of a character recognition system. The characters are first split into eight typographical categories. The classification scheme uses pattern matching to classify the characters in each category into a set of fuzzy prototypes based on a nonlinear weighted similarity function. The fuzzy unsupervised character classification, which is natural in the repre...展开更多
The existing multi-source contour matching studies have focused on the matching methods with consideration of topological relations and similarity measurement based on spatial Euclidean distance,while it is lack of ta...The existing multi-source contour matching studies have focused on the matching methods with consideration of topological relations and similarity measurement based on spatial Euclidean distance,while it is lack of taking the contour geometric features into account,which may lead to mismatching in map boundaries and areas with intensive contours or extreme terrain changes.In light of this,it is put forward that a matching strategy from coarse to precious based on the contour geometric features.The proposed matching strategy can be described as follows.Firstly,the point sequence is converted to feature sequence according to a feature descriptive function based on curvature and angle of normal vector.Then the level of similarity among multi-source contours is calculated by using the longest common subsequence solution.Accordingly,the identical contours could be matched based on the above calculated results.In the experiment for the proposed method,the reliability and efficiency of the matching method are verified using simulative datasets and real datasets respectively.It has been proved that the proposed contour matching strategy has a high matching precision and good applicability.展开更多
The successful face recognition based on local binary pattern(LBP)relies on the effective extraction of LBP features and the inferring of similarity between the extracted features.In this paper,we focus on the latter ...The successful face recognition based on local binary pattern(LBP)relies on the effective extraction of LBP features and the inferring of similarity between the extracted features.In this paper,we focus on the latter and propose two novel similarity measures for the local matching methods and the holistic matching methods respectively.One is Earth Mover's Distance with Hamming and Lp ground distance(EMD-HammingLp),which is a cross-bin dissimilarity measure for LBP histograms.The other is IMage Hamming Distance(IMHD),which is a dissimilarity measure for the whole LBP images.Experiments on FERET database show that the proposed two similarity measures outperform the state-of-the-art Chi-square similarity measure for extraction of LBP features.展开更多
A new modeling and monitoring approach for multi-mode processes is proposed.The method of similarity measure(SM) and kernel principal component analysis(KPCA) are integrated to construct SM-KPCA monitoring scheme,wher...A new modeling and monitoring approach for multi-mode processes is proposed.The method of similarity measure(SM) and kernel principal component analysis(KPCA) are integrated to construct SM-KPCA monitoring scheme,where SM method serves as the separation of common subspace and specific subspace.Compared with the traditional methods,the main contributions of this work are:1) SM consisted of two measures of distance and angle to accommodate process characters.The different monitoring effect involves putting on the different weight,which would simplify the monitoring model structure and enhance its reliability and robustness.2) The proposed method can be used to find faults by the common space and judge which mode the fault belongs to by the specific subspace.Results of algorithm analysis and fault detection experiments indicate the validity and practicability of the presented method.展开更多
The similarity computations for fuzzy membership function pairs were carried out.Fuzzy number related knowledge was introduced,and conventional similarity was compared with distance based similarity measure.The useful...The similarity computations for fuzzy membership function pairs were carried out.Fuzzy number related knowledge was introduced,and conventional similarity was compared with distance based similarity measure.The usefulness of the proposed similarity measure was verified.The results show that the proposed similarity measure could be applied to ordinary fuzzy membership functions,though it was not easy to design.Through conventional results on the calculation of similarity for fuzzy membership pair,fuzzy membership-crisp pair and crisp-crisp pair were carried out.The proposed distance based similarity measure represented rational performance with the heuristic point of view.Furthermore,troublesome in fuzzy number based similarity measure for abnormal universe of discourse case was discussed.Finally,the similarity measure computation for various membership function pairs was discussed with other conventional results.展开更多
Study of fuzzy entropy and similarity measure on intuitionistic fuzzy sets (IFSs) was proposed and analyzed. Unlike fuzzy set, IFSs contain uncertainty named hesitance, which is contained in fuzzy membership function ...Study of fuzzy entropy and similarity measure on intuitionistic fuzzy sets (IFSs) was proposed and analyzed. Unlike fuzzy set, IFSs contain uncertainty named hesitance, which is contained in fuzzy membership function itself. Hence, designing fuzzy entropy is not easy because of many entropy definitions. By considering different fuzzy entropy definitions, fuzzy entropy on IFSs is designed and discussed. Similarity measure was also presented and its usefulness was verified to evaluate degree of similarity.展开更多
Fault detection technique is introduced with similarity measure. The characteristics of conventional similarity measure based on fuzzy number are discussed. With the help of distance measure, similarity measure is con...Fault detection technique is introduced with similarity measure. The characteristics of conventional similarity measure based on fuzzy number are discussed. With the help of distance measure, similarity measure is constructed explicitly. The designed distance-based similarity measure is applicable to general fuzzy membership functions including non-convex fuzzy membership function, whereas fuzzy number-based similarity measure has limitation to calculate the similarity of general fuzzy membership functions. The applicability of the proposed similarity measure to general fuzzy membership structures is proven by identifying the definition. To decide fault detection of flight system, the experimental data (pitching moment coefficients and lift coefficients) are transformed into fuzzy membership functions. Distance-based similarity measure is applied to the obtained fuzzy membership functions, and similarity computation and analysis are obtained with the fault and normal operation coefficients.展开更多
Information analysis of high dimensional data was carried out through similarity measure application. High dimensional data were considered as the a typical structure. Additionally, overlapped and non-overlapped data ...Information analysis of high dimensional data was carried out through similarity measure application. High dimensional data were considered as the a typical structure. Additionally, overlapped and non-overlapped data were introduced, and similarity measure analysis was also illustrated and compared with conventional similarity measure. As a result, overlapped data comparison was possible to present similarity with conventional similarity measure. Non-overlapped data similarity analysis provided the clue to solve the similarity of high dimensional data. Considering high dimensional data analysis was designed with consideration of neighborhoods information. Conservative and strict solutions were proposed. Proposed similarity measure was applied to express financial fraud among multi dimensional datasets. In illustrative example, financial fraud similarity with respect to age, gender, qualification and job was presented. And with the proposed similarity measure, high dimensional personal data were calculated to evaluate how similar to the financial fraud. Calculation results show that the actual fraud has rather high similarity measure compared to the average, from minimal 0.0609 to maximal 0.1667.展开更多
Similarity measure design for discrete data group was proposed. Similarity measure design for continuous membership function was also carried out. Proposed similarity measures were designed based on fuzzy number and d...Similarity measure design for discrete data group was proposed. Similarity measure design for continuous membership function was also carried out. Proposed similarity measures were designed based on fuzzy number and distance measure, and were proved. To calculate the degree of similarity of discrete data, relative degree between data and total distribution was obtained. Discrete data similarity measure was completed with combination of mentioned relative degrees. Power interconnected system with multi characteristics was considered to apply discrete similarity measure. Naturally, similarity measure was extended to multi-dimensional similarity measure case, and applied to bus clustering problem.展开更多
文摘In content-based image retrieval(CBIR),primitive image signatures are critical because they represent the visual characteristics.Image signatures,which are algorithmically descriptive and accurately recognized visual components,are used to appropriately index and retrieve comparable results.To differentiate an image in the category of qualifying contender,feature vectors must have image information's like colour,objects,shape,spatial viewpoints.Previous methods such as sketch-based image retrieval by salient contour(SBIR)and greedy learning of deep Boltzmann machine(GDBM)used spatial information to distinguish between image categories.This requires interest points and also feature analysis emerged image detection problems.Thus,a proposed model to overcome this issue and predict the repeating pattern as well as series of pixels that conclude similarity has been necessary.In this study,a technique called CBIR-similarity measure via artificial neural network interpolation(CBIR-SMANN)has been presented.By collecting datasets,the images are resized then subject to Gaussian filtering in the pre-processing stage,then by permitting them to the Hessian detector,the interesting points are gathered.Based on Skewness,mean,kurtosis and standard deviation features were extracted then given to ANN for interpolation.Interpolated results are stored in a database for retrieval.In the testing stage,the query image was inputted that is subjected to pre-processing,and feature extraction was then fed to the similarity measurement function.Thus,ANN helps to get similar images from the database.CBIR-SMANN have been implemented in the python tool and then evaluated for its performance.Results show that CBIR-SMANN exhibited a high recall value of 78%with a minimum retrieval time of 980 ms.This showed the supremacy of the proposed model was comparatively greater than the previous ones.
基金National key basic research development program (973 Program)(Grant No.:2007CB512600)National Natural Science Foundation of China (Grant No.:81473369)+1 种基金Key research and development plan of Shandong province (Grant No.:2016CYJS08A01-1)Shandong Province TCM science and technology development plan project (2019-0037).
文摘Objective:Nature theory of Chinese medicine (CM) is the core basic theory of Traditional Chinese Medicine (TCM), in which cold-hot nature is the focus of research. Studies have found that CM ingredients are the material basis for the production of medicine natures. Therefore, it is speculated that CMs with similar composition of substances should have similar medicinal nature. Modern work studies cold-hot medicine of CMs with chemical fingerprinting technology because the chemical fingerprint data of CM can reflect the whole composition of CM ingredients. Methods:To verify the hypothesis above, in this work, we study quantifying the similarity of CM ingredients to fingerprint similarity, and explore the relationship between the composition of CMs and cold-hot nature. Firstly, we utilize ultraviolet (UV) spectrum technology to analyze 61 CMs, which have clear cold-hot nature (including 30 ‘cold’ CMs and 31 ‘hot’ CMs). Secondly, with the constructed fingerprint database of CMs, a distance metric learning algorithm is studied to metric the similarity of UV fingerprints. Finally, a retrieval scheme is proposed to build a predictive identification model to identify cold-hot nature of CMs. Results:By means of numerous experiment analyses, ultraviolet spectrum data of petroleum ether solvent can better represent CMs to distinguish between cold and hot natures. Comparing with existing classical models, the proposed identification scheme has better predictive performance. Conclusion:The experimental results prove our inference that CMs with similar composition of substances should have similar medicinal nature. The proposed prediction model is proved to be effective and feasible.
基金Project(20110018394) supported by Key Research Institute Program through the National Research Foundation (NRF) of Korea
文摘Similarity measure construction has been proposed as fault detection of flight test method in order to obtain the primary control surface stuck and the combination stuck of primary control.Similarity measures were obtained through analyzing the certainty and uncertainty of fuzzy membership functions,which were designed based on well-known Hamming distance.It was applied to the fault detection of primary control surface stuck of uninhabited aerial vehicle(UAV).At post-failure control surface,if the UAV is controllable and trimmable using other control surfaces,the UAV is able to fly or return to the safety region through reconfiguration of flight control system.To detect the fault,similarity measure computations were carried out.This result could be applicable with the real-time parameter estimation method.By monitoring the value of coefficients due to the control surface deviation,it becomes aware that the control surface fault occurs or not.The control surface stuck position and value were separated by comparing the trim value with the reference value.This is the advantage of increasing in reliability without adding sensors or with additional low cost.
基金supported by the National Basic Research 973 Program of China under Grant No.2011CB302506the National Natural Science Foundation of China under Grant Nos.61001118,61132001,61003067+1 种基金the National Major Science and Technology Project of New Generation Broadband Wireless Network of China under Grant No.2010ZX03004-001the Fundamental Research Funds for the Central Universities of Beijing University of Posts and Telecommunications of China under Grant No.2011RC0502
文摘Collaborative filtering (CF) has been widely applied to recommender systems, since it can assist users to discover their favorite items. Similarity measurement that measures the similarity between two users or items is critical to CF. However, traditional similarity measurement approaches for memory-based CF can be strongly improved. In this paper, we propose a novel similarity measurement, named Jaccard Uniform Operator Distance (JacUOD), to effectively measure the similarity. Our JacUOD approach aims at unifying similarity comparison for vectors in different multidimensional vector spaces. Compared with traditional similarity measurement approaches, JacUOD properly handles dimension-number difference for different vector spaces. We conduct experiments based on the well-known MovieLens datasets, and take user-based CF as an example to show the effectiveness of our approach. The experimental results show that our JacUOD approach achieves better prediction accuracy than traditional similarity measurement approaches.
基金This paper is supported by the National Nature Science Foundation of China(Nos 61473222,91646108).
文摘Purpose–The similarity measurement of time series is an important research in time series detection,which is a basic work of time series clustering,anomaly discovery,prediction and many other data mining problems.The purpose of this paper is to design a new similarity measurement algorithm to improve the performance of the original similarity measurement algorithm.The subsequence morphological information is taken into account by the proposed algorithm,and time series is represented by a pattern,so the similarity measurement algorithm is more accurate.Design/methodology/approach–Following some previous researches on similarity measurement,an improved method is presented.This new method combines morphological representation and dynamic time warping(DTW)technique to measure the similarities of time series.After the segmentation of time series data into segments,three parameter values of median,point number and slope are introduced into the improved distance measurement formula.The effectiveness of the morphological weighted DTW algorithm(MW-DTW)is demonstrated by the example of momentum wheel data of an aircraft attitude control system.Findings–The improved method is insensitive to the distortion and expansion of time axis and can be used to detect the morphological changes of time series data.Simulation results confirm that this method proposed in this paper has a high accuracy of similarity measurement.Practical implications–This improved method has been used to solve the problem of similarity measurement in time series,which is widely emerged in different fields of science and engineering,such as the field of control,measurement,monitoring,process signal processing and economic analysis.Originality/value–In the similarity measurement of time series,the distance between sequences is often used as the only detection index.The results of similarity measurement should not be affected by the longitudinal or transverse stretching and translation changes of the sequence,so it is necessary to incorporate themorphological changes of the sequence into similarity measurement.The MW-DTW is more suitable for the actual situation.At the same time,the MW-DTW algorithm reduces the computational complexity by transforming the computational object to subsequences.
基金supported by the National Natural Science Foundation of China(62003280,61976120)Chongqing Talents:Exceptional Young Talents Project(cstc2022ycjh-bgzxm0070)+2 种基金Natural Science Foundation of Chongqing(2022NSCQ-MSX2993)Natural Science Key Foundation of Jiangsu Education Department(21KJA510004)Chongqing Overseas Scholars Innovation Program(cx2022024)。
文摘Fidelity plays an important role in quantum information processing,which provides a basic scale for comparing two quantum states.At present,one of the most commonly used fidelities is Uhlmann-Jozsa(U-J)fidelity.However,U-J fidelity needs to calculate the square root of the matrix,which is not trivial in the case of large or infinite density matrices.Moreover,U-J fidelity is a measure of overlap,which has limitations in some cases and cannot reflect the similarity between quantum states well.Therefore,a novel quantum fidelity measure called quantum Tanimoto coefficient(QTC)fidelity is proposed in this paper.Unlike other existing fidelities,QTC fidelity not only considers the overlap between quantum states,but also takes into account the separation between quantum states for the first time,which leads to a better performance of measure.Specifically,we discuss the properties of the proposed QTC fidelity.QTC fidelity is compared with some existing fidelities through specific examples,which reflects the effectiveness and advantages of QTC fidelity.In addition,based on the QTC fidelity,three discrimination coefficients d_(1)^(QTC),d_(2)^(QTC),and d_^(3)^(QTC)are defined to measure the difference between quantum states.It is proved that the discrimination coefficient d_(3)^(QTC)is a true metric.Finally,we apply the proposed QTC fidelity-based discrimination coefficients to measure the entanglement of quantum states to show their practicability.
基金funded by the Deanship of Scientific Research (DSR)at King Abdulaziz University,Jeddah,Saudi Arabia,Under Grant No. (G:146-830-1441).
文摘Content-based medical image retrieval(CBMIR)is a technique for retrieving medical images based on automatically derived image features.There are many applications of CBMIR,such as teaching,research,diagnosis and electronic patient records.Several methods are applied to enhance the retrieval performance of CBMIR systems.Developing new and effective similarity measure and features fusion methods are two of the most powerful and effective strategies for improving these systems.This study proposes the relative difference-based similarity measure(RDBSM)for CBMIR.The new measure was first used in the similarity calculation stage for the CBMIR using an unweighted fusion method of traditional color and texture features.Furthermore,the study also proposes a weighted fusion method for medical image features extracted using pre-trained convolutional neural networks(CNNs)models.Our proposed RDBSM has outperformed the standard well-known similarity and distance measures using two popular medical image datasets,Kvasir and PH2,in terms of recall and precision retrieval measures.The effectiveness and quality of our proposed similarity measure are also proved using a significant test and statistical confidence bound.
文摘Ontologies have been used for several years in life sciences to formally represent concepts and reason about knowledge bases in domains such as the semantic web, information retrieval and artificial intelligence. The exploration of these domains for the correspondence of semantic content requires calculation of the measure of semantic similarity between concepts. Semantic similarity is a measure on a set of documents, based on the similarity of their meanings, which refers to the similarity between two concepts belonging to one or more ontologies. The similarity between concepts is also a quantitative measure of information, calculated based on the properties of concepts and their relationships. This study proposes a method for finding similarity between concepts in two different ontologies based on feature, information content and structure. More specifically, this means proposing a hybrid method using two existing measures to find the similarity between two concepts from different ontologies based on information content and the set of common superconcepts, which represents the set of common parent concepts. We simulated our method on datasets. The results show that our measure provides similarity values that are better than those reported in the literature.
文摘Pattern matching method is one of the classic classifications of existing online portfolio selection strategies. This article aims to study the key aspects of this method—measurement of similarity and selection of similarity sets, and proposes a Portfolio Selection Method based on Pattern Matching with Dual Information of Direction and Distance (PMDI). By studying different combination methods of indicators such as Euclidean distance, Chebyshev distance, and correlation coefficient, important information such as direction and distance in stock historical price information is extracted, thereby filtering out the similarity set required for pattern matching based investment portfolio selection algorithms. A large number of experiments conducted on two datasets of real stock markets have shown that PMDI outperforms other algorithms in balancing income and risk. Therefore, it is suitable for the financial environment in the real world.
基金The Young Teachers Scientific Research Foundation (YTSRF) of Nanjing University of Science and Technology in the Year of2005-2006.
文摘A method that combines category-based and keyword-based concepts for a better information retrieval system is introduced. To improve document clustering, a document similarity measure based on cosine vector and keywords frequency in documents is proposed, but also with an input ontology. The ontology is domain specific and includes a list of keywords organized by degree of importance to the categories of the ontology, and by means of semantic knowledge, the ontology can improve the effects of document similarity measure and feedback of information retrieval systems. Two approaches to evaluating the performance of this similarity measure and the comparison with standard cosine vector similarity measure are also described.
基金Supported by the National Natural Science Foundation of China(42272110)CNPC-China University of Petroleum(Beijing)Strategic Cooperation Project(ZLZX2020-02).
文摘Aiming at the problem that the data-driven automatic correlation methods which are difficult to adapt to the automatic correlation of oil-bearing strata with large changes in lateral sedimentary facies and strata thickness,an intelligent automatic correlation method of oil-bearing strata based on pattern constraints is formed.We propose to introduce knowledge-driven in automatic correlation of oil-bearing strata,constraining the correlation process by stratigraphic sedimentary patterns and improving the similarity measuring machine and conditional constraint dynamic time warping algorithm to automate the correlation of marker layers and the interfaces of each stratum.The application in Shishen 100 block in the Shinan Oilfield of the Bohai Bay Basin shows that the coincidence rate of the marker layers identified by this method is over 95.00%,and the average coincidence rate of identified oil-bearing strata reaches 90.02% compared to artificial correlation results,which is about 17 percentage points higher than that of the existing automatic correlation methods.The accuracy of the automatic correlation of oil-bearing strata has been effectively improved.
文摘This paper presents a fuzzy logic approach to efficiently perform unsupervised character classification for improvement in robustness, correctness and speed of a character recognition system. The characters are first split into eight typographical categories. The classification scheme uses pattern matching to classify the characters in each category into a set of fuzzy prototypes based on a nonlinear weighted similarity function. The fuzzy unsupervised character classification, which is natural in the repre...
基金National Science Foundation of China(Nos.41801388,41901397)。
文摘The existing multi-source contour matching studies have focused on the matching methods with consideration of topological relations and similarity measurement based on spatial Euclidean distance,while it is lack of taking the contour geometric features into account,which may lead to mismatching in map boundaries and areas with intensive contours or extreme terrain changes.In light of this,it is put forward that a matching strategy from coarse to precious based on the contour geometric features.The proposed matching strategy can be described as follows.Firstly,the point sequence is converted to feature sequence according to a feature descriptive function based on curvature and angle of normal vector.Then the level of similarity among multi-source contours is calculated by using the longest common subsequence solution.Accordingly,the identical contours could be matched based on the above calculated results.In the experiment for the proposed method,the reliability and efficiency of the matching method are verified using simulative datasets and real datasets respectively.It has been proved that the proposed contour matching strategy has a high matching precision and good applicability.
文摘The successful face recognition based on local binary pattern(LBP)relies on the effective extraction of LBP features and the inferring of similarity between the extracted features.In this paper,we focus on the latter and propose two novel similarity measures for the local matching methods and the holistic matching methods respectively.One is Earth Mover's Distance with Hamming and Lp ground distance(EMD-HammingLp),which is a cross-bin dissimilarity measure for LBP histograms.The other is IMage Hamming Distance(IMHD),which is a dissimilarity measure for the whole LBP images.Experiments on FERET database show that the proposed two similarity measures outperform the state-of-the-art Chi-square similarity measure for extraction of LBP features.
基金Projects(61273163,61325015,61304121)supported by the National Natural Science Foundation of China
文摘A new modeling and monitoring approach for multi-mode processes is proposed.The method of similarity measure(SM) and kernel principal component analysis(KPCA) are integrated to construct SM-KPCA monitoring scheme,where SM method serves as the separation of common subspace and specific subspace.Compared with the traditional methods,the main contributions of this work are:1) SM consisted of two measures of distance and angle to accommodate process characters.The different monitoring effect involves putting on the different weight,which would simplify the monitoring model structure and enhance its reliability and robustness.2) The proposed method can be used to find faults by the common space and judge which mode the fault belongs to by the specific subspace.Results of algorithm analysis and fault detection experiments indicate the validity and practicability of the presented method.
基金Project(2010-0020163) supported by Priority Research Centers Program through the National Research Foundation of Korea(NRF) funded by the Ministry of Education,Science and Technology
文摘The similarity computations for fuzzy membership function pairs were carried out.Fuzzy number related knowledge was introduced,and conventional similarity was compared with distance based similarity measure.The usefulness of the proposed similarity measure was verified.The results show that the proposed similarity measure could be applied to ordinary fuzzy membership functions,though it was not easy to design.Through conventional results on the calculation of similarity for fuzzy membership pair,fuzzy membership-crisp pair and crisp-crisp pair were carried out.The proposed distance based similarity measure represented rational performance with the heuristic point of view.Furthermore,troublesome in fuzzy number based similarity measure for abnormal universe of discourse case was discussed.Finally,the similarity measure computation for various membership function pairs was discussed with other conventional results.
基金Project(ER120001) supported by Development of Application Technology BioNano Super Composites, Korea
文摘Study of fuzzy entropy and similarity measure on intuitionistic fuzzy sets (IFSs) was proposed and analyzed. Unlike fuzzy set, IFSs contain uncertainty named hesitance, which is contained in fuzzy membership function itself. Hence, designing fuzzy entropy is not easy because of many entropy definitions. By considering different fuzzy entropy definitions, fuzzy entropy on IFSs is designed and discussed. Similarity measure was also presented and its usefulness was verified to evaluate degree of similarity.
基金Project supported by the Second Stage of Brain Korea and Korea Research Foundation
文摘Fault detection technique is introduced with similarity measure. The characteristics of conventional similarity measure based on fuzzy number are discussed. With the help of distance measure, similarity measure is constructed explicitly. The designed distance-based similarity measure is applicable to general fuzzy membership functions including non-convex fuzzy membership function, whereas fuzzy number-based similarity measure has limitation to calculate the similarity of general fuzzy membership functions. The applicability of the proposed similarity measure to general fuzzy membership structures is proven by identifying the definition. To decide fault detection of flight system, the experimental data (pitching moment coefficients and lift coefficients) are transformed into fuzzy membership functions. Distance-based similarity measure is applied to the obtained fuzzy membership functions, and similarity computation and analysis are obtained with the fault and normal operation coefficients.
基金Project(RDF 11-02-03)supported by the Research Development Fund of XJTLU,China
文摘Information analysis of high dimensional data was carried out through similarity measure application. High dimensional data were considered as the a typical structure. Additionally, overlapped and non-overlapped data were introduced, and similarity measure analysis was also illustrated and compared with conventional similarity measure. As a result, overlapped data comparison was possible to present similarity with conventional similarity measure. Non-overlapped data similarity analysis provided the clue to solve the similarity of high dimensional data. Considering high dimensional data analysis was designed with consideration of neighborhoods information. Conservative and strict solutions were proposed. Proposed similarity measure was applied to express financial fraud among multi dimensional datasets. In illustrative example, financial fraud similarity with respect to age, gender, qualification and job was presented. And with the proposed similarity measure, high dimensional personal data were calculated to evaluate how similar to the financial fraud. Calculation results show that the actual fraud has rather high similarity measure compared to the average, from minimal 0.0609 to maximal 0.1667.
基金Project(2010-0020163) supported by Key Research Institute Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology, Korea
文摘Similarity measure design for discrete data group was proposed. Similarity measure design for continuous membership function was also carried out. Proposed similarity measures were designed based on fuzzy number and distance measure, and were proved. To calculate the degree of similarity of discrete data, relative degree between data and total distribution was obtained. Discrete data similarity measure was completed with combination of mentioned relative degrees. Power interconnected system with multi characteristics was considered to apply discrete similarity measure. Naturally, similarity measure was extended to multi-dimensional similarity measure case, and applied to bus clustering problem.