期刊文献+
共找到396篇文章
< 1 2 20 >
每页显示 20 50 100
Performance Analysis of Support Vector Machine (SVM) on Challenging Datasets for Forest Fire Detection
1
作者 Ankan Kar Nirjhar Nath +1 位作者 Utpalraj Kemprai   Aman 《International Journal of Communications, Network and System Sciences》 2024年第2期11-29,共19页
This article delves into the analysis of performance and utilization of Support Vector Machines (SVMs) for the critical task of forest fire detection using image datasets. With the increasing threat of forest fires to... This article delves into the analysis of performance and utilization of Support Vector Machines (SVMs) for the critical task of forest fire detection using image datasets. With the increasing threat of forest fires to ecosystems and human settlements, the need for rapid and accurate detection systems is of utmost importance. SVMs, renowned for their strong classification capabilities, exhibit proficiency in recognizing patterns associated with fire within images. By training on labeled data, SVMs acquire the ability to identify distinctive attributes associated with fire, such as flames, smoke, or alterations in the visual characteristics of the forest area. The document thoroughly examines the use of SVMs, covering crucial elements like data preprocessing, feature extraction, and model training. It rigorously evaluates parameters such as accuracy, efficiency, and practical applicability. The knowledge gained from this study aids in the development of efficient forest fire detection systems, enabling prompt responses and improving disaster management. Moreover, the correlation between SVM accuracy and the difficulties presented by high-dimensional datasets is carefully investigated, demonstrated through a revealing case study. The relationship between accuracy scores and the different resolutions used for resizing the training datasets has also been discussed in this article. These comprehensive studies result in a definitive overview of the difficulties faced and the potential sectors requiring further improvement and focus. 展开更多
关键词 Support Vector Machine Challenging datasets Forest Fire Detection CLASSIFICATION
下载PDF
Empirical Analysis of Neural Networks-Based Models for Phishing Website Classification Using Diverse Datasets
2
作者 Shoaib Khan Bilal Khan +2 位作者 Saifullah Jan Subhan Ullah Aiman 《Journal of Cyber Security》 2023年第1期47-66,共20页
Phishing attacks pose a significant security threat by masquerading as trustworthy entities to steal sensitive information,a problem that persists despite user awareness.This study addresses the pressing issue of phis... Phishing attacks pose a significant security threat by masquerading as trustworthy entities to steal sensitive information,a problem that persists despite user awareness.This study addresses the pressing issue of phishing attacks on websites and assesses the performance of three prominent Machine Learning(ML)models—Artificial Neural Networks(ANN),Convolutional Neural Networks(CNN),and Long Short-Term Memory(LSTM)—utilizing authentic datasets sourced from Kaggle and Mendeley repositories.Extensive experimentation and analysis reveal that the CNN model achieves a better accuracy of 98%.On the other hand,LSTM shows the lowest accuracy of 96%.These findings underscore the potential of ML techniques in enhancing phishing detection systems and bolstering cybersecurity measures against evolving phishing tactics,offering a promising avenue for safeguarding sensitive information and online security. 展开更多
关键词 Artificial neural networks phishing websites network security machine learning phishing datasets CLASSIFICATION
下载PDF
A Comprehensive Analysis of Datasets for Automotive Intrusion Detection Systems
3
作者 Seyoung Lee Wonsuk Choi +2 位作者 InsupKim Ganggyu Lee Dong Hoon Lee 《Computers, Materials & Continua》 SCIE EI 2023年第9期3413-3442,共30页
Recently,automotive intrusion detection systems(IDSs)have emerged as promising defense approaches to counter attacks on in-vehicle networks(IVNs).However,the effectiveness of IDSs relies heavily on the quality of the ... Recently,automotive intrusion detection systems(IDSs)have emerged as promising defense approaches to counter attacks on in-vehicle networks(IVNs).However,the effectiveness of IDSs relies heavily on the quality of the datasets used for training and evaluation.Despite the availability of several datasets for automotive IDSs,there has been a lack of comprehensive analysis focusing on assessing these datasets.This paper aims to address the need for dataset assessment in the context of automotive IDSs.It proposes qualitative and quantitative metrics that are independent of specific automotive IDSs,to evaluate the quality of datasets.These metrics take into consideration various aspects such as dataset description,collection environment,and attack complexity.This paper evaluates eight commonly used datasets for automotive IDSs using the proposed metrics.The evaluation reveals biases in the datasets,particularly in terms of limited contexts and lack of diversity.Additionally,it highlights that the attacks in the datasets were mostly injected without considering normal behaviors,which poses challenges for training and evaluating machine learning-based IDSs.This paper emphasizes the importance of addressing the identified limitations in existing datasets to improve the performance and adaptability of automotive IDSs.The proposed metrics can serve as valuable guidelines for researchers and practitioners in selecting and constructing high-quality datasets for automotive security applications.Finally,this paper presents the requirements for high-quality datasets,including the need for representativeness,diversity,and balance. 展开更多
关键词 Controller area network(CAN) intrusion detection system(IDS) automotive security machine learning(ML) DATASET
下载PDF
Study on the Structure of a Horizontal Shear Line over the Tibetan Plateau Based on CRA-Interim Datasets and Its Comparison with ERA-Interim Datasets
4
作者 姚秀萍 张硕 +2 位作者 包晓红 师春香 刘景卫 《Journal of Tropical Meteorology》 SCIE 2020年第4期483-494,共12页
The CRA-Interim trial production of the global atmospheric reanalysis for 10 years from 2007 to 2016 was carried out by the China Meteorological Administration in 2017. The structural characteristics of the horizontal... The CRA-Interim trial production of the global atmospheric reanalysis for 10 years from 2007 to 2016 was carried out by the China Meteorological Administration in 2017. The structural characteristics of the horizontal shear line over the Tibetan Plateau (TPHSL) based on the CRA-Interim datasets are examined by objectively identifying the shear line, and are compared with the analysis results of the European Centre for Medium-Range Weather Forecasts reanalysis data (ERA-Interim). The case occurred at 18UTC on July 5, 2016. The results show that both of the ERA-Interim and CRA-Interim datasets can well reveal the circulation background and the dynamic and thermal structure characteristics of TPHSL, and they have shown some similar features. The middle and high latitudes at 500 hPa are characterized by the circulation situation of"two troughs and two ridges", and at 200 hPa, the TPHSL is located in the northeast quadrant of the South Asian High Pressure (SAHP). The TPHSL locates in the positive vorticity zone and passes through the positive vorticity center corresponding to the ascending motion. Near the TPHSL, the contours of pseudo-equivalent potential temperature (θse) tend to be intensive, with a high-value center on the south side of the TPHSL. The TPHSL can extend to460 hPa and vertically inclines northward. There is a positive vorticity zone near the TPHSL which is also characterized by the northward inclination with the height, the ascending motion near the TPHSL can extend to 300 hPa, and the atmospheric layer above the TPHSL is stable. However, the intensities of the TPHSL’s structure characteristics analyzed with the two datasets are different, revealing the relatively strong intensity of geopotential height field, vertical velocity field, vorticity field and divergence field from the CRA-Interim datasets. In addition, the vertical profiles of the dynamic and water vapor thermal physical quantities of the two datasets are also consistent in the east and west part of the TPHSL. In summary, the reliable and usable CRA-Interim datasets show excellent properties in the analysis on the structural characteristics of a horizontal shear line over the Tibetan Plateau. 展开更多
关键词 CRA-Interim datasets ERA-Interim datasets horizontal shear line over the Tibetan Plateau structure
下载PDF
The Assessment of Global Surface Temperature Change from 1850s:The C-LSAT2.0 Ensemble and the CMST-Interim Datasets 被引量:8
5
作者 Wenbin SUN Qingxiang LI +6 位作者 Boyin HUANG Jiayi CHENG Zhaoyang SONG Haiyan LI Wenjie DONG Panmao ZHAI Phil JONES 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2021年第5期875-888,共14页
Based on C-LSAT2.0,using high-and low-frequency components reconstruction methods,combined with observation constraint masking,a reconstructed C-LSAT2.0 with 756 ensemble members from the 1850s to 2018 has been develo... Based on C-LSAT2.0,using high-and low-frequency components reconstruction methods,combined with observation constraint masking,a reconstructed C-LSAT2.0 with 756 ensemble members from the 1850s to 2018 has been developed.These ensemble versions have been merged with the ERSSTv5 ensemble dataset,and an upgraded version of the CMSTInterim dataset with 5°×5°resolution has been developed.The CMST-Interim dataset has significantly improved the coverage rate of global surface temperature data.After reconstruction,the data coverage before 1950 increased from 78%−81%of the original CMST to 81%−89%.The total coverage after 1955 reached about 93%,including more than 98%in the Northern Hemisphere and 81%−89%in the Southern Hemisphere.Through the reconstruction ensemble experiments with different parameters,a good basis is provided for more systematic uncertainty assessment of C-LSAT2.0 and CMSTInterim.In comparison with the original CMST,the global mean surface temperatures are estimated to be cooler in the second half of 19th century and warmer during the 21st century,which shows that the global warming trend is further amplified.The global warming trends are updated from 0.085±0.004℃(10 yr)^(–1)and 0.128±0.006℃(10 yr)^(–1)to 0.089±0.004℃(10 yr)^(–1)and 0.137±0.007℃(10 yr)^(–1),respectively,since the start and the second half of 20th century. 展开更多
关键词 C-LSAT2.0 ensemble datasets CMST-Interim EOTs high-and low-frequency components RECONSTRUCTION
下载PDF
Intercomparison of the Extended Reconstructed Sea Surface Temperature v4 and v3b Datasets 被引量:1
6
作者 WANG Jinping CHEN Xianyao 《Journal of Ocean University of China》 SCIE CAS CSCD 2018年第2期209-218,共10页
Version 4(v4) of the Extended Reconstructed Sea Surface Temperature(ERSST) dataset is compared with its precedent, the widely used version 3b(v3b). The essential upgrades applied to v4 lead to remarkable differences i... Version 4(v4) of the Extended Reconstructed Sea Surface Temperature(ERSST) dataset is compared with its precedent, the widely used version 3b(v3b). The essential upgrades applied to v4 lead to remarkable differences in the characteristics of the sea surface temperature(SST) anomaly(SSTa) in both the temporal and spatial domains. First, the largest discrepancy of the global mean SSTa values around the 1940 s is due to ship-observation corrections made to reconcile observations from buckets and engine intake thermometers. Second, differences in global and regional mean SSTa values between v4 and v3b exhibit a downward trend(around-0.032℃ per decade) before the 1940s, an upward trend(around 0.014℃ per decade) during the period of 1950–2015, interdecadal oscillation with one peak around the 1980s, and two troughs during the 1960s and 2000s, respectively. This does not derive from treatments of the polar or the other data-void regions, since the difference of the SSTa does not share the common features. Third, the spatial pattern of the ENSO-related variability of v4 exhibits a wider but weaker cold tongue in the tropical region of the Pacific Ocean compared with that of v3b, which could be attributed to differences in gap-filling assumptions since the latter features satellite observations whereas the former features in situ ones. This intercomparison confirms that the structural uncertainty arising from underlying assumptions on the treatment of diverse SST observations even in the same SST product family is the main source of significant SST differences in the temporal domain. Why this uncertainty introduces artificial decadal oscillations remains unknown. 展开更多
关键词 ERSST datasets SEA surface temperature global WARMING ARCTIC data intercomparison
下载PDF
Generation of Labelled Datasets to Quantify the Impact of Security Threats to Cloud Data Centers 被引量:1
7
作者 Sai Kiran Mukkavilli Sachin Shetty Liang Hong 《Journal of Information Security》 2016年第3期172-184,共13页
Anomaly based approaches in network intrusion detection suffer from evaluation, comparison and deployment which originate from the scarcity of adequate publicly available network trace datasets. Also, publicly availab... Anomaly based approaches in network intrusion detection suffer from evaluation, comparison and deployment which originate from the scarcity of adequate publicly available network trace datasets. Also, publicly available datasets are either outdated or generated in a controlled environment. Due to the ubiquity of cloud computing environments in commercial and government internet services, there is a need to assess the impacts of network attacks in cloud data centers. To the best of our knowledge, there is no publicly available dataset which captures the normal and anomalous network traces in the interactions between cloud users and cloud data centers. In this paper, we present an experimental platform designed to represent a practical interaction between cloud users and cloud services and collect network traces resulting from this interaction to conduct anomaly detection. We use Amazon web services (AWS) platform for conducting our experiments. 展开更多
关键词 Amazon Web Services Anomaly Detection Cloud Computing datasets Intrusion Detection Network Traces
下载PDF
Evaluation of CRU TS, GPCC, AgMERRA, and AgCFSR meteorological datasets for estimating climate and crop variables: A case study of maize in Qazvin Province, Iran
8
作者 Faraz GORGIN PAVEH Hadi RAMEZANI ETEDALI Brian COLLINS 《Journal of Arid Land》 SCIE CSCD 2022年第12期1361-1376,共16页
In the past few decades,meteorological datasets from remote sensing techniques in agricultural and water resources management have been used by various researchers and managers.Based on the literature,meteorological d... In the past few decades,meteorological datasets from remote sensing techniques in agricultural and water resources management have been used by various researchers and managers.Based on the literature,meteorological datasets are not more accurate than synoptic stations,but their various advantages,such as spatial coverage,time coverage,accessibility,and free use,have made these techniques superior,and sometimes we can use them instead of synoptic stations.In this study,we used four meteorological datasets,including Climatic Research Unit gridded Time Series(CRU TS),Global Precipitation Climatology Centre(GPCC),Agricultural National Aeronautics and Space Administration Modern-Era Retrospective Analysis for Research and Applications(AgMERRA),Agricultural Climate Forecast System Reanalysis(AgCFSR),to estimate climate variables,i.e.,precipitation,maximum temperature,and minimum temperature,and crop variables,i.e.,reference evapotranspiration,irrigation requirement,biomass,and yield of maize,in Qazvin Province of Iran during 1980-2009.At first,data were gathered from the four meteorological datasets and synoptic station in this province,and climate variables were calculated.Then,after using the AquaCrop model to calculate the crop variables,we compared the results of the synoptic station and meteorological datasets.All the four meteorological datasets showed strong performance for estimating climate variables.AgMERRA and AgCFSR had more accurate estimations for precipitation and maximum temperature.However,their normalized root mean square error was inferior to CRU for minimum temperature.Furthermore,they were all very efficient for estimating the biomass and yield of maize in this province.For reference evapotranspiration and irrigation requirement CRU TS and GPCC were the most efficient rather than AgMERRA and AgCFSR.But for the estimation of biomass and yield,all the four meteorological datasets were reliable.To sum up,GPCC and AgCFSR were the two best datasets in this study.This study suggests the use of meteorological datasets in water resource management and agricultural management to monitor past changes and estimate recent trends. 展开更多
关键词 climate variables crop variables meteorological datasets precipitation reference evapotranspiration irrigation requirement Iran
下载PDF
Service Life Design for Concrete Engineering in Marine Environments of Northern China Based on a Modified Theoretical Model of Chloride Diffusion and Large Datasets of Ocean Parameters
9
作者 Taotao Feng Hongfa Yu +3 位作者 Yongshan Tan Haiyan Ma Mei Xu Chengjun Yue 《Engineering》 SCIE EI CAS 2022年第10期123-139,共17页
In this study,through experimental research and an investigation on large datasets of the durability parameters in ocean engineering,the values,ranges,and types of distribution of the durability parameters employed fo... In this study,through experimental research and an investigation on large datasets of the durability parameters in ocean engineering,the values,ranges,and types of distribution of the durability parameters employed for the durability design in ocean engineering in northern China were confirmed.Based on a modified theoretical model of chloride diffusion and the reliability theory,the service lives of concrete structures exposed to the splash,tidal,and underwater zones were calculated.Mixed concrete proportions meeting the requirement of a service life of 100 or 120 years were designed,and a cover thickness requirement was proposed.In addition,the effects of the different time-varying relationships of the boundary condition(Cs)and diffusion coefficient(Df)on the service life were compared;the results showed that the time-varying relationships used in this study(i.e.,Cscontinuously increased and then remained stable,and Dfcontinuously decreased and then remained stable)were beneficial for the durability design of concrete structures in marine environment. 展开更多
关键词 Large datasets Modified theoretical model Reliability theory Service life Boundary condition Diffusion coefficient
下载PDF
COMPARISONS OF THE CHARACTERISTICS OF TROPICAL CYCLONES EXPERIENCING EXTRATROPICAL TRANSITION IN THE WESTERN NORTH PACIFIC BASED ON DIFFERENT DATASETS
10
作者 孙红梅 雷小途 +2 位作者 汤杰 姚才 罗小莉 《Journal of Tropical Meteorology》 SCIE 2017年第3期281-291,共11页
The differences in the climatology of extratropical transition(ET) of western North Pacific tropical cyclones(TCs) were investigated in this study using the TCs best-track datasets of China Meteorological Administrati... The differences in the climatology of extratropical transition(ET) of western North Pacific tropical cyclones(TCs) were investigated in this study using the TCs best-track datasets of China Meteorological Administration(CMA),Japan Meteorological Agency(JMA) and the Joint Typhoon Warning Center(JTWC). The results show that the ET identification, ET completion time, and post-ET duration reported in the JTWC dataset are greatly different from those in CMA and JMA datasets during 2004-2010. However, the key differences between the CMA and JMA datasets from 1951 to 2010 are the ET identification and the post-ET duration, because of inconsistent objective ET criteria used in the centers. Further analysis indicates that annual ET percentage of CMA was lower than that of JMA, and exhibited an interannual decreasing trend, while that of JMA was an unchanged trend. The western North Pacific ET events occurred mainly during the period June to November. The latitude of ET occurrence shifted northward from February to August,followed by a southward shift. Most of ET events were observed between 35°N and 45°N. From a regional perspective,TCs tended to undergo ET in Japan and the ocean east to it. It is found that TCs which experienced the ET process at higher latitudes were generally more intense at the ET completion time. TCs completing the ET overland or offshore were weaker than those finishing the ET over the ocean. Most of the TCs weakened 24 h before the completion of ET.In contrast, 21%(27%) of the TCs showed an intensification process based on the CMA(JMA) dataset during the post-ET period. The results presented in this study indicate that consistent ET determination criteria are needed to reduce the uncertainty involved in ET identification among the centers. 展开更多
关键词 Western North Pacific different datasets tropical cyclone extratropical transition climatic differences comparative analysis
下载PDF
Effectiveness of predicting tunneling-induced ground settlements using machine learning methods with small datasets 被引量:5
11
作者 Linan Liu Wendy Zhou Marte Gutierrez 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2022年第4期1028-1041,共14页
Prediction of tunneling-induced ground settlements is an essential task,particularly for tunneling in urban settings.Ground settlements should be limited within a tolerable threshold to avoid damages to aboveground st... Prediction of tunneling-induced ground settlements is an essential task,particularly for tunneling in urban settings.Ground settlements should be limited within a tolerable threshold to avoid damages to aboveground structures.Machine learning(ML)methods are becoming popular in many fields,including tunneling and underground excavations,as a powerful learning and predicting technique.However,the available datasets collected from a tunneling project are usually small from the perspective of applying ML methods.Can ML algorithms effectively predict tunneling-induced ground settlements when the available datasets are small?In this study,seven ML methods are utilized to predict tunneling-induced ground settlement using 14 contributing factors measured before or during tunnel excavation.These methods include multiple linear regression(MLR),decision tree(DT),random forest(RF),gradient boosting(GB),support vector regression(SVR),back-propagation neural network(BPNN),and permutation importancebased BPNN(PI-BPNN)models.All methods except BPNN and PI-BPNN are shallow-structure ML methods.The effectiveness of these seven ML approaches on small datasets is evaluated using model accuracy and stability.The model accuracy is measured by the coefficient of determination(R2)of training and testing datasets,and the stability of a learning algorithm indicates robust predictive performance.Also,the quantile error(QE)criterion is introduced to assess model predictive performance considering underpredictions and overpredictions.Our study reveals that the RF algorithm outperforms all the other models with the highest model prediction accuracy(0.9)and stability(3.0210^(-27)).Deep-structure ML models do not perform well for small datasets with relatively low model accuracy(0.59)and stability(5.76).The PI-BPNN architecture is proposed and designed for small datasets,showing better performance than typical BPNN.Six important contributing factors of ground settlements are identified,including tunnel depth,the distance between tunnel face and surface monitoring points(DTM),weighted average soil compressibility modulus(ACM),grouting pressure,penetrating rate and thrust force. 展开更多
关键词 Ground settlements TUNNELING Machine learning Small dataset Model accuracy Model stability Feature importance
下载PDF
Evaluating data-driven algorithms for predicting mechanical properties with small datasets:A case study on gear steel hardenability 被引量:1
12
作者 Bogdan Nenchev Qing Tao +4 位作者 Zihui Dong Chinnapat Panwisawas Haiyang Li Biao Tao Hongbiao Dong 《International Journal of Minerals,Metallurgy and Materials》 SCIE EI CAS CSCD 2022年第4期836-847,共12页
Data-driven algorithms for predicting mechanical properties with small datasets are evaluated in a case study on gear steel hardenability.The limitations of current data-driven algorithms and empirical models are iden... Data-driven algorithms for predicting mechanical properties with small datasets are evaluated in a case study on gear steel hardenability.The limitations of current data-driven algorithms and empirical models are identified.Challenges in analysing small datasets are discussed,and solution is proposed to handle small datasets with multiple variables.Gaussian methods in combination with novel predictive algorithms are utilized to overcome the challenges in analysing gear steel hardenability data and to gain insight into alloying elements interaction and structure homogeneity.The gained fundamental knowledge integrated with machine learning is shown to be superior to the empirical equations in predicting hardenability.Metallurgical-property relationships between chemistry,sample size,and hardness are predicted via two optimized machine learning algorithms:neural networks(NNs)and extreme gradient boosting(XGboost).A comparison is drawn between all algorithms,evaluating their performance based on small data sets.The results reveal that XGboost has the highest potential for predicting hardenability using small datasets with class imbalance and large inhomogeneity issues. 展开更多
关键词 machine learning small dataset XGboost HARDENABILITY gear steel
下载PDF
Evaluation of Daily Gridded Meteorological Datasets over the Niger Delta Region of Nigeria and Implication to Water Resources Management
13
作者 Ibrahim Hassan Robert M. Kalin +1 位作者 Christopher J. White Jamiu A. Aladejana 《Atmospheric and Climate Sciences》 2020年第1期21-39,共19页
Hydro-climatological study is difficult in most of the developing countries due to the paucity of monitoring stations. Gridded climatological data provides an opportunity to extrapolate climate to areas without monito... Hydro-climatological study is difficult in most of the developing countries due to the paucity of monitoring stations. Gridded climatological data provides an opportunity to extrapolate climate to areas without monitoring stations based on their ability to replicate the Spatio-temporal distribution and variability of observed datasets. Simple correlation and error analyses are not enough to predict the variability and distribution of precipitation and temperature. In this study, the coefficient of correlation (R2), Root mean square error (RMSE), mean bias error (MBE) and mean wet and dry spell lengths were used to evaluate the performance of three widely used daily gridded precipitation, maximum and minimum temperature datasets from the Climatic Research Unit (CRU), Princeton University Global Meteorological Forcing (PGF) and Climate Forecast System Reanalysis (CFSR) datasets available over the Niger Delta part of Nigeria. The Standardised Precipitation Index was used to assess the confidence of using gridded precipitation products on water resource management. Results of correlation, error, and spell length analysis revealed that the CRU and PGF datasets performed much better than the CFSR datasets. SPI values also indicate a good association between station and CRU precipitation products. The CFSR datasets in comparison with the other data products in many years overestimated and underestimated the SPI. This indicates weak accuracy in predictability, hence not reliable for water resource management in the study area. However, CRU data products were found to perform much better in most of the statistical assessments conducted. This makes the methods used in this study to be useful for the assessment of various gridded datasets in various hydrological and climatic applications. 展开更多
关键词 CLIMATE Research Unit (CRU) Princeton University Global METEOROLOGICAL FORCING Dataset (PGF) CLIMATE Forecast System REANALYSIS (CFSR) Standardised Precipitation Index (SPI)
下载PDF
Exploring Variational Auto-encoder Architectures, Configurations, and Datasets for Generative Music Explainable AI
14
作者 Nick Bryan-Kinns Bingyuan Zhang +1 位作者 Songyan Zhao Berker Banar 《Machine Intelligence Research》 EI CSCD 2024年第1期29-45,共17页
Generative AI models for music and the arts in general are increasingly complex and hard to understand.The field of ex-plainable AI(XAI)seeks to make complex and opaque AI models such as neural networks more understan... Generative AI models for music and the arts in general are increasingly complex and hard to understand.The field of ex-plainable AI(XAI)seeks to make complex and opaque AI models such as neural networks more understandable to people.One ap-proach to making generative AI models more understandable is to impose a small number of semantically meaningful attributes on gen-erative AI models.This paper contributes a systematic examination of the impact that different combinations of variational auto-en-coder models(measureVAE and adversarialVAE),configurations of latent space in the AI model(from 4 to 256 latent dimensions),and training datasets(Irish folk,Turkish folk,classical,and pop)have on music generation performance when 2 or 4 meaningful musical at-tributes are imposed on the generative model.To date,there have been no systematic comparisons of such models at this level of com-binatorial detail.Our findings show that measureVAE has better reconstruction performance than adversarialVAE which has better musical attribute independence.Results demonstrate that measureVAE was able to generate music across music genres with inter-pretable musical dimensions of control,and performs best with low complexity music such as pop and rock.We recommend that a 32 or 64 latent dimensional space is optimal for 4 regularised dimensions when using measureVAE to generate music across genres.Our res-ults are the first detailed comparisons of configurations of state-of-the-art generative AI models for music and can be used to help select and configure AI models,musical features,and datasets for more understandable generation of music. 展开更多
关键词 Variational auto-encoder explainable AI(XAI) generative music musical features datasets
原文传递
Multi-temporal urban semantic understanding based on GF-2 remote sensing imagery:from tri-temporal datasets to multi-task mapping
15
作者 Sunan Shi Yanfei Zhong +6 位作者 Yinhe Liu Jue Wang Yuting Wan Ji Zhao Pengyuan Lv Liangpei Zhang Deren Li 《International Journal of Digital Earth》 SCIE EI 2023年第1期3321-3347,共27页
High resolution satellite images are becoming increasingly available for urban multi-temporal semantic understanding.However,few datasets can be used for land-use/land-cover(LULC)classification,binary change detection... High resolution satellite images are becoming increasingly available for urban multi-temporal semantic understanding.However,few datasets can be used for land-use/land-cover(LULC)classification,binary change detection(BCD)and semantic change detection(SCD)simultaneously because classification datasets always have one time phase and BCD datasets focus only on the changed location,ignoring the changed classes.Public SCD datasets are rare but much needed.To solve the above problems,a tri-temporal SCD dataset made up of Gaofen-2(GF-2)remote sensing imagery(with 11 LULC classes and 60 change directions)was built in this study,namely,the Wuhan Urban Semantic Understanding(WUSU)dataset.Popular deep learning based methods for LULC classification,BCD and SCD are tested to verify the reliability of WUSU.A Siamese-based multi-task joint framework with a multi-task joint loss(MJ loss)named ChangeMJ is proposed to restore the object boundaries and obtains the best results in LULC classification,BCD and SCD,compared to the state-of-the-art(SOTA)methods.Finally,a large spatial-scale mapping for Wuhan central urban area is carried out to verify that the WUsU dataset and the ChangeMJ framework have good application values. 展开更多
关键词 GF-2 remote sensing imagery multi-temporal satellite datasets urban LULC mapping binary and semantic change detection multi-task framework
原文传递
An Improved Binary Quantum-based Avian Navigation Optimizer Algorithm to Select Effective Feature Subset from Medical Data:A COVID-19 Case Study
16
作者 Ali Fatahi Mohammad H.Nadimi-Shahraki Hoda Zamani 《Journal of Bionic Engineering》 SCIE EI CSCD 2024年第1期426-446,共21页
Feature Subset Selection(FSS)is an NP-hard problem to remove redundant and irrelevant features particularly from medical data,and it can be effectively addressed by metaheuristic algorithms.However,existing binary ver... Feature Subset Selection(FSS)is an NP-hard problem to remove redundant and irrelevant features particularly from medical data,and it can be effectively addressed by metaheuristic algorithms.However,existing binary versions of metaheuristic algorithms have issues with convergence and lack an effective binarization method,resulting in suboptimal solutions that hinder diagnosis and prediction accuracy.This paper aims to propose an Improved Binary Quantum-based Avian Navigation Optimizer Algorithm(IBQANA)for FSS in medical data preprocessing to address the suboptimal solutions arising from binary versions of metaheuristic algorithms.The proposed IBQANA’s contributions include the Hybrid Binary Operator(HBO)and the Distance-based Binary Search Strategy(DBSS).HBO is designed to convert continuous values into binary solutions,even for values outside the[0,1]range,ensuring accurate binary mapping.On the other hand,DBSS is a two-phase search strategy that enhances the performance of inferior search agents and accelerates convergence.By combining exploration and exploitation phases based on an adaptive probability function,DBSS effectively avoids local optima.The effectiveness of applying HBO is compared with five transfer function families and thresholding on 12 medical datasets,with feature numbers ranging from 8 to 10,509.IBQANA's effectiveness is evaluated regarding the accuracy,fitness,and selected features and compared with seven binary metaheuristic algorithms.Furthermore,IBQANA is utilized to detect COVID-19.The results reveal that the proposed IBQANA outperforms all comparative algorithms on COVID-19 and 11 other medical datasets.The proposed method presents a promising solution to the FSS problem in medical data preprocessing. 展开更多
关键词 Feature subset selection Optimization Binary metaheuristic algorithms BIOINSPIRED Machine learning Medical datasets
下载PDF
Towards combining multiple geophysical datasets to determine earthquake source parameters in China 被引量:6
17
作者 ZHENG Yong LIU ChengLi 《Science China Earth Sciences》 SCIE EI CAS CSCD 2016年第11期2260-2262,共3页
1.Difficulties of conventional seismic studies on earthquake source parameters Earthquake source parameters,including magnitude,location,focal mechanism,rupture process are key factors for understanding seismogenic en... 1.Difficulties of conventional seismic studies on earthquake source parameters Earthquake source parameters,including magnitude,location,focal mechanism,rupture process are key factors for understanding seismogenic environment,mitigating seismic hazards,estimating earthquake triggering,and tectonic analysis.Traditionally,source parameters are determined by seismological methods.For example,Fang L H et al.(2014)relocated the 2012 Ms6.6 Xinjiang Xinyuan earthquake sequence using local seismograms based on the double difference method,and obtained the distribution of 展开更多
关键词 INSAR GPS Towards combining multiple geophysical datasets to determine earthquake source parameters in China high data rate
原文传递
Methodology for credibility assessment of historical global LUCC datasets 被引量:4
18
作者 Xiuqi FANG Wanyi ZHAO +4 位作者 Chengpeng ZHANG Diyang ZHANG Xueqiong WEI Weili QIU Yu YE 《Science China Earth Sciences》 SCIE EI CAS CSCD 2020年第7期1013-1025,共13页
Land use-induced land cover change(LUCC)is an important anthropogenic driving force of global change that has influenced,and is still influencing,many aspects of regional and global environments.Accurate historical gl... Land use-induced land cover change(LUCC)is an important anthropogenic driving force of global change that has influenced,and is still influencing,many aspects of regional and global environments.Accurate historical global land use/cover datasets are essential for a better understanding of the impacts of LUCC on global change.However,there are not only evident inconsistencies in current historical global land use/cover datasets,but inaccuracies in the data in these global dataset revealed by historical record-based reconstructed regional data throughout the world.A focus in historical LUCC and global change research relates to how the accuracy of historical global land cover datasets can be improved.A methodology for assessing the credibility of existing historical global land cover datasets that addresses temporal as well as spatial changes in the amount and distribution of land cover is therefore needed.Theoretically,the credibility of a global land cover dataset could be assessed by comparing similarities or differences in the data according to actual land cover data(the"true value").However,it is extremely difficult to obtain historical evidence for assessing the credibility of historical global land cover datasets,which cannot be verified through field sampling like contemporary global land cover datasets.We proposed a methodological framework for assessing the credibility of global land cover datasets.Considering the types and characteristics of the available evidence used for assessments,we outlined four methodological approaches:(1)accuracy assessment based on regional quantitative reconstructed land cover data,(2)rationality assessment based on regional historical facts,(3)rationality assessment based on expertise,and(4)likelihood assessment based on the consistency of multiple datasets.These methods were illustrated through five case studies of credibility assessments of historical cropland cover data.This framework can also be applied in assessments of other land cover types,such as forest and grassland. 展开更多
关键词 LUCC Global datasets CREDIBILITY Assessment methodology
原文传递
Cloud-based parallel power flow calculation using resilient distributed datasets and directed acyclic graph 被引量:3
19
作者 Dewen WANG Fangfang ZHOU Jiangman LI 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2019年第1期65-77,共13页
With the integration of distributed generation and the construction of cross-regional long-distance power grids, power systems become larger and more complex.They require faster computing speed and better scalability ... With the integration of distributed generation and the construction of cross-regional long-distance power grids, power systems become larger and more complex.They require faster computing speed and better scalability for power flow calculations to support unit dispatch.Based on the analysis of a variety of parallelization methods, this paper deploys the large-scale power flow calculation task on a cloud computing platform using resilient distributed datasets(RDDs).It optimizes a directed acyclic graph that is stored in the RDDs to solve the low performance problem of the MapReduce model.This paper constructs and simulates a power flow calculation on a large-scale power system based on standard IEEE test data.Experiments are conducted on Spark cluster which is deployed as a cloud computing platform.They show that the advantages of this method are not obvious at small scale, but the performance is superior to the stand-alone model and the MapReduce model for large-scale calculations.In addition, running time will be reduced when adding cluster nodes.Although not tested under practical conditions, this paper provides a new way of thinking about parallel power flow calculations in large-scale power systems. 展开更多
关键词 Power flow calculation PARALLEL programming MODEL DISTRIBUTED memory-shared MODEL Resilient DISTRIBUTED datasets(RDDs) Directed ACYCLIC graph(DAG)
原文传递
基于全局注意力多任务网络方法的CT图像细小骨折检测研究
20
作者 李瑞瑞 杨晓光 +1 位作者 孙世豪 季尚蔚 《中国医学装备》 2024年第3期12-18,共7页
目的:通过全局注意力多任务网络提升CT图像细小骨折检测的感知,通过多任务实现实例级别细小骨折目标的检测,快速、准确地从大量CT图像中识别并定位骨折,以辅助临床及时开展治疗。方法:引入分组非局部(non-local)网络方法,计算CT图像连... 目的:通过全局注意力多任务网络提升CT图像细小骨折检测的感知,通过多任务实现实例级别细小骨折目标的检测,快速、准确地从大量CT图像中识别并定位骨折,以辅助临床及时开展治疗。方法:引入分组非局部(non-local)网络方法,计算CT图像连续切片任何位置和通道之间的远程依赖关系,将多目标检测模型3D RetinaNet单级检测器与医学图像语义分割(3D U-Net)架构相融合,实现端到端的多任务3D卷积网络,以多任务联合的方式实现对细小骨折的实例级别检测。选择医学图像计算与计算机辅助干预(MICCAI)2020挑战赛提供的肋骨骨折公开数据集(Rib Frac Dataset)600例CT扫描图像,通过5∶1的比例划分为训练集(500例)和验证集(100例),测试多任务3D卷积网络的精度性能。结果:多任务3D卷积网络方法的检测精度性能优于单任务网络FracNet、3D RetinaNet及3D Retina U-Net,其平均精度与3D RetinaNet和3D Retina U-Net网络相比分别高出7.8%和11.4%,且优于3D Faster R-CNN、3D Mask R-CNN两种单任务网络检测方法,平均精度分别高出约6.7%和3.1%。结论:全局注意力多任务网络融合不同模块,对于细小骨折检测性能均有提升,引入分组非局部(Non-local)网络方法能够进一步提升对细小骨折目标的检测精度性能。 展开更多
关键词 三维卷积神经网络 全局注意力 多任务网络 非局部 CT图像 肋骨骨折数据集(RibFrac Dataset)
下载PDF
上一页 1 2 20 下一页 到第
使用帮助 返回顶部