Abstract: With the rapid development of the global economy, maritime transportation has become much more convenient thanks to large capacities and low freight rates. However, sea lanes are becoming increasingly crowded, leading to a high probability of marine accidents in complex maritime environments. According to historical statistics, a large number of accidents have happened in water areas that lack the high-precision navigation data that could be used to enhance navigation safety. The purpose of this work was to carry out ship route planning automatically by mining big historical automatic identification system (AIS) data. Experiential navigation information hidden in maritime big data can be extracted automatically using advanced data mining techniques, assisting in the generation of safe and reliable planned ship routes for complex maritime environments. In this paper, a novel method is proposed to construct a big data-driven framework for generating planned ship routes automatically under varying navigation conditions. The method first performs density-based spatial clustering of applications with noise (DBSCAN) on a large number of ship trajectories to form different trajectory vector clusters. Then, it iteratively calculates the centerline of each trajectory vector cluster and constructs the waterway network from the node-arc topology relationships among these centerlines. Shipping routes are generated from the waterway network and, for sea areas not covered by the network, by rasterizing the marine environment risks. Numerous experiments have been conducted on different AIS data sets in different water areas, and the results demonstrate the effectiveness of the proposed ship route planning framework.
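The clustering step named in the abstract above can be illustrated with a minimal DBSCAN sketch. This is not the authors' implementation: it clusters plain 2D points with Euclidean distance rather than trajectory vectors, and the function name `dbscan` and the parameters `eps` and `min_pts` are assumptions chosen for illustration.

```python
from math import hypot


def dbscan(points, eps, min_pts):
    """Label each 2D point with a cluster id (0, 1, ...) or -1 for noise.

    Illustrative O(n^2) version: neighbors are found by a linear scan.
    A point counts as its own neighbor, as in the usual formulation.
    """
    unvisited, noise = None, -1
    labels = [unvisited] * len(points)

    def neighbors(i):
        xi, yi = points[i]
        return [j for j, (xj, yj) in enumerate(points)
                if hypot(xi - xj, yi - yj) <= eps]

    cluster = 0
    for i in range(len(points)):
        if labels[i] is not unvisited:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = noise          # may later be claimed as a border point
            continue
        labels[i] = cluster            # i is a core point: start a new cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == noise:
                labels[j] = cluster    # border point reached from a core point
            if labels[j] is not unvisited:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:
                queue.extend(reachable)  # j is also a core point: keep expanding
        cluster += 1
    return labels
```

Applied to ship data, each point would be replaced by a trajectory feature vector and the distance function by a measure suited to position and heading; those choices are left open here because the abstract does not specify them.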
Abstract: With the deployment of modern infrastructure for public transportation, several studies have analyzed the movement patterns of people using smart card data and have characterized different areas. In this paper, we propose the “movement purpose hypothesis” that each movement arises from two causes: where the person is and what the person wants to do at a given moment. We formulate this hypothesis as a synthesis model in which two network graphs generate a movement network graph. We then develop two novel embedding models to assess the hypothesis, and demonstrate that the models obtain a vector representation of a geospatial area using the movement patterns of people from large-scale smart card data. We conducted an experiment using smart card data for a large network of railroads in the Kansai region of Japan. We obtained a vector representation of each railroad station and each purpose using the developed embedding models. Results show that network embedding methods are suitable for large-scale movement data, and that the developed models perform better than existing embedding methods in the task of multi-label classification of train stations on the purpose-of-use data set. Our proposed models can contribute to the prediction of people flows by discovering underlying representations of geospatial areas from mobility data.
Funding: This research was supported financially by the EU FP7 Marie Curie Initial Training Network MULTI-POS (Multi-technology Positioning Professionals) [grant number 316528].
Abstract: OpenStreetMap (OSM) data are widely used, but their reliability is still variable. Many contributors to OSM have not been trained in geography or surveying, and consequently their contributions, including geometry and attribute data inserts, deletions, and updates, can be inaccurate, incomplete, inconsistent, or vague. There are some mechanisms and applications dedicated to discovering bugs and errors in OSM data. Such systems can remove errors through user checks and by applying predefined rules, but they need an extra control process to check the real-world validity of suspected errors and bugs. This paper focuses on finding bugs and errors based on patterns and rules extracted from the tracking data of users. The underlying idea is that certain characteristics of user trajectories are directly linked to the type of feature. Using such rules, sets of potential bugs and errors can be identified and stored for further investigation.
Funding: Supported by the National Natural Science Foundation of China (41871320, 61873316), the Key Project of Hunan Provincial Education Department (19A172), the Scientific Research Fund of Hunan Provincial Education Department (18K060), and the Postgraduate Scientific Research Innovation Project of Hunan Province (CX20211000).
Abstract: Taxi pick-up area recommendation based on short-term GPS data can improve efficiency and reduce overheads. However, alleviating data sparsity and further enhancing accuracy remain challenging. To address these issues, we propose to fuse spatio-temporal contexts into a deep factorization machine (STC_DeepFM) offline for pick-up area recommendation, and to recommend pick-up points within the area online using a factorization machine (FM). First, we divide the urban area into grids of equal size. Spatio-temporal contexts are distilled from the pick-up points or points of interest (POIs) belonging to the preceding grids. Second, the contexts are integrated into a deep factorization machine (DeepFM) to mine high-order interaction relationships from the grids, and a novel algorithm named STC_DeepFM is presented for offline pick-up area recommendation. Third, we devise an offline-to-online (O2O) recommendation architecture based on the DeepFM and FM models, respectively, in order to trade off accuracy and efficiency. Experiments are designed on the DiDi dataset to evaluate, step by step, the performance of the spatio-temporal contexts, different recommendation models, and the O2O architecture. The results show that the proposed STC_DeepFM algorithm outperforms several state-of-the-art methods, and that the O2O architecture achieves excellent real-time performance.
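The first step described above, dividing the urban area into equal-size grids, can be sketched as follows. This is only an illustration under stated assumptions: the function names `grid_cell` and `pickup_counts`, the `origin` corner, and the `cell_size` in coordinate degrees are hypothetical and do not come from the paper.

```python
from collections import Counter
from math import floor


def grid_cell(lon, lat, origin, cell_size):
    """Return the (column, row) index of the equal-size cell containing a point.

    origin is the (lon, lat) of the grid's lower-left corner (an assumption);
    cell_size is the cell edge length in the same coordinate units.
    """
    lon0, lat0 = origin
    return (floor((lon - lon0) / cell_size), floor((lat - lat0) / cell_size))


def pickup_counts(pickups, origin, cell_size):
    """Count historical pick-up points per grid cell."""
    return Counter(grid_cell(lon, lat, origin, cell_size) for lon, lat in pickups)
```

Per-cell counts like these are one simple context feature; the spatio-temporal contexts actually fed to STC_DeepFM are not detailed in the abstract, so nothing beyond the gridding itself is implied here.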
Funding: Funded by the National Natural Science Foundation of China (61872139, 41871320), the Provincial and Municipal Joint Fund of Hunan Provincial Natural Science Foundation of China (2018JJ4052), the Hunan Provincial Natural Science Foundation of China (2017JJ2081), the Key Project of Hunan Provincial Education Department (17A070, 19A172), and the Project of Hunan Provincial Education Department (17C0646).
Abstract: Cruising route recommendation based on trajectory mining can improve taxi drivers' income and reduce energy consumption. However, existing methods mostly recommend only pick-up points for taxis. Moreover, their performance is limited because there is no good evaluation model for the pick-up points. Therefore, we propose an entropy-based model for recommending taxi cruising routes. First, we select more positional attributes from historical pick-up points in order to obtain accurate spatial-temporal features. Second, the information entropy of the spatial-temporal features is integrated into the evaluation model. The model is then applied to obtain the next pick-up point and to recommend a series of successive points. These points form a cruising route for taxi drivers. Experimental results show that our method markedly improves the recommendation accuracy of pick-up points and helps taxi drivers earn more than before.
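The information entropy mentioned above is, in its standard Shannon form, H = -Σ p_i log2 p_i over the observed distribution of a feature. The sketch below computes it from raw counts, for example pick-ups per time slot at one candidate point. How the authors weight or combine entropy inside their evaluation model is not stated in the abstract, so the function `shannon_entropy` is only a generic building block, not their model.

```python
from math import log2


def shannon_entropy(counts):
    """Shannon entropy (in bits) of the distribution given by non-negative counts.

    Zero counts contribute nothing, following the convention 0 * log(0) = 0.
    """
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must contain at least one positive value")
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)
```

A point whose pick-ups are spread evenly over four time slots has entropy 2 bits, while a point whose pick-ups all fall in a single slot has entropy 0, so the value summarizes how concentrated the demand is.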
Funding: This project was funded by the National Natural Science Foundation of China (61872139, 41871320), the Provincial and Municipal Joint Fund of Hunan Provincial Natural Science Foundation of China (2018JJ4052), the Hunan Provincial Natural Science Foundation of China (2017JJ2081), the Key Project of Hunan Provincial Education Department (17A070, 19A172), and the Project of Hunan Provincial Education Department (17C0646).
Abstract: Road network extraction is vital to both vehicle navigation and road planning. Existing approaches focus on mining urban trunk roads from the GPS trajectories of floating cars. However, path extraction, which plays an important role in earthquake relief and village tourism, has been largely ignored. To address this issue, we propose a novel approach for extracting a campus road network from walking GPS trajectories. It consists of data preprocessing and road centerline generation. Patrolling GPS trajectories collected at Hunan University of Science and Technology were used as the experimental data. The experimental evaluation shows that our approach effectively and accurately extracts both the campus trunk roads and the paths. The coverage rate is 96.21%, while the error rate is 3.26%.
Abstract: Nowadays, human activities and movements are recorded by a variety of tools, forming different trajectory sets that are usually isolated from one another. It is therefore very important to link the different trajectories of one person across sets, to provide rich information for trajectory mining tasks. Most prior work took advantage of only one dimension of information to link trajectories and linked them in a one-to-many manner (providing several candidate trajectories to link to one specific trajectory). In this paper, we propose a novel approach called one-to-one constraint trajectory linking with multi-dimensional information (OCTL), which links the corresponding trajectories of one person in different sets in a one-to-one manner. We extract multidimensional features from different trajectory datasets for predicting correspondence relationships, including spatial, temporal, and spatio-temporal information, which jointly describe the relationships between trajectories. Using these features, we calculate the correspondence probabilities between trajectories in different datasets. We then formulate the link inference problem as a bipartite graph matching problem and employ effective methods to link one trajectory to another. The advantages of our approach are empirically verified on two real-world trajectory sets with convincing results.
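The one-to-one constraint described above can be illustrated with a small bipartite matching sketch: given a matrix of correspondence probabilities, choose the assignment that maximizes the total probability while using each trajectory at most once. The abstract does not say which matching method the authors use; the exhaustive search below is a stand-in that is only practical for a handful of trajectories, and the name `link_one_to_one` is hypothetical.

```python
from itertools import permutations


def link_one_to_one(prob):
    """Find the one-to-one linking with the highest total probability.

    prob[i][j] is the correspondence probability between trajectory i of
    set A and trajectory j of set B, with len(A) <= len(B).
    Returns (links, score), where links is a list of (i, j) pairs.
    Exhaustive search: factorial cost, suitable for tiny examples only.
    """
    n_a = len(prob)
    n_b = len(prob[0]) if prob else 0
    if n_a > n_b:
        raise ValueError("set A must not be larger than set B")
    best_score, best_perm = float("-inf"), ()
    for perm in permutations(range(n_b), n_a):
        score = sum(prob[i][j] for i, j in enumerate(perm))
        if score > best_score:
            best_score, best_perm = score, perm
    return list(enumerate(best_perm)), best_score
```

With the matrix `[[0.9, 0.8, 0.1], [0.85, 0.2, 0.1], [0.1, 0.3, 0.7]]`, linking each row to its individually best column would send rows 0 and 1 to the same column; the one-to-one solution instead pairs 0 with 1, 1 with 0, and 2 with 2. For realistic data sizes a polynomial-time assignment algorithm would replace the exhaustive loop.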
Funding: Innovation Project of LREIS, No. KPI003; National Natural Science Foundation of China, No. 42101423; Strategic Priority Research Program of the Chinese Academy of Sciences, No. XDA23010202.
Abstract: Identifying the spatiotemporal interaction pattern of agricultural product circulation (APC) is crucial for agricultural resource adjustment and food security. Current studies are mostly based on static statistical data over an entire year or a specific period, which cannot describe the spatial pattern of APC and its seasonal variation on a fine spatiotemporal scale. This study therefore extracts APC trip chains from national truck trajectory data and constructs the flow network of the Beijing APC, with the city as the spatial unit and the season as the temporal unit. The spatial interaction pattern and seasonal variation in APC are then analyzed in terms of the network spatial form, the functional roles of city nodes, and transportation corridors. The results are as follows: (1) Compared with methods based on static statistical data, the proposed method provides a more complete and refined depiction of the spatiotemporal interaction pattern of APC. (2) The flow network of the Beijing APC involves 316 cities in China, of which 143 play a major role with typical seasonal characteristics. These cities can be divided into perennial core cities, perennial major cities, core cities in winter-spring, major cities in winter-spring, core cities in summer-autumn, and major cities in summer-autumn, contributing 2.6%-40.3% to the Beijing APC. (3) There are 6 transportation corridors for the Beijing APC. The Beijing-Tianjin-Hebei corridor and the coastal corridor contribute 53.5% and 12.8% of the annual supply, respectively, with a balanced supply in all seasons. The Beijing-Kunming corridor and the Beijing-Guangzhou corridor contribute 14.3% and 9.0%, respectively, with much higher supplies in winter and spring. The northeast and northwest corridors contribute 7.3% and 3.3%, respectively, mainly in summer and autumn. These results help deepen the understanding of agricultural product supply patterns and provide a reference for the design and optimization of agricultural product transportation routes.
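The flow network described above, with the city as the spatial unit and the season as the temporal unit, can be sketched as a simple aggregation of trips into (origin city, season) edges. Everything specific in this sketch is an assumption made for illustration: the trip tuple layout, the function names, and the month-to-season mapping (December to February as winter, and so on) are not taken from the paper.

```python
from collections import defaultdict

# Assumed meteorological seasons; the paper's own season definition is not given.
_SEASON_MONTHS = {
    "winter": (12, 1, 2),
    "spring": (3, 4, 5),
    "summer": (6, 7, 8),
    "autumn": (9, 10, 11),
}
SEASON_OF_MONTH = {m: s for s, months in _SEASON_MONTHS.items() for m in months}


def build_flow_network(trips, destination="Beijing"):
    """Aggregate trips into edge weights keyed by (origin city, season).

    Each trip is a hypothetical tuple (origin, dest, month, volume).
    Trips to other destinations are ignored.
    """
    flows = defaultdict(float)
    for origin, dest, month, volume in trips:
        if dest == destination:
            flows[(origin, SEASON_OF_MONTH[month])] += volume
    return dict(flows)


def supply_share(flows):
    """Return each origin city's share of the total annual supply."""
    total = sum(flows.values())
    by_city = defaultdict(float)
    for (city, _season), volume in flows.items():
        by_city[city] += volume
    return {city: volume / total for city, volume in by_city.items()}
```

Shares computed this way per city, or per group of cities along a corridor, correspond in spirit to the contribution percentages reported in the abstract, although the study's actual trip-chain extraction and corridor definitions are more involved than this toy aggregation.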