期刊文献+

Trajectory Big Data Processing Based on Frequent Activity 被引量:10

Trajectory Big Data Processing Based on Frequent Activity
原文传递
导出
摘要 With the rapid development and wide use of Global Positioning System in technology tools, such as smart phones and touch pads, many people share their personal experience through their trajectories while visiting places of interest. Therefore, trajectory query processing has emerged in recent years to help users find their best trajectories. However, with the huge amount of trajectory points and text descriptions, such as the activities practiced by users at these points, organizing these data in the index becomes tedious. Therefore, the parallel method becomes indispensable. In this paper, we have investigated the problem of distributed trajectory query processing based on the distance and frequent activities. The query is specified by start and final points in the trajectory, the distance threshold, and a set of frequent activities involved in the point of interest of the trajectory.As a result, the query returns the shortest trajectory including the most frequent activities with high support and high confidence. To simplify the query processing, we have implemented the Distributed Mining Trajectory R-Tree index(DMTR-Tree). For this method, we initially managed the large trajectory dataset in distributed R-Tree indexes.Then, for each index, we applied the frequent itemset Apriori algorithm for each point to select the frequent activity set. For the faster computation of the above algorithms, we utilized the cluster computing framework of Apache Spark with MapReduce as the programing model. The experimental results show that the DMTR-Tree index and the query-processing algorithm are efficient and can achieve the scalability. With the rapid development and wide use of Global Positioning System in technology tools, such as smart phones and touch pads, many people share their personal experience through their trajectories while visiting places of interest. Therefore, trajectory query processing has emerged in recent years to help users find their best trajectories. However, with the huge amount of trajectory points and text descriptions, such as the activities practiced by users at these points, organizing these data in the index becomes tedious. Therefore, the parallel method becomes indispensable. In this paper, we have investigated the problem of distributed trajectory query processing based on the distance and frequent activities. The query is specified by start and final points in the trajectory, the distance threshold, and a set of frequent activities involved in the point of interest of the trajectory.As a result, the query returns the shortest trajectory including the most frequent activities with high support and high confidence. To simplify the query processing, we have implemented the Distributed Mining Trajectory R-Tree index(DMTR-Tree). For this method, we initially managed the large trajectory dataset in distributed R-Tree indexes.Then, for each index, we applied the frequent itemset Apriori algorithm for each point to select the frequent activity set. For the faster computation of the above algorithms, we utilized the cluster computing framework of Apache Spark with MapReduce as the programing model. The experimental results show that the DMTR-Tree index and the query-processing algorithm are efficient and can achieve the scalability.
出处 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2019年第3期317-332,共16页 清华大学学报(自然科学版(英文版)
基金 partially supported by the National Natural Science Foundation of China (Nos. U1509216 and 61472099) the National Sci-Tech Support Plan (No. 2015BAH10F01) the Scientific Research Foundation for the Returned Overseas Chinese Scholars of Heilongjiang Provience (No. LC2016026) MOECMicrosoft Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology
关键词 DISTRIBUTED R-TREE TRAJECTORY frequent ACTIVITY QUERY distributed R-tree trajectory frequent activity query
  • 相关文献

参考文献1

二级参考文献22

  • 1Li Z, Ding B, Han J, Kays R. Swarm: Mining relaxed tem- poral moving object clusters. Proceedings of the VLDB En- dowment, 2010, 3(1/2): 723-734.
  • 2Zheng K, Zheng Y, Yuan N, Shang S, Zhou X. Online dis- covery of gathering patterns over trajectories. IEEE Trans. Knowledge and Data Engineering, 2014, 26(8): 1974-1988.
  • 3Huang M, Hu P, Xia L. A grid based trajectory indexing method for moving objects on fixed network. In Proc. the 18th Int. Conf. Geoinformatics, June 2010.
  • 4Popa L S, Zeitouni K, Oria V, et al. Indexing in-network trajectory flows. The VLDB Journal, 2011, 20(5): 643-669.
  • 5Chu S, Yeh C, Huang C. A cloud-based trajectory index scheme. In Proc. the 12th ICEBE, October 2009, pp.602- 607.
  • 6Vlachos M, Kollios G, Gunopulos D. Discovering similar multidimensional trajectories. In Proc. the 18th ICDE, Feb. 26-Mar. 1, 2002, pp.673-684.
  • 7Chen L, Ozsu M T, Oria V. Robust and fast similarity search for moving object trajectories. In Proc. the 2gth SIG- MOD, June 2005, pp.491-502.
  • 8Chen Z, Shen H, Zhou X, Zheng Y, Xie X. Searching tra- jectories by locations: An efficiency study. In Proc. the 29th SIGMOD, June 2010, pp.255-266.
  • 9Chen Z, Shen H, Zhou X. Discovering popular routes from trajectories. In Proc. the 27th /CDE, April 2011, pp.900- 911.
  • 10Zheng K, Shang S, Yuan N J, 5rang Y. Towards efficient search for activity trajectories. In Proc. the 29th ICDE, April 2013, pp.230-241.

共引文献3

同被引文献58

引证文献10

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部