摘要
从日志信息采集、处理、存储等方面研究了分布式技术在日志处理平台的应用。使用Flume进行采集历史数据以及实时日志数据,并将收集的数据使用Kafka来缓存进行离线与实时清洗,最后将日志数据存储到数据库中,进行数据分析以图表的形式展示,做出相应策略从而提高效益。
The applications of distributed technology in log processing platform is studied from the aspects of log information collection,processing and storage.Using the Flume real time collecting history data and log data,and will collect the data using the Kafka to cache offline and real-time cleaning,finally the log data is stored in the database,for data analysis in the form of charts show,make corresponding strategy to improve efficiency.
作者
杨潇黎
蒋廷耀
金鑫
罗神
Yang Xiaoli;Jiang Tingyao;Jin Xin;Luo Shen(School of Computer and Information,China Three Gorges University,Yichang 443000 China)
出处
《信息通信》
2019年第3期146-148,共3页
Information & Communications