基于数据挖掘技术的防震减灾科普资源管理平台

Public Science Resource Management Platform for Earthquake Prevention and Disaster Reduction Based on Data Mining Techniques

摘要: 鉴于防震减灾科普资料存在数据类型多、管理难度大，海量数据检索困难、利用率低，资料科学性难以保证等问题，设计了基于Hadoop架构的防震减灾科普资源管理平台架构，设定了防震减灾科普资源搜索指标，通过网络爬虫及人工输入等方式，对相关数据进行采集、清洗、转换及合并，实现不同类型数据资源的分类，为防震减灾科普资源的管理提供科学方法。

Abstract: In view of the problems existing in the public science resources of earthquake prevention and disaster reduction, such as multiple data types, difficulty in management and massive data retrieval, low utilization rate and uncertain accuracy of publicity materials, this paper designs a public science resources management platform based on Hadoop architecture. By means of web crawler and manual entry under the searching index catalog, the platform supports collecting relevant data of earthquake prevention and disaster reduction that enables classification of different data types and scientific methods for the management of public science resources.