What changes big data brings to digital libraries
Digital libraries encounter challenges

"All types of data are growing dramatically and moving toward massive scale, and the National Digital Library faces challenges in areas such as long-term preservation of digital resources, resource integration, information security, and service innovation," Wei Dawei said. By the end of 2013, the total volume of the National Digital Library's digital resources had reached 874.5TB, of which self-built digital resources accounted for 737.9TB and collected network information for 45.7TB. The number of purchased Chinese and foreign-language databases totaled 273, and the metadata aggregated by Wenjin search had reached 290 million records. As reader services expand to computers, digital TV, cell phones, handheld readers, tablet PCs, electronic touch screens, and other service terminals, and as service volume grows, the business systems generate a large amount of log data every day, much of it carrying information about user behavior. For example, the Aleph system produces about 20GB of log data per day on average, and the Wenjin search system produces more than 300GB per day on average.

A super-large-scale metadata warehouse will be established

Wei Dawei pointed out that, in this new environment, the National Library is taking resource integration as its main lever for achieving a high degree of integration between its traditional and digital library services and for maximizing the effectiveness of those services.

He further emphasized that the integration of digital resources must take into account both the characteristics of big data and the current state of the library's resources. It should be oriented to user needs, draw on the strengths of others, highlight the library's own distinctive features, and be implemented in planned phases. Establishing a super-large-scale metadata warehouse is one of the ideas for resource integration in future digital libraries, with the aim of unified aggregation of resources and one-stop retrieval. Other ideas include combining cloud services with Linked Data to organize and aggregate digital collections, and building a "resource-user" relationship model. Resource integration, however, also faces challenges in funding, talent, technology, and other areas.