大數(shù)據(jù)環(huán)境下圖書館公共媒體數(shù)據(jù)庫建設與利用研究
發(fā)布時間:2018-09-07 19:10
【摘要】:在云技術(shù)和物聯(lián)網(wǎng)的推動下,全球已經(jīng)進入了大數(shù)據(jù)時代。數(shù)據(jù)信息是圖書館的核心和提供一切服務的基礎,F(xiàn)有圖書館中的資源是靜態(tài)的、結(jié)構(gòu)化和少量半結(jié)構(gòu)化的學術(shù)科研文獻、基礎常識文獻、歷史小說文獻等正式出版物,而網(wǎng)絡上的非正式出版物,尤其是公共媒體平臺上的用戶行為信息、社交網(wǎng)絡灰色信息和政府非公開公布的公共管理信息嚴重缺損,致使圖書館的信息資源在這個大數(shù)據(jù)時代滯后和不完整。公共媒體信息不僅能完善圖書館的資源,而且能為圖書館提供新的知識服務,幫助圖書館在大數(shù)據(jù)時代掌握和整合更多的數(shù)據(jù)信息以增強其核心競爭力。采用傳感器、網(wǎng)絡爬蟲等大數(shù)據(jù)采集技術(shù)從公共媒體平臺上獲取用戶行為數(shù)據(jù)、社交網(wǎng)絡灰色文獻以及政府公共管理數(shù)據(jù),并基于HBase數(shù)據(jù)庫的存儲理論和主題分類、知識地圖等組織方法對采集到的三類數(shù)據(jù)進行整合,初步探討由用戶行為數(shù)據(jù)庫、社交網(wǎng)絡數(shù)據(jù)庫和公共管理數(shù)據(jù)庫三個子庫組成的公共媒體數(shù)據(jù)庫的規(guī)劃。 本文從國內(nèi)外大數(shù)據(jù)理論及圖書館數(shù)據(jù)庫建設的實踐兩方面進行分析研究并進行借鑒,來開始探討適合我國圖書館公共媒體數(shù)據(jù)庫的建設。本文包含如下六個部分: 第一章內(nèi)容是緒論,闡述選題背景、目的和意義。 第二章內(nèi)容是國內(nèi)外圖書館大數(shù)據(jù)研究現(xiàn)狀以及圖書館建立公共媒體數(shù)據(jù)庫必要性進行闡述。 第三章內(nèi)容是闡述公共媒體資源建設的理論基礎和大數(shù)據(jù)基礎理論。 第四章內(nèi)容是參照第二章中圖書館數(shù)據(jù)庫的建設,在第三章大數(shù)據(jù)環(huán)境下公共媒體資源建設的方式方法上分別論述用戶行為子數(shù)據(jù)庫、社交網(wǎng)絡文獻子數(shù)據(jù)庫和公共管理子數(shù)據(jù)庫三個字庫的建設。 第五章內(nèi)容是針對構(gòu)建的公共媒體數(shù)據(jù)庫的利用描述。 第六章對本文的研究內(nèi)容進行分析和總結(jié),并提出今后需要進行關(guān)注的問題。
[Abstract]:In cloud technology and the Internet of things, the world has entered the big data era. Data information is the core of library and the basis of providing all services. The resources in the existing libraries are static, structured and few semi-structured academic research literature, basic knowledge literature, historical novel literature and other official publications, while the network of informal publications, Especially, the information of user behavior on the public media platform, the grey information of social network and the information of public management released by the government are seriously defective, which makes the information resources of the library lag behind and incomplete in this era of big data. The public media information can not only perfect the library's resources, but also provide new knowledge service for the library, help the library to master and integrate more data information in the era of big data in order to enhance its core competitiveness. Big data acquisition technology such as sensor, web crawler and so on is used to obtain user behavior data from public media platform, social network grey literature and government public management data, and based on HBase database storage theory and subject classification. The three kinds of data collected are integrated by knowledge map, and the planning of public media database composed of user behavior database, social network database and public management database is discussed preliminarily. This paper analyzes and studies the theory of big data at home and abroad and the practice of library database construction and uses it for reference to begin to explore the construction of library public media database suitable for our country. This paper includes six parts as follows: the first chapter is an introduction, explaining the background, purpose and significance of the topic. The second chapter is about the research status of big data and the necessity of establishing public media database. The third chapter expounds the theoretical basis of the construction of public media resources and big data's basic theory. The fourth chapter refers to the construction of library database in Chapter 2, and discusses the user behavior sub-database on the way and method of the construction of public media resources under the environment of big data in the third chapter. The construction of social network document sub-database and public management sub-database. The fifth chapter is a description of the use of the public media database. The sixth chapter analyzes and summarizes the research contents of this paper, and points out the problems that need to be paid attention to in the future.
【學位授予單位】:遼寧師范大學
【學位級別】:碩士
【學位授予年份】:2014
【分類號】:G250.74
[Abstract]:In cloud technology and the Internet of things, the world has entered the big data era. Data information is the core of library and the basis of providing all services. The resources in the existing libraries are static, structured and few semi-structured academic research literature, basic knowledge literature, historical novel literature and other official publications, while the network of informal publications, Especially, the information of user behavior on the public media platform, the grey information of social network and the information of public management released by the government are seriously defective, which makes the information resources of the library lag behind and incomplete in this era of big data. The public media information can not only perfect the library's resources, but also provide new knowledge service for the library, help the library to master and integrate more data information in the era of big data in order to enhance its core competitiveness. Big data acquisition technology such as sensor, web crawler and so on is used to obtain user behavior data from public media platform, social network grey literature and government public management data, and based on HBase database storage theory and subject classification. The three kinds of data collected are integrated by knowledge map, and the planning of public media database composed of user behavior database, social network database and public management database is discussed preliminarily. This paper analyzes and studies the theory of big data at home and abroad and the practice of library database construction and uses it for reference to begin to explore the construction of library public media database suitable for our country. This paper includes six parts as follows: the first chapter is an introduction, explaining the background, purpose and significance of the topic. The second chapter is about the research status of big data and the necessity of establishing public media database. The third chapter expounds the theoretical basis of the construction of public media resources and big data's basic theory. The fourth chapter refers to the construction of library database in Chapter 2, and discusses the user behavior sub-database on the way and method of the construction of public media resources under the environment of big data in the third chapter. The construction of social network document sub-database and public management sub-database. The fifth chapter is a description of the use of the public media database. The sixth chapter analyzes and summarizes the research contents of this paper, and points out the problems that need to be paid attention to in the future.
【學位授予單位】:遼寧師范大學
【學位級別】:碩士
【學位授予年份】:2014
【分類號】:G250.74
【參考文獻】
相關(guān)期刊論文 前10條
1 楊海燕;;大數(shù)據(jù)時代的圖書館服務淺析[J];圖書與情報;2012年04期
2 王忠;;美國推動大數(shù)據(jù)技術(shù)發(fā)展的戰(zhàn)略價值及啟示[J];中國發(fā)展觀察;2012年06期
3 王天泥;;知識咨詢:大數(shù)據(jù)時代圖書館的知識服務增長點[J];圖書與情報;2013年02期
4 王珊;王會舉;覃雄派;周p,
本文編號:2229165
本文鏈接:http://sikaile.net/guanlilunwen/gonggongguanlilunwen/2229165.html
最近更新
教材專著