天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當前位置:主頁 > 科技論文 > 搜索引擎論文 >

林業(yè)動態(tài)信息快速搜索與集成

發(fā)布時間:2018-08-18 10:03
【摘要】:我國林業(yè)是國民經(jīng)濟的基礎(chǔ)產(chǎn)業(yè),擔負著生態(tài)環(huán)境建設(shè)和促進社會可持續(xù)發(fā)展的重大使命,人類是林業(yè)產(chǎn)業(yè)建設(shè)中的主要受益群體,當林業(yè)產(chǎn)業(yè)結(jié)構(gòu)形成時,人們就在其中發(fā)揮其各自不同的作用。近年來,林業(yè)信息化推進,促進了林業(yè)信息資源的共享,為公眾提供了便利,促進了林業(yè)產(chǎn)業(yè)的發(fā)展。但是,林業(yè)信息化的發(fā)展還有其必要性,如何更好地利用林業(yè)信息資源,為我國林業(yè)科學領(lǐng)域廣大科研人員、教學工作者以及林農(nóng)服務(wù),就迫切需要對林業(yè)信息實現(xiàn)快速搜索與集成。 如何從海量信息中快速查找到用戶所需要的信息,已經(jīng)成為公眾對特定領(lǐng)域信息的查找所面臨的主要問題;ヂ(lián)網(wǎng)上林業(yè)信息越來越龐雜且無序,普通的搜索引擎已經(jīng)不能滿足大眾對個性化信息的需求。針對用戶在進行林業(yè)主題信息查詢時,通用搜索引擎需要花費大量時間及精力去查找所需要的信息,且主題信息的召回率和精確率都比較低,不能滿足用戶的需求,因此公眾急需一個分類精確、數(shù)據(jù)全面、更新及時的林業(yè)主題搜索引擎。 本論文的研究內(nèi)容來自導(dǎo)師主持的湖南省科技計劃重點項目(2010nk2004)。本文以系統(tǒng)科學、林學、信息學和統(tǒng)計學等理論為指導(dǎo),對林業(yè)動態(tài)信息搜索與集成進行了全面的研究。研究過程中,對國內(nèi)外林業(yè)動態(tài)信息搜索與集成等方面的研究進行了綜述。主要從林業(yè)動態(tài)信息的需求分析與分類、主題爬蟲搜索器以及文本識別分類器等三個方面進行了研究,主要研究工作如下: (1)綜合分析了國內(nèi)外對于搜索引擎的既有理論和實踐成果,表明了目前建立一個林業(yè)主題搜索引擎的重要性和必要性,并對其中的關(guān)鍵技術(shù)進行了深入研究。本研究將林業(yè)主題搜索引擎分為數(shù)據(jù)收集層、數(shù)據(jù)存儲層以及數(shù)據(jù)表示層三個層次,并對這三個層次中涉及的相關(guān)方法進行了探討和總結(jié)。 (2)利用網(wǎng)頁上公布的信息,結(jié)合各部門及公眾對林業(yè)動態(tài)信息的需求,明確對各部門及公眾真正有實際意義的林業(yè)動態(tài)信息類別,并對所需林業(yè)動態(tài)信息進行分類、分塊,使各種林業(yè)動態(tài)信息具體化,主要分為以下七類:林業(yè)科技信息、林業(yè)生產(chǎn)資料、林產(chǎn)品市場供求信息、花卉信息、林業(yè)政策法規(guī)、林業(yè)勞務(wù)信息、氣象與環(huán)境信息。 (3)根據(jù)已構(gòu)建的林業(yè)動態(tài)信息類別體系,搜集與之相關(guān)的林業(yè)專業(yè)網(wǎng)站,明確信息采集的網(wǎng)站來源。采集我們所需要的數(shù)據(jù)所提供的網(wǎng)站域名,并采集域名后的內(nèi)容,同時辨別所采集的網(wǎng)站類別,實現(xiàn)對林業(yè)動態(tài)信息源的搜集及分類。 (4)運用基于內(nèi)容分析與基于鏈接結(jié)構(gòu)分析相結(jié)合的一種新型搜索策略,通過綜合分析評價,對候選URL所指向的頁面進行主題相關(guān)度判斷以及對候選URL進行排序,實現(xiàn)最優(yōu)的林業(yè)主題爬蟲搜索器,從而使所下載的網(wǎng)頁按與林業(yè)主題相關(guān)且重要性突出遞減的順序排列。 (5)采用計算機智能的SVM自動文本分類技術(shù),對樣本數(shù)據(jù)進行機器訓練,實現(xiàn)對主題爬蟲搜索器所采集到的林業(yè)動態(tài)信息進行分類存儲,達到對林業(yè)主題搜索引擎的數(shù)據(jù)收集層的構(gòu)建進行優(yōu)化的目的。 林業(yè)動態(tài)信息搜索與集成是在對現(xiàn)有的搜索與集成技術(shù)進行研究和優(yōu)化的基礎(chǔ)上,融合了公眾對林業(yè)動態(tài)信息的需求,使得公眾在獲取林業(yè)動態(tài)信息時的準確率、全面率和成功率都得到了明顯提高。隨著科學技術(shù)的快速發(fā)展,新理論、新方法、新技術(shù)將進一步運用于林業(yè)動態(tài)信息快速搜索與集成,林業(yè)信息管理與服務(wù)也將邁上新臺階。
[Abstract]:Forestry in China is the basic industry of the national economy and undertakes the important mission of ecological environment construction and social sustainable development. Mankind is the main beneficiary group in the construction of forestry industry. When the forestry industrial structure forms, people play their different roles in it. In recent years, forestry informatization has promoted forestry credit. The sharing of information resources provides convenience for the public and promotes the development of forestry industry. However, the development of forestry informatization is still necessary. How to make better use of forestry information resources and provide services for scientific researchers, teaching workers and foresters in the field of Forestry Science in China is an urgent need to search and collect forestry information quickly. It is.
How to quickly find the information users need from the mass of information has become a major problem facing the public in the search of information in a specific field. Forestry information on the Internet is becoming more and more complex and disorderly, and ordinary search engines can no longer meet the needs of the public for personalized information. When searching, the general search engine needs to spend a lot of time and energy to find the information needed, and the recall rate and accuracy of the subject information are relatively low, which can not meet the needs of users. Therefore, the public urgently needs a forestry subject search engine with accurate classification, comprehensive data and timely update.
The research content of this paper comes from the key project of Hunan Science and Technology Program (2010 nk2004), which is presided over by the tutor. Guided by the theories of system science, forestry, informatics and statistics, this paper makes a comprehensive study on the search and integration of forestry dynamic information. In the course of the study, the research on the search and integration of forestry dynamic information at home and abroad is carried out. In this paper, the demand analysis and classification of forestry dynamic information, subject crawler searcher and text recognition classifier are summarized.
(1) The existing theories and practices of search engines at home and abroad are analyzed comprehensively, which indicates the importance and necessity of establishing a forestry subject search engine at present, and the key technologies are studied deeply. The forestry subject search engine is divided into three layers: data collection layer, data storage layer and data representation layer. At the same time, we discuss and summarize the relevant methods in these three levels.
(2) Using the information published on the web pages and combining with the demand of various departments and the public for forestry dynamic information, the types of forestry dynamic information which are really meaningful to the departments and the public are defined, and the required forestry dynamic information is classified and divided into seven groups, so as to concretize the various forestry dynamic information. Forestry means of production, market supply and demand information for forest products, flower information, forestry policies and regulations, Forestry labor information, meteorological and environmental information.
(3) According to the established forestry dynamic information classification system, collect the relevant forestry professional websites, identify the source of information collection websites, collect the domain name of the websites provided by the data we need, and collect the content after the domain name, at the same time identify the websites collected, so as to realize the collection and classification of forestry dynamic information sources.
(4) Using a new search strategy based on content analysis and link structure analysis, through comprehensive analysis and evaluation, the topic relevance of the pages pointed by the candidate URLs is judged and the candidate URLs are sorted to achieve the optimal forestry theme crawler searcher, so that the downloaded pages are related to forestry topics. And the importance is highlighted in decreasing order.
(5) Adopting SVM automatic text categorization technology of computer intelligence, the sample data is trained by machine, and the dynamic forestry information collected by the subject crawler searcher is classified and stored, so as to optimize the data collection layer of the forestry subject search engine.
Forestry dynamic information search and integration is based on the research and optimization of existing search and integration technology, which integrates the public demand for forestry dynamic information. The accuracy, comprehensiveness and success rate of public access to forestry dynamic information have been significantly improved. New methods and new technologies will be further applied to the rapid search and integration of forestry dynamic information, and forestry information management and service will also take a new step.
【學位授予單位】:中南林業(yè)科技大學
【學位級別】:碩士
【學位授予年份】:2013
【分類號】:S712

【參考文獻】

相關(guān)期刊論文 前10條

1 方鴻錦;孫旭東;劉燕德;;江西省農(nóng)業(yè)信息化與新農(nóng)村建設(shè)的研究[J];安徽農(nóng)業(yè)科學;2007年34期

2 張黎爍;李鑫;徐猛;;基于PageRank的網(wǎng)頁主題相關(guān)性算法研究[J];光盤技術(shù);2008年12期

3 王灝,黃厚寬,田盛豐;文本分類實現(xiàn)技術(shù)[J];廣西師范大學學報(自然科學版);2003年01期

4 劉林,汪濤,樊孝忠;主題爬蟲的解決方案[J];華南理工大學學報(自然科學版);2004年S1期

5 鄭麗桑;蘭樟仁;盧毅敏;;福建省林業(yè)信息服務(wù)平臺的研究[J];集美大學學報(自然科學版);2006年02期

6 錢功偉;倪林;曹榮;;基于網(wǎng)頁鏈接和內(nèi)容分析的改進PageRank算法[J];計算機工程與應(yīng)用;2007年21期

7 歐陽柳波,李學勇,李國徽,王鑫;專業(yè)搜索引擎搜索策略綜述[J];計算機工程;2004年13期

8 吳明禮,施水才;一種結(jié)合超鏈接分析的搜索引擎排序方法[J];計算機工程;2004年15期

9 李勇;韓亮;;主題搜索引擎中網(wǎng)絡(luò)爬蟲的搜索策略研究[J];計算機工程與科學;2008年03期

10 牛振國,符海芳,崔偉宏;面向多層用戶的農(nóng)業(yè)信息分類初步研究[J];計算機與農(nóng)業(yè).綜合版;2003年03期

相關(guān)碩士學位論文 前7條

1 陳杰;主題搜索引擎中網(wǎng)絡(luò)蜘蛛搜索策略研究[D];浙江大學;2006年

2 鄭火國;農(nóng)業(yè)信息服務(wù)平臺的構(gòu)建與實現(xiàn)[D];中國農(nóng)業(yè)科學院;2006年

3 劉瑋瑋;搜索引擎中主題爬蟲的研究與實現(xiàn)[D];南京理工大學;2006年

4 鄭健珍;定題爬蟲搜索策略研究[D];廈門大學;2007年

5 陳叢叢;主題爬蟲搜索策略研究[D];山東大學;2009年

6 王冬坡;基于Lucene的主題搜索引擎的研究與實現(xiàn)[D];河北科技大學;2010年

7 馮明麗;面向個性化主題搜索的用戶—查詢詞語義本體構(gòu)建[D];西華大學;2010年

,

本文編號:2189131

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2189131.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶bff1e***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com
免费亚洲黄色在线观看| 深夜少妇一区二区三区| 国产一区二区精品高清免费| 中文字幕一区二区免费| 太香蕉久久国产精品视频| 欧美一区二区三区高潮菊竹| 中日韩美女黄色一级片| 老司机精品在线你懂的| 国产精品国产亚洲看不卡| 日韩视频在线观看成人| 国产精品久久三级精品| 老富婆找帅哥按摩抠逼视频| 日韩和欧美的一区二区三区| 人妻露脸一区二区三区| 日本东京热加勒比一区二区| 欧美日韩乱码一区二区三区| 中文字幕日韩欧美一区| 精品国自产拍天天青青草原| 国产成人精品视频一区二区三区| 成年男女午夜久久久精品| 日韩美成人免费在线视频| 欧美丝袜诱惑一区二区| 91播色在线免费播放| 国产精品一区欧美二区| 欧美国产精品区一区二区三区| 成人精品视频一区二区在线观看| 国产成人人人97超碰熟女| 日韩少妇人妻中文字幕| 九九热国产这里只有精品| 欧美视频在线观看一区| 日本欧美一区二区三区就 | 久久综合狠狠综合久久综合| 亚洲中文字幕视频一区二区| 日本欧美视频在线观看免费| 亚洲一区二区精品免费视频| 九九九热在线免费视频| 国产日产欧美精品大秀| 神马午夜福利一区二区| 国产日韩精品激情在线观看| a久久天堂国产毛片精品| 狠狠做深爱婷婷久久综合|