天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 搜索引擎論文 >

基于Android的數(shù)碼產(chǎn)品垂直搜索引擎研究與實現(xiàn)

發(fā)布時間:2019-03-31 20:55
【摘要】:信息技術(shù)的迅速發(fā)展給我們的生活帶來了很多樂趣,然而,信息量的增多給我們查詢所需要的信息帶來了很大的難度,直到搜索引擎的出現(xiàn)才緩解了這一局勢。通用搜索引擎將搜索結(jié)果不加區(qū)分的返回給用戶,用戶還需要從五花八門的結(jié)果中再進(jìn)行大量的人工篩選才能找到自己想要的結(jié)果。作為搜索引擎的高級形式,垂直搜索引擎帶來了明顯的查詢優(yōu)勢。它查詢精準(zhǔn),分類明確,使用戶可以方便、準(zhǔn)確的找到自己所需,增加了用戶黏性,推廣起來也不成問題。 進(jìn)入新世紀(jì)以來,智能移動終端快速普及,3G、WLAN等無線通信技術(shù)也得到了較好的應(yīng)用,這都標(biāo)志著移動互聯(lián)網(wǎng)離我們的生活越來越近,隨之而來的是蓬勃發(fā)展的移動應(yīng)用開發(fā)。當(dāng)前,Android系統(tǒng)占據(jù)了移動應(yīng)用開發(fā)系統(tǒng)較大比例的市場,受到了廣大移動應(yīng)用開發(fā)者的青睞。手機(jī)客戶端搜索引擎能夠起到實時搜索、降低購物成本等作用,人們希望能隨時隨地從因特網(wǎng)獲得更豐富的信息,這時一個移動終端的智能搜索系統(tǒng)就可以滿足用戶當(dāng)前的需要。 本文以筆記本電腦和手機(jī)產(chǎn)品的資源庫為背景,通過研究垂直搜索引擎的特點,設(shè)計并實現(xiàn)了一個數(shù)碼產(chǎn)品垂直搜索引擎系統(tǒng):其中包括對開源網(wǎng)絡(luò)爬蟲Heritrix的擴(kuò)展與改進(jìn),對網(wǎng)頁抓取過程中存在的問題進(jìn)行了優(yōu)化和處理;結(jié)合HTMLParser技術(shù),將爬蟲定制抓取下來的網(wǎng)頁解析成結(jié)構(gòu)化的文本并進(jìn)行存儲;結(jié)合Lucene技術(shù),對結(jié)構(gòu)化的文本建立了索引,同時還構(gòu)建了專業(yè)詞庫,實現(xiàn)了檢索模塊;采用JAVA EE三層架構(gòu),利用Spring和DWR技術(shù),開發(fā)了用戶接口。在此搜索系統(tǒng)的基礎(chǔ)上,通過制定和實現(xiàn)Android客戶端與服務(wù)器端之間的通信接口,將Android客戶端的數(shù)碼產(chǎn)品搜索納入到本文的研究內(nèi)容中,更好地滿足用戶的需要。最后通過對系統(tǒng)測試,移動數(shù)碼產(chǎn)品垂直搜索引擎的方案是切實可行的,提高了查詢的效率和準(zhǔn)確度。 本文及系統(tǒng)的創(chuàng)新點有:構(gòu)建本系統(tǒng)的時候,采用了相關(guān)策略和算法實現(xiàn)了主題網(wǎng)絡(luò)爬蟲模塊,使獲取到的信息更加精確、更符合用戶預(yù)期;在Android系統(tǒng)上實現(xiàn)了門戶網(wǎng)站的垂直搜索功能。
[Abstract]:The rapid development of information technology has brought a lot of fun to our lives. However, the increase of information brings us a lot of difficulty in searching for the information we need, and it is not until the emergence of search engine that this situation is alleviated. The general search engine returns the search results indiscriminately to the user, and users need to do a lot of manual filtering from a wide variety of results in order to find the results they want. As the advanced form of search engine, vertical search engine brings obvious query advantage. It is accurate query, clear classification, so that users can easily and accurately find their own needs, increase the viscosity of users, it is not a problem to popularize. Since the beginning of the new century, intelligent mobile terminals have been rapidly popularized, 3G, WLAN and other wireless communication technologies have also been better used, which indicates that the mobile Internet is getting closer and nearer to our lives. What follows is the vigorous development of mobile applications. At present, Android system occupies a large proportion of the mobile application development system market, and has been favored by the majority of mobile application developers. Mobile client search engine can play a real-time search, reduce the cost of shopping, and so on, people want to get more information from the Internet anytime, anywhere. At this time, a mobile terminal intelligent search system can meet the current needs of users. Based on the resource base of notebook and mobile phone products, this paper designs and implements a vertical search engine system for digital products by studying the characteristics of vertical search engine. This system includes the extension and improvement of open source crawler Heritrix. The problems existing in the process of web page crawling are optimized and dealt with. Combined with HTMLParser technology, the crawler customized web page is parsed into structured text and stored, and combined with Lucene technology, the index of structured text is established, at the same time, the specialized thesaurus is constructed, and the retrieval module is realized. The user interface is developed by using JAVA EE three-tier architecture, Spring and DWR technology. On the basis of this search system, through the establishment and implementation of the communication interface between the Android client and the server, the digital product search of the Android client is included in the research content of this paper, so as to better meet the needs of the users. Finally, through the system test, the scheme of vertical search engine for mobile digital products is feasible, and the efficiency and accuracy of query are improved. The innovations of this paper and the system are as follows: when the system is built, the related strategies and algorithms are used to realize the topic network crawler module, which makes the obtained information more accurate and more in line with the user's expectation; The vertical search function of portal is realized on Android system.
【學(xué)位授予單位】:昆明理工大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2013
【分類號】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 張麗敏;;垂直搜索引擎的主題爬蟲策略[J];電腦知識與技術(shù);2010年15期

2 吳濤;;PAGERANK算法下的網(wǎng)站鏈接優(yōu)化策略研究[J];電子商務(wù);2009年07期

3 張義忠,趙明生,朱精南;基于內(nèi)容的網(wǎng)頁特征提取[J];計算機(jī)工程與應(yīng)用;2001年10期

4 吉根林,孫志揮;Web挖掘技術(shù)研究[J];計算機(jī)工程;2002年10期

5 王琦;張戈;何婧;;基于Lucene與Heritrix的圖書垂直搜索引擎的研究與實現(xiàn)[J];計算機(jī)時代;2010年02期

6 陳再良;凌力;周強(qiáng);;dPageRank——一種改進(jìn)的分布式PageRank算法[J];計算機(jī)應(yīng)用;2006年01期

7 白坤;耿國華;;基于Lucene/Heritrix的垂直搜索引擎的研究與應(yīng)用[J];計算機(jī)應(yīng)用與軟件;2009年01期

8 劉運強(qiáng);;垂直搜索引擎的研究與設(shè)計[J];計算機(jī)應(yīng)用與軟件;2010年07期

9 邱戰(zhàn)宏;顧國慶;陳江洪;;搜索引擎的現(xiàn)狀及發(fā)展趨勢探析[J];科技廣場;2009年09期

10 王繼明;楊國林;;基于Lucene的中文文本分詞[J];內(nèi)蒙古工業(yè)大學(xué)學(xué)報(自然科學(xué)版);2007年03期

,

本文編號:2451268

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2451268.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶45846***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com