Research and Implementation of an Android-Based Vertical Search Engine for Digital Products
[Abstract]: The rapid development of information technology has made daily life more convenient, but the growth in the volume of information has also made it harder to find what we need, a problem that was eased only with the emergence of search engines. A general-purpose search engine, however, returns results indiscriminately, leaving users to filter a wide variety of results by hand. Vertical search engines, a more specialized form of search engine, offer a clear advantage: precise queries and clear classification let users find what they need easily and accurately, which improves user retention and makes such engines easy to popularize.

Since the beginning of the new century, smart mobile devices have spread rapidly and wireless technologies such as 3G and WLAN have come into wider use, bringing the mobile Internet ever closer to everyday life and driving vigorous growth in mobile applications. Android currently holds a large share of the mobile application development market and is favored by many mobile developers. A search engine on a mobile client supports real-time search and can reduce the cost of shopping, and people want to obtain information from the Internet anytime, anywhere. An intelligent search system for mobile terminals meets this need.

Drawing on a resource base of notebook computer and mobile phone products, this thesis studies the characteristics of vertical search engines and designs and implements a vertical search engine system for digital products. The system extends and improves the open-source crawler Heritrix and addresses problems encountered during web page crawling. Using HTMLParser, the pages fetched by the customized crawler are parsed into structured text and stored. Using Lucene, an index is built over the structured text, a specialized thesaurus is constructed, and the retrieval module is implemented. The user interface is built on a Java EE three-tier architecture with Spring and DWR. On top of this search system, a communication interface between the Android client and the server is designed and implemented, so that digital product search from an Android client is also covered by this work. Finally, system testing shows that the proposed vertical search engine scheme for digital products on mobile devices is feasible and that query efficiency and accuracy are improved.

The innovations of this thesis and the system are as follows: the topic-focused crawler module is built using related strategies and algorithms, which makes the retrieved information more accurate and closer to user expectations; and the portal's vertical search function is implemented on Android.
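The index-and-retrieve stage described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation and it does not use Lucene, HTMLParser, or Heritrix: the class `TinyProductIndex`, its methods, and the sample product records are all hypothetical, and a plain in-memory inverted index stands in for the Lucene index. The tokenizer handles only ASCII letters and digits, whereas the system described above would need Chinese word segmentation backed by its specialized thesaurus.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

/** Hypothetical toy inverted index; a stand-in for a Lucene-backed index. */
final class TinyProductIndex {
    // term -> sorted set of ids of the documents containing that term
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();
    // document id -> original structured record
    private final List<String> docs = new ArrayList<>();

    /** Lower-cases the text and splits it on runs of non-alphanumeric characters. */
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    /** Adds one structured record and returns its document id. */
    int add(String record) {
        int id = docs.size();
        docs.add(record);
        for (String term : tokenize(record)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(id);
        }
        return id;
    }

    /** Returns the ids of documents that contain every query term (boolean AND). */
    List<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : tokenize(query)) {
            TreeSet<Integer> ids = postings.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(ids);   // copy so postings stay untouched
            } else {
                result.retainAll(ids);         // intersect with the next term
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }

    /** Returns the stored record for a document id. */
    String get(int id) {
        return docs.get(id);
    }

    public static void main(String[] args) {
        TinyProductIndex index = new TinyProductIndex();
        // Sample records are invented for illustration only.
        index.add("category: notebook; brand: ExampleBrand; screen: 14 inch");
        index.add("category: phone; brand: ExampleBrand; os: Android");
        index.add("category: phone; brand: OtherBrand; os: Android");
        for (int id : index.search("phone examplebrand")) {
            System.out.println(id + " -> " + index.get(id));
        }
    }
}
```

Restricting the index to structured product records, rather than arbitrary pages, is what lets a simple boolean AND over field values return focused results; a production system would add ranking and field-aware queries on top of this.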
[Degree-granting institution]: Kunming University of Science and Technology
[Degree level]: Master's
[Year degree conferred]: 2013
[Classification number]: TP391.3