天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁(yè) > 科技論文 > 搜索引擎論文 >

數(shù)學(xué)搜索中索引模型研究

發(fā)布時(shí)間:2018-11-26 17:58
【摘要】:搜索引擎是從互聯(lián)網(wǎng)的海量數(shù)據(jù)中檢索有用信息的高效工具,然而隨著互聯(lián)網(wǎng)的迅猛發(fā)展,用戶群體的增長(zhǎng),數(shù)字信息化程度的不斷提高和新技術(shù)的飛速發(fā)展,人們對(duì)信息的需求越來(lái)越多樣化,搜索引擎面臨越來(lái)越多的挑戰(zhàn)。近幾年來(lái),數(shù)學(xué)公式的檢索己成為信息學(xué)科研究的熱點(diǎn)和難點(diǎn)問(wèn)題,它對(duì)學(xué)習(xí)和科研非常重要,而通用的文本搜索引擎在對(duì)數(shù)學(xué)內(nèi)容的檢索上有很大的局限性,使用戶無(wú)法得到滿意的搜索結(jié)果。 數(shù)學(xué)公式有著復(fù)雜的二維結(jié)構(gòu)以及蘊(yùn)含著豐富的語(yǔ)義,不同結(jié)構(gòu)的數(shù)學(xué)公式可能有著相同的數(shù)學(xué)含義,一個(gè)數(shù)學(xué)公式也可能有多種描述方法。此外子公式的查詢也是數(shù)學(xué)搜索中很有意義的一項(xiàng)研究?jī)?nèi)容,用戶輸入的查詢公式有可能就是某個(gè)數(shù)學(xué)表達(dá)式的子公式,在返回檢索結(jié)果時(shí),應(yīng)將包含該查詢公式的原公式也返回給用戶。目前,國(guó)內(nèi)外也有一些專(zhuān)門(mén)從事數(shù)學(xué)搜索研究的機(jī)構(gòu),但他們大多數(shù)都是針對(duì)完全相同的數(shù)學(xué)公式進(jìn)行檢索,未涉及數(shù)學(xué)公式的語(yǔ)義,對(duì)于子公式的檢索也未進(jìn)行深入的探討和研究。因此,本文在深入分析對(duì)比了現(xiàn)存的一些數(shù)學(xué)搜索引擎索引模型的構(gòu)建方法和技術(shù)基礎(chǔ)上,將計(jì)算機(jī)代數(shù)系統(tǒng)(CAS)與數(shù)學(xué)搜索相結(jié)合,提出了一種基于語(yǔ)義的索引模型構(gòu)建方法。系統(tǒng)采用抽象樹(shù)倒排索引模型,在建立索引前對(duì)數(shù)學(xué)公式進(jìn)行預(yù)處理,利用CAS對(duì)數(shù)學(xué)公式規(guī)范化,并借鑒文本搜索引擎的N-gram方法,對(duì)數(shù)學(xué)公式進(jìn)行子公式的劃分,將它們也插入到索引項(xiàng)中,以此實(shí)現(xiàn)等價(jià)和相關(guān)數(shù)學(xué)公式的有效存儲(chǔ)與管理,大大提升了數(shù)學(xué)搜索的語(yǔ)義檢索能力。
[Abstract]:Search engine is an efficient tool for retrieving useful information from the mass data of the Internet. However, with the rapid development of the Internet, the growth of user groups, the continuous improvement of digital information level and the rapid development of new technologies, the search engine is an efficient tool for retrieving useful information from the mass data of the Internet. People's demand for information is more and more diverse, search engine is facing more and more challenges. In recent years, the retrieval of mathematical formulas has become a hot and difficult problem in the field of information science. It is very important for learning and scientific research, while the general text search engine has great limitations on the retrieval of mathematical content. Prevents the user from obtaining satisfactory search results. Mathematical formulas have complex two-dimensional structure and rich semantics. The mathematical formulas of different structures may have the same mathematical meaning, and a mathematical formula may also have a variety of description methods. In addition, the query of subformula is also a meaningful research content in mathematical search. The query formula input by the user may be a subformula of a mathematical expression, and when the retrieval result is returned, The original formula that contains the query formula should also be returned to the user. At present, there are also some institutions specialized in the research of mathematical search, but most of them search for the exact same mathematical formula, not involving the semantics of the mathematical formula. The search for subformulas has not been deeply discussed and studied. Therefore, on the basis of in-depth analysis and comparison of some existing mathematical search engine index models, this paper combines computer algebra system (CAS) with mathematical search. A semantic-based index model construction method is proposed. The system adopts the Abstract Tree inverted Index Model, preprocesses the mathematical formula before establishing the index, normalizes the mathematical formula by using CAS, and uses the N-gram method of the text search engine to divide the mathematical formula into sub-formulas. They are also inserted into the index items to realize the effective storage and management of equivalent and related mathematical formulas, which greatly improves the semantic retrieval ability of mathematical search.
【學(xué)位授予單位】:蘭州大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類(lèi)號(hào)】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前1條

1 聶俊;陳天瑩;符紅光;;基于Latex的互聯(lián)網(wǎng)數(shù)學(xué)公式搜索引擎[J];計(jì)算機(jī)應(yīng)用;2010年S2期

,

本文編號(hào):2359236

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2359236.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶f2ed3***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com