天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 管理論文 > 財稅論文 >

基于有向圖的虛開增值稅發(fā)票行為檢測方法研究

發(fā)布時間:2018-06-27 07:58

  本文選題:虛開增值稅發(fā)票行為 + 有向圖; 參考:《西安科技大學(xué)》2017年碩士論文


【摘要】:在偵破虛開增值稅發(fā)票案件的實踐中,從資金交易流中找到資金環(huán)路是檢測虛開增值稅發(fā)票行為的關(guān)鍵環(huán)節(jié)。資金交易流可以抽象成由頂點和邊組成的有向圖,因此研究基于有向圖的虛開增值稅發(fā)票行為檢測方法是非常必要的。本文首先研究了虛開增值稅發(fā)票行為中的增值稅發(fā)票環(huán)路虛開行為表象與檢測方法,將從資金交易流中搜索資金環(huán)路問題抽象成對具有多重邊的有向圖環(huán)路搜索問題,其次研究了有向圖的存儲以及圖的環(huán)路搜索常用算法,并對算法的適用性進行比較,發(fā)現(xiàn)常用的環(huán)路搜索算法均不能很好地解決具有多重邊的有向圖環(huán)路搜索問題,因此,提出了基于深度優(yōu)先的有向圖環(huán)路搜索算法。先對原始數(shù)據(jù)預(yù)處理,將資金交易流和賬戶信息使用賬號編號關(guān)聯(lián)起來,找出每個頂點的邊集合并按交易時間升序排列,避免在運算過程中多次遍歷所有邊,通過約束邊的權(quán)值來避免遍歷無效邊,搜索過程采用逐層退回的方式來保證能遍歷到有效的多重邊且能找到有向圖中的所有環(huán)路,并在單機上驗證了算法的正確性。但是當(dāng)有向圖的邊數(shù)增大到100萬條以上時,算法執(zhí)行會發(fā)生內(nèi)存溢出,從而無法得到結(jié)果。針對該問題又提出了一種基于Spark的有向圖環(huán)路搜索算法,先將原始數(shù)據(jù)抽象成Spark中GraphX圖數(shù)據(jù)結(jié)構(gòu),然后利用數(shù)據(jù)緩存、尾遞歸等方法并使用GraphX框架及Scala語言在本地模式下驗證了算法的正確性,但是在相同的數(shù)據(jù)集下該算法時間效率明顯低于最初提出的算法。為了提高該算法的執(zhí)行效率,通過優(yōu)化循環(huán)、建立索引等方法對其進行優(yōu)化,在本地模式下驗證后,使用邊數(shù)為10000條以上的圖數(shù)據(jù)集在Spark集群下實驗,結(jié)果表明,優(yōu)化后的算法不僅能夠解決具有多重邊的大規(guī)模有向圖環(huán)路搜索問題,具有良好的擴展性,而且執(zhí)行效率得到明顯提高。最后,本文設(shè)計并實現(xiàn)了界面友好且可操作性強的虛開增值稅發(fā)票行為檢測系統(tǒng)。經(jīng)測試,該系統(tǒng)能夠從資金交易流中搜索出資金環(huán)路并能夠查詢出環(huán)路關(guān)聯(lián)賬戶的詳細(xì)信息,為稽查部門檢測虛開增值稅發(fā)票行為提供了重要的依據(jù),節(jié)約了稽查成本。
[Abstract]:In the practice of solving the case of false VAT invoice, it is the key link to detect the false VAT invoice behavior to find the capital loop from the fund transaction flow. The flow of funds can be abstracted into a directed graph composed of vertex and edge, so it is necessary to study the method of detecting the behavior of false VAT invoice based on digraph. In this paper, we first study the behavior representation and detection method of Value-added tax (VAT) invoice Loop in the behavior of false VAT invoicing, and abstract the problem of searching the Loop of Capital from the flow of funds into a pair of directed Graph Loop search problem with multiple edges. Secondly, the storage of directed graph and the loop search algorithm of graph are studied, and the applicability of the algorithm is compared. It is found that the commonly used loop search algorithm can not solve the loop search problem of directed graph with multiple edges. An algorithm for searching the loop of directed graph based on depth first is proposed. First of all, preprocess the raw data, associate the fund flow and account information with account number, find out the edge set of each vertex and arrange it in ascending order according to the transaction time, avoid traversing all sides many times in the course of operation. The weights of the constrained edges are used to avoid traversing the invalid edges. The search process adopts the method of layer by layer regression to ensure that all loops in the directed graph can be traversed effectively and all the loops in the directed graph can be found, and the correctness of the algorithm is verified on a single machine. However, when the number of edges of directed graph increases to more than 1 million, memory overflow will occur in the execution of the algorithm, so the result can not be obtained. In order to solve this problem, a new algorithm of directed graph loop search based on Spark is proposed. The original data is abstracted into GraphX graph data structure in Spark, and then the data cache is used. The algorithm is proved to be correct in local mode by using GraphX framework and Scala language, but the time efficiency of the algorithm is obviously lower than that of the original algorithm under the same data set. In order to improve the efficiency of the algorithm, the algorithm is optimized by optimizing the loop and building the index. After being verified in the local mode, the experiment is carried out in Spark cluster by using the graph data set with more than 10000 edges. The optimized algorithm can not only solve the large scale directed graph loop search problem with multiple edges, but also improve the efficiency of execution. Finally, this paper designs and implements a friendly and operable VAT invoice detection system. The test shows that the system can search out the capital loop from the fund transaction flow and inquire the detailed information of the related account of the loop, which provides an important basis for the audit department to detect the false issuance of VAT invoices and saves the cost of audit.
【學(xué)位授予單位】:西安科技大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:F812.42;O157.5

【參考文獻】

相關(guān)期刊論文 前10條

1 崔淑芬;萬建國;;治理“虛開增值稅發(fā)票罪”應(yīng)與時俱進[J];新理財;2016年12期

2 胡巍;;增值稅專用發(fā)票“環(huán)開”的危害與防范[J];稅務(wù)研究;2016年09期

3 侯斌;涂紅兵;王云福;;基于CXF框架的可配置化企業(yè)內(nèi)容管理平臺Web服務(wù)[J];電信科學(xué);2016年05期

4 秦昆;朱文武;高艷龍;張俊青;宋雯;彭揚;李方舟;;最小獨立閉合環(huán)深度優(yōu)先算法的一點改進[J];測繪科學(xué)技術(shù)學(xué)報;2015年06期

5 王詔遠(yuǎn);王宏杰;邢煥來;李天瑞;;基于Spark的蟻群優(yōu)化算法[J];計算機應(yīng)用;2015年10期

6 馬洪磊;劉成龍;余樂義;孟凡超;;一種高效的最小獨立閉合環(huán)自動搜索算法[J];測繪工程;2014年08期

7 歐龍;;一種新的閉合環(huán)自動搜索算法[J];柳州師專學(xué)報;2014年01期

8 陳濤;秦天亦;陳曉森;王華;;最小獨立閉合環(huán)搜索算法在探測InSAR相位解纏誤差中的應(yīng)用[J];大地測量與地球動力學(xué);2013年04期

9 史青;王子平;李朝柱;栗紅宇;;生成樹算法在最小獨立閉合環(huán)搜索中的應(yīng)用[J];測繪地理信息;2013年01期

10 李靖;朱麗強;;改進的最小獨立閉合環(huán)剝蝕搜索算法研究[J];城市勘測;2012年04期

相關(guān)碩士學(xué)位論文 前8條

1 邴勝男;黑龍江國稅增值稅發(fā)票管控系統(tǒng)的設(shè)計與實現(xiàn)[D];哈爾濱工業(yè)大學(xué);2015年

2 李文棟;基于Spark的大數(shù)據(jù)挖掘技術(shù)的研究與實現(xiàn)[D];山東大學(xué);2015年

3 梁彥;基于分布式平臺Spark和YARN的數(shù)據(jù)挖掘算法的并行化研究[D];中山大學(xué);2014年

4 郭啟文;基于數(shù)據(jù)倉庫的增值稅監(jiān)控分析系統(tǒng)設(shè)計[D];山東大學(xué);2014年

5 陳美先;圖算法的并行化[D];上海交通大學(xué);2013年

6 鄭曼;基于SAS系統(tǒng)的稅務(wù)數(shù)據(jù)挖掘研究[D];鄭州大學(xué);2012年

7 張敏;云計算環(huán)境下的并行數(shù)據(jù)挖掘策略研究[D];南京郵電大學(xué);2011年

8 孫良清;跨國企業(yè)增值稅發(fā)票處理系統(tǒng)的設(shè)計與實現(xiàn)[D];北京郵電大學(xué);2008年



本文編號:2073231

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/guanlilunwen/shuishoucaizhenglunwen/2073231.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶e0d8e***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com