
Research on Recommendation Methods Based on the SAP HANA Database

Published: 2018-06-09 22:03

  Topic: SAP + HANA; Source: Beijing Forestry University, master's thesis, 2016


[Abstract]: E-commerce has developed for two decades in the Internet era. Over that time, academic research on e-commerce has advanced steadily, and studies of consumer behavior have multiplied. The ability to process large volumes of data quickly and analyze them in real time determines whether a company can respond rapidly to market changes and gain an advantage. Against this background, faster analysis has become more urgent, and SAP HANA (SAP High-Performance Analytic Appliance) emerged in response. It can analyze, store, and process big data in real time, bring out the full value of business data, and help enterprises seize opportunities and make decisions in real time. This study builds on the HANA database and the components installed on it. It uses one year of transaction data that Ponpare, a leading Japanese group-buying website, provided on the big-data competition platform Kaggle, to carry out predictive analysis. The main work of this thesis is as follows:

1. Design the overall system architecture so that all functions run smoothly in HANA. The architecture comprises a data extraction layer, a data warehouse layer, and a data processing and analysis layer. The data is initially stored in an Oracle database as the data source; EIM (Enterprise Information Management) serves as the extraction tool that loads the data into HANA; PAL and HANA-based R serve as the algorithm implementation tools for data preprocessing and analysis. Data flows freely between these components, which keeps the system coherent.

2. Use HANA PAL (Predictive Analysis Library) combined with AFM to perform data merging, missing-value imputation, and numeric normalization, producing data suitable for the study. Before data mining, the customers' browsing, purchase, and personal information and the raw coupon information are described and analyzed. The initial data provided by the website is then preprocessed to improve mining efficiency and reduce the time that mining requires.

3. Within the HANA database environment, implement the recommendation algorithm in the HANA-based R environment. First, the cbind function combines vectors and matrices into a new matrix; second, the attributes are assigned different weights; finally, the cosine similarity between user attributes and coupons is computed and sorted, yielding the 10 coupon IDs each customer is most likely to purchase. The accuracy of the recommendations is obtained by comparing the category and region of the products users actually bought with those of the recommended products.

This thesis combines data mining, which has recently become popular, with HANA, the database SAP introduced in recent years. Data migration, data preprocessing, and predictive analysis are completed using the latest components, EIM and PAL.
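The preprocessing and ranking steps described in the abstract can be illustrated with a small sketch. This is not the thesis's implementation, which used PAL and HANA-based R; it is a plain Python approximation under stated assumptions. All function names, the feature layout, the weights, and the sample data are hypothetical, and weighting is assumed to mean scaling each attribute before the cosine similarity is computed.

```python
import math


def fill_missing_with_mean(column):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in column if v is not None]
    mean = sum(observed) / len(observed) if observed else 0.0
    return [mean if v is None else v for v in column]


def min_max_normalize(column):
    """Scale values linearly into [0, 1]; a constant column maps to 0.0."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(user_vector, coupons, weights, top_k=10):
    """Rank coupon IDs by cosine similarity after scaling each attribute by its weight."""
    weighted_user = [w * x for w, x in zip(weights, user_vector)]
    scored = []
    for coupon_id, features in coupons.items():
        weighted_coupon = [w * x for w, x in zip(weights, features)]
        scored.append((cosine_similarity(weighted_user, weighted_coupon), coupon_id))
    # Highest similarity first; ties broken by coupon ID for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [coupon_id for _, coupon_id in scored[:top_k]]


# Hypothetical data: two category indicators and one normalized price per coupon.
coupons = {
    "c1": [1.0, 0.0, 0.2],
    "c2": [0.0, 1.0, 0.8],
    "c3": [1.0, 0.0, 0.9],
}
user_profile = [1.0, 0.0, 0.3]
weights = [2.0, 2.0, 1.0]  # assumed: category attributes count more than price
top_coupons = recommend(user_profile, coupons, weights, top_k=2)
```

With this toy data the user's profile is closest to `c1`, then `c3`. In the thesis the same ranking returns the 10 most similar coupon IDs per customer.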
[Degree-granting institution]: Beijing Forestry University
[Degree level]: Master's
[Year degree conferred]: 2016
[Classification number]: TP311.13; TP391.3

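[Similar literature]

Related journal articles (top 1)

1 ; SUSE helps SAP HANA achieve high availability [J]; Office Automation; 2014, No. 13

Related master's theses (top 1)

1 Huang Jiaqi; Research on Recommendation Methods Based on the SAP HANA Database [D]; Beijing Forestry University; 2016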


Article ID: 2000978


Article link: http://sikaile.net/jingjilunwen/dianzishangwulunwen/2000978.html

