一種生產(chǎn)銷售系統(tǒng)的生產(chǎn)及庫存控制優(yōu)化研究
[Abstract]:The upstream and downstream enterprises of supply chain can be regarded as the relationship between buyer and seller, and the upstream enterprises do not consider the demand of downstream enterprises, which leads to the imbalance of supply and demand between enterprises. Therefore, it is of great significance to study the production and sales problems between upstream and downstream enterprises in order to realize the highest profit of each enterprise in the long run. Firstly, a production and sales system based on demand driven by single retailer is established. The system consists of an upstream enterprise (manufacturer) and a downstream enterprise (retailer). The two enterprises have independent inventory units and decision-making process. The manufacturer can be regarded as a demand-driven delivery and delivery station system, and the inventory status of the finished product store is affected by the retailer's replenishment order and its own production capacity. The manufacturer's control variable is the forward distance of the processing station. The optimization goal is to find the optimal forward view control strategy. Considering the arrival time of the workpiece in the system, the processing time of the workpiece is difficult to be accurately determined. In this paper, a model independent reinforcement learning algorithm is used to solve the optimal or suboptimal forward view control strategy. The retailer adopts dynamic replenishment strategy and the inventory status is affected by the random customer demand and the actual replenishment volume of the manufacturer. The retailer's control variable is restocking point, and the optimization goal is to find the optimal inventory control strategy. Because of the dynamic randomness of customer demand, the reinforcement learning algorithm is used to solve the optimal control problem of retailer. Considering that manufacturers usually replenish more than one retailer in practice, this paper further studies the optimal control of a multi-retailer production and sales system. There is a static game relationship among many retailers in the system, that is, the replenishment demand of each retailer has an order in time, and each retailer does not know the replenishment strategy of the other retailers. The state of the finished goods warehouse changes immediately after the manufacturer completes the distribution according to the time order of the replenishment order. Finally, the reinforcement learning algorithm is used to solve the optimal or sub-optimal control strategy of manufacturers and multiple retailers, and the game relationship between multiple retailers is analyzed from three aspects: customer arrival rate, warehouse capacity and replenishment period.
【學(xué)位授予單位】:合肥工業(yè)大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:F274
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 王道平;張大川;楊岑;;基于加權(quán)網(wǎng)絡(luò)的敏捷供應(yīng)鏈知識(shí)服務(wù)網(wǎng)絡(luò)演化[J];系統(tǒng)管理學(xué)報(bào);2017年01期
2 陳群;韓景倜;;隨機(jī)訂貨成本擾動(dòng)下零售商魯棒與機(jī)會(huì)約束訂貨策略[J];統(tǒng)計(jì)與決策;2017年01期
3 徐廣業(yè);張旭梅;;風(fēng)險(xiǎn)規(guī)避下基于價(jià)格折扣的雙渠道供應(yīng)鏈協(xié)調(diào)[J];系統(tǒng)管理學(xué)報(bào);2016年06期
4 武玉英;李俊濤;蔣國瑞;;隨機(jī)需求下供應(yīng)鏈企業(yè)訂購多Agent協(xié)商模型[J];計(jì)算機(jī)應(yīng)用與軟件;2016年03期
5 葉一軍;顧新;李暉;陳一君;;基于大規(guī)模訂單生產(chǎn)模式的變革管理研究[J];軟科學(xué);2016年01期
6 王翔;張群;;多供應(yīng)商—單制造商—多經(jīng)銷商的供應(yīng)鏈協(xié)同度研究[J];北京工商大學(xué)學(xué)報(bào)(社會(huì)科學(xué)版);2015年01期
7 苗國英;馬倩;;多智能體系統(tǒng)的協(xié)調(diào)控制研究綜述[J];南京信息工程大學(xué)學(xué)報(bào)(自然科學(xué)版);2013年05期
8 趙曉華;李振龍;陳陽舟;榮建;;Q學(xué)習(xí)中基于模糊規(guī)則的強(qiáng)化函數(shù)設(shè)計(jì)方法[J];模式識(shí)別與人工智能;2008年02期
9 郭銳;吳敏;彭軍;彭姣;曹衛(wèi)華;;一種新的多智能體Q學(xué)習(xí)算法[J];自動(dòng)化學(xué)報(bào);2007年04期
10 鄧亞偉;;基于需求驅(qū)動(dòng)供應(yīng)鏈庫存管理的探討[J];現(xiàn)代企業(yè);2007年04期
,本文編號(hào):2341581
本文鏈接:http://sikaile.net/guanlilunwen/gongyinglianguanli/2341581.html