Research on the policy optimization with penalized point probability distance algorithm in real-time bidding for display advertising
2022, Issue 2: 461-467
doi:10.19734/j.issn.1001-3695.2021.07.0264