Abstract
Real-Time Bidding (RTB) is one of the most important forms of online advertising, where an auction is hosted in real time to sell the individual ad impression. How to design an automated bidding strategy in response to the dynamic auction environment is crucial for improving user experience, protecting the interests of advertisers, and promoting the long-term development of the advertising platform. As an exciting topic in the real-world industry, it has attracted great research interest from several disciplines, most notably data science. There have been abundant studies on bidding strategy design which are based on the large volume of historical ad requests. Despite its popularity and significance, few works provide a summary for bid optimization. In this survey, we present the latest overview of the recent works to shed light on the optimization techniques where most of them are validated in practice. We first explore the optimization problem in different works, explaining how these different settings affect the bidding strategy designs. Then, some forms of bidding functions and specific optimization techniques are illustrated. Further, we specifically discuss a new trend about bidding in first-price auctions, which have gradually become popular in recent years. From this survey, both practitioners and researchers can gain insights of the challenges and future prospects of bid optimization in RTB.
- [1] . 2014. Budget pacing for targeted online advertisements at Linkedin. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1613–1619.Google ScholarDigital Library
- [2] . 1999. Constrained Markov Decision Processes: Stochastic Modeling. Routledge.Google Scholar
- [3] . 2018. Deep neural net with attention for multi-channel multi-touch attribution. arXiv preprint arXiv:1809.02230 (2018).Google Scholar
- [4] . 2006. The lovely but lonely Vickrey auction. Combinatorial Auctions (2006).Google Scholar
- [5] . 2020. GMCM: Graph-based micro-behavior conversion model for post-click conversion rate estimation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2201–2210.Google ScholarDigital Library
- [6] . 1993. Development of the PID controller. IEEE Control Systems Magazine 13, 6 (1993), 58–62.Google ScholarCross Ref
- [7] . [n. d.]. Rolling out first price auctions to Google Ad Manager partners.Google Scholar
- [8] . 2021. Causal models for real time bidding with repeated user interactions. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 75–85.Google ScholarDigital Library
- [9] . 2017. Real-time bidding by reinforcement learning in display advertising. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. 661–670.Google ScholarDigital Library
- [10] . 2014. Modeling delayed feedback in display advertising. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1097–1105.Google ScholarDigital Library
- [11] . 2020. Online display advertising markets: A literature review and future directions. Information Systems Research 31, 2 (2020), 556–575.Google ScholarDigital Library
- [12] . 2011. Bid landscape forecasting in online ad exchange marketplace. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 265–273.Google ScholarDigital Library
- [13] . 2018. Implicit quantile networks for distributional reinforcement learning. In International Conference on Machine Learning. PMLR, 1096–1105.Google Scholar
- [14] . 1986. Long-run competition in capacity, short-run competition in price, and the Cournot model. The RAND Journal of Economics (1986), 404–415.Google ScholarCross Ref
- [15] . 2021. Towards efficient auctions in an auto-bidding world. In Proceedings of the Web Conference 2021. 3965–3973.Google ScholarDigital Library
- [16] . 2021. First-price auctions in online display advertising. Journal of Marketing Research 58, 5 (2021), 888–907.Google ScholarCross Ref
- [17] . 2017. Attribution modeling increases efficiency of bidding in display advertising. In Proceedings of the ADKDD’17. 1–6.Google ScholarDigital Library
- [18] . 2013. Feedback Control Theory. Courier Corporation.Google Scholar
- [19] . 2022. Risk-aware bid optimization for online display advertisement. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 457–467.Google ScholarDigital Library
- [20] . 2021. Convergence analysis of no-regret bidding algorithms in repeated auctions. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 5399–5406.Google ScholarCross Ref
- [21] . 1989. Model predictive control: Theory and practice—A survey. Automatica 25, 3 (1989), 335–348.Google ScholarDigital Library
- [22] . 2016. Joint optimization of multiple performance metrics in online video advertising. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 471–480.Google ScholarDigital Library
- [23] . 2020. Bid shading in the brave new world of first-price auctions. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2453–2460.Google ScholarDigital Library
- [24] . 2017. Profit maximization for online advertising demand-side platforms. In Proceedings of the ADKDD’17. 1–7.Google ScholarDigital Library
- [25] . 2019. Recurrent neural networks for stochastic control in real-time bidding. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2801–2809.Google ScholarDigital Library
- [26] . 2021. Multi-agent cooperative bidding games for multi-objective optimization in e-commercial sponsored search. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2899–2909.Google ScholarDigital Library
- [27] . 2018. A review of multi-objective optimization: Methods and its applications. Cogent Engineering 5, 1 (2018), 1502242.Google ScholarCross Ref
- [28] . 2017. DeepFM: A factorization-machine based neural network for CTR prediction. In IJCAI.Google Scholar
- [29] . 2021. We know what you want: An advertising strategy recommender system for online advertising. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2919–2927.Google ScholarDigital Library
- [30] . 2020. Learning to bid optimally and efficiently in adversarial first-price auctions. arXiv preprint arXiv:2007.04568 (2020).Google Scholar
- [31] . 2021. A unified solution to constrained bidding in online display advertising. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2993–3001.Google ScholarDigital Library
- [32] . 1997. The widespread use of odd pricing in the retail sector. Marketing Bulletin-Department Of Marketing Massey University 8 (1997), 53–58.Google Scholar
- [33] . 1998. Multiagent reinforcement learning: Theoretical framework and an algorithm. In ICML, Vol. 98. Citeseer, 242–250.Google Scholar
- [34] . 2018. Optimization of a SSP’s header bidding strategy using Thompson sampling. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 425–432.Google ScholarDigital Library
- [35] . 2018. Real-time bidding with multi-agent reinforcement learning in display advertising. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2193–2201.Google ScholarDigital Library
- [36] . 2020. Feedback control in programmatic advertising: The frontier of optimization in real-time bidding. IEEE Control Systems Magazine 40, 5 (2020), 40–77.Google ScholarCross Ref
- [37] . 2021. Adaptive bid shading optimization of first-price ad inventory. In 2021 American Control Conference (ACC). IEEE, 4983–4990.Google ScholarCross Ref
- [38] . 1953. Sequential minimax search for a maximum. Proceedings of the American Mathematical Society 4, 3 (1953), 502–506.Google ScholarCross Ref
- [39] . 2013. Auto-encoding variational Bayes. arXiv preprint arXiv:1312.6114 (2013).Google Scholar
- [40] . 2017. Ad serving with multiple KPIs. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1853–1861.Google ScholarDigital Library
- [41] . 2009. Auction Theory. Academic Press.Google Scholar
- [42] . 2019. Addressing delayed feedback for continuous training with neural networks in CTR prediction. In Proceedings of the 13th ACM Conference on Recommender Systems. 187–195.Google ScholarDigital Library
- [43] . 2022. Arbitrary distribution modeling with censorship in real-time bidding advertising. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3250–3258.Google ScholarDigital Library
- [44] . 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015).Google Scholar
- [45] . 2016. Combining powers of two predictors in optimizing real-time bidding strategy under constrained budget. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 2143–2148.Google ScholarDigital Library
- [46] . 2000. Value at risk. Financial Analysts Journal 56, 2 (2000), 47–67.
DOI: Google ScholarCross Ref - [47] . 2021. Neural auction: End-to-end learning of auction mechanisms for e-commerce advertising. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 3354–3364.Google ScholarDigital Library
- [48] . 2019. Reinforcement learning with sequential information clustering in real-time bidding. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1633–1641.Google ScholarDigital Library
- [49] . 2018. Modeling task relationships in multi-task learning with multi-gate mixture-of-experts. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1930–1939.Google ScholarDigital Library
- [50] . 2018. Entire space multi-task model: An effective approach for estimating post-click conversion rate. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 1137–1140.Google ScholarDigital Library
- [51] . 2018. Optimal bidding strategy for brand advertising. In IJCAI. 424–432.Google Scholar
- [52] . 2016. Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning. PMLR, 1928–1937.Google ScholarDigital Library
- [53] . 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529–533.Google ScholarCross Ref
- [54] . 2020. Unbiased lift-based bidding system. arXiv preprint arXiv:2007.04002 (2020).Google Scholar
- [55] . 1981. Optimal auction design. Mathematics of Operations Research 6, 1 (1981), 58–73.Google ScholarDigital Library
- [56] . 2009. Handbook of PI and PID Controller Tuning Rules. World Scientific.Google ScholarCross Ref
- [57] . 2013. (More) efficient reinforcement learning via posterior sampling. Advances in Neural Information Processing Systems 26 (2013).Google Scholar
- [58] . 2023. Deep landscape forecasting in multi-slot real-time bidding. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 4685–4695.Google ScholarDigital Library
- [59] . 2020. Why Do Competitive Markets Converge to First-Price Auctions?596–605.Google Scholar
- [60] . 2020. Bid shading by win-rate estimation and surplus maximization. arXiv preprint arXiv:2009.09259 (2020).Google Scholar
- [61] . 2020. User behavior retrieval for click-through rate prediction. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2347–2356.Google ScholarDigital Library
- [62] . 2016. Product-based neural networks for user response prediction. In 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE, 1149–1154.Google ScholarCross Ref
- [63] . 2018. Learning multi-touch conversion attribution with dual-attention mechanisms for online advertising. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1433–1442.Google ScholarDigital Library
- [64] . 2019. Deep landscape forecasting for real-time bidding advertising. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 363–372.Google ScholarDigital Library
- [65] . 2017. Bidding machine: Learning to bid for directly optimizing profits in display advertising. IEEE Transactions on Knowledge and Data Engineering 30, 4 (2017), 645–659.Google ScholarCross Ref
- [66] . 2016. User response learning for directly optimizing campaign performance in display advertising. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. 679–688.Google ScholarDigital Library
- [67] . 2010. Factorization machines. In 2010 IEEE International Conference on Data Mining. IEEE, 995–1000.Google ScholarDigital Library
- [68] . 2009. Gaussian mixture models. Encyclopedia of Biometrics 741, 659-663 (2009).Google ScholarCross Ref
- [69] . 2010. Algorithmic game theory. Commun. ACM 53, 7 (2010), 78–86.Google ScholarDigital Library
- [70] . 2008. Entropic risk constraints for utility maximization. Festschrift in Celebration of Prof. Dr. Wilfried Grecksch’s 60th Birthday (2008), 149–180.Google Scholar
- [71] . 2021. One model to serve all: Star topology adaptive recommender for multi-domain CTR prediction. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 4104–4113.Google ScholarDigital Library
- [72] . 2012. Partially observable Markov Decision Processes. In Reinforcement Learning. Springer, 387–414.Google ScholarCross Ref
- [73] . 2018. Reinforcement Learning: An Introduction. MIT Press.Google ScholarDigital Library
- [74] . 2017. Multiagent cooperation and competition with deep reinforcement learning. PloS One 12, 4 (2017), e0172395.Google ScholarCross Ref
- [75] . 2020. Optimized cost per mille in feeds advertising. In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems. 1359–1367.Google ScholarDigital Library
- [76] . 2022. ROI-constrained bidding via curriculum-guided Bayesian reinforcement learning. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 4021–4031.Google ScholarDigital Library
- [77] . 2016. Display advertising with real-time bidding (RTB) and behavioural targeting. arXiv preprint arXiv:1610.03013 (2016).Google Scholar
- [78] . 2017. Deep & cross network for ad click predictions. In Proceedings of the ADKDD’17. 1–7.Google ScholarDigital Library
- [79] . 2017. LADDER: A human-level bidding agent for large-scale real-time online auctions. arXiv preprint arXiv:1708.05565 (2017).Google Scholar
- [80] . 2022. A cooperative-competitive multi-agent framework for auto-bidding in online advertising. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 1129–1139.Google ScholarDigital Library
- [81] . 2020. Entire space multi-task modeling via post-click behavior decomposition for conversion rate prediction. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2377–2386.Google ScholarDigital Library
- [82] . 2017. GSP: The Cinderella of mechanism design. In Proceedings of the 26th International Conference on World Wide Web. 25–32.Google ScholarDigital Library
- [83] . 2018. Budget constrained bidding by model-free reinforcement learning in display advertising. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1443–1451.Google ScholarDigital Library
- [84] . 2022. Graph convolution machine for context-aware recommender system. Frontiers of Computer Science 16, 6 (2022), 166614.Google ScholarDigital Library
- [85] . 2015. Predicting winning price in real time bidding with censored data. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1305–1314.Google ScholarDigital Library
- [86] . 2015. Smart pacing for effective online ad campaign optimization. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2217–2226.Google ScholarDigital Library
- [87] . 2016. Lift-based bidding in ad selection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 30.Google ScholarCross Ref
- [88] . 2019. Bid optimization by multivariable control in display advertising. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1966–1974.Google ScholarDigital Library
- [89] . 2019. AiAds: Automated and intelligent advertising system for sponsored search. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1881–1890.Google ScholarDigital Library
- [90] . 2014. An empirical study of reserve price optimisation in real-time bidding. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1897–1906.Google ScholarDigital Library
- [91] . 2017. Managing risk of bidding in display advertising. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. 581–590.Google ScholarDigital Library
- [92] . 2021. MEOW: A space-efficient nonparametric bid shading algorithm. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 3928–3936.Google ScholarDigital Library
- [93] . 2016. Optimal real-time bidding frameworks discussion. arXiv preprint arXiv:1602.01007 (2016).Google Scholar
- [94] . 2016. Feedback control of real-time display advertising. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining. 407–416.Google ScholarDigital Library
- [95] . 2014. Optimal real-time bidding for display advertising. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1077–1086.Google ScholarDigital Library
- [96] . 2014. Real-time bidding benchmarking with iPinYou dataset. arXiv preprint arXiv:1407.7073 (2014).Google Scholar
- [97] . 2016. Bid-aware gradient descent for unbiased learning with censored data in display advertising. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 665–674.Google ScholarDigital Library
- [98] . 2018. Deep reinforcement learning for sponsored search real-time bidding. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1021–1030.Google ScholarDigital Library
- [99] . 2019. Deep reinforcement learning for search, recommendation, and online advertising: A survey. ACM SIGWEB NewsletterSpring (2019), 1–15.Google ScholarDigital Library
- [100] . 2019. Deep interest evolution network for click-through rate prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 5941–5948.Google ScholarDigital Library
- [101] . 2018. Deep interest network for click-through rate prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1059–1068.Google ScholarDigital Library
- [102] . 2021. An efficient deep distribution network for bid shading in first-price auctions. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 3996–4004.Google ScholarDigital Library
- [103] . 2008. Budget constrained bidding in keyword auctions and online knapsack problems. In International Workshop on Internet and Network Economics. Springer, 566–576.Google Scholar
- [104] . 2021. AIM: Automatic interaction machine for click-through rate prediction. IEEE Transactions on Knowledge and Data Engineering (2021).Google Scholar
- [105] . 2017. Optimized cost per click in Taobao display advertising. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2191–2200.Google ScholarDigital Library
- [106] . 2017. A gamma-based regression for winning price estimation in real-time bidding advertising. In 2017 IEEE International Conference on Big Data (Big Data). IEEE, 1610–1619.Google ScholarCross Ref
Index Terms
- A Survey on Bid Optimization in Real-Time Bidding Display Advertising
Recommendations
Optimal real-time bidding for display advertising
KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data miningIn this paper we study bid optimisation for real-time bidding (RTB) based display advertising. RTB allows advertisers to bid on a display ad impression in real time when it is being generated. It goes beyond contextual advertising by motivating the ...
Scalable Bid Landscape Forecasting in Real-Time Bidding
Machine Learning and Knowledge Discovery in DatabasesAbstractIn programmatic advertising, ad slots are usually sold using second-price (SP) auctions in real-time. The highest bidding advertiser wins but pays only the second highest bid (known as the winning price). In SP, for a single item, the dominant ...
An empirical study of reserve price optimisation in real-time bidding
KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data miningIn this paper, we report the first empirical study and live test of the reserve price optimisation problem in the context of Real-Time Bidding (RTB) display advertising from an operational environment. A reserve price is the minimum that the auctioneer ...
Comments