Many sequential decision making problems, including pool-based active learning and adaptive viral marketing, can be formulated as an adaptive submodular maximization problem. Most existing studies on adaptive submodular optimization focus on either the monotone case or the non-monotone case. Specifically, if the utility function is monotone and adaptive submodular, \cite{golovin2011adaptive} developed a greedy policy that achieves a $(1-1/e)$ approximation ratio subject to a cardinality constraint. If the utility function is non-monotone and adaptive submodular, \cite{tang2021beyond} showed that a random greedy policy achieves a $1/e$ approximation ratio subject to a cardinality constraint. In this work, we aim to generalize both results by studying the partial-monotone adaptive submodular maximization problem. To this end, we introduce the notion of the adaptive monotonicity ratio $m\in[0,1]$ to measure the degree of monotonicity of a function. Our main result shows that a random greedy policy achieves an approximation ratio of $m(1-1/e)+(1-m)(1/e)$ if the utility function is $m$-adaptive monotone and adaptive submodular. Notably, this result recovers the aforementioned $(1-1/e)$ and $1/e$ approximation ratios when $m = 1$ and $m = 0$, respectively. We further extend our results to a knapsack constraint, showing that a sampling-based policy achieves an approximation ratio of $(m+1)/10$ if the utility function is $m$-adaptive monotone and adaptive submodular. One important implication of our results is that, even for a non-monotone utility function, we can still achieve an approximation ratio close to $(1-1/e)$ if this function is ``close'' to a monotone function. This leads to improved performance bounds for many machine learning applications whose utility functions are nearly adaptive monotone.
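As a quick sanity check on the endpoint claim, substituting $m=1$ and $m=0$ into the stated bound $m(1-1/e)+(1-m)(1/e)$ gives
\begin{align*}
m = 1: &\quad 1\cdot(1-1/e) + 0\cdot(1/e) = 1-1/e,\\
m = 0: &\quad 0\cdot(1-1/e) + 1\cdot(1/e) = 1/e,
\end{align*}
matching the monotone guarantee of \cite{golovin2011adaptive} and the non-monotone guarantee of \cite{tang2021beyond}, respectively.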