在这篇短文中,我将基于过去一年中的研究经验,简单地分享一下我目前对行为决策建模的见解。
In this post, I will briefly discuss some of my current thoughts on decision behavior modeling based on my research experience over the past year.
抽象地来看,决策行为即在给定决策信息$s$(下称状态)的情况下,通过某种策略$\pi$采取动作$a$
From an abstract point of view, decision making is to take an action $a$ according to a certain strategy $\pi$ given decision information $s$ (hereinafter referred to as the state)
$$a \sim \pi(s).$$
而行为决策建模,则希望根据观察数据$\mathcal{D} = \{(s,a)\}_n$,获取决策机制$\pi$的相关信息。
While decision behavior modeling attempts to infer the strategy $\pi$ from observations $\mathcal{D} = \{(s,a)\}_n$.
由于数据集$\mathcal{D}$是有限的,能够很好地拟合该数据集的$\pi$可能有无穷多个。因此,研究者通常需要对$\pi$的形式做出额外假设。而一个最基础且核心的假设是$\pi$服从效用最大化原理。
Because the dataset $\mathcal{D}$ has finite datapoints, there can be an infinite number of $\pi$ that fits well into the dataset. Therefore, researchers often need to make additional assumptions about the form of $\pi$. And one of the most fundamental assumptions is that $\pi$ follows the utility-maximization principle.