This model computes belief state, which is the probability of the environment operating in a particular state, in a dynamic and uncertain operating environment
This model allows agents to place bids during auctions conducted by a centralized entity so that the winning agents receive rewards
Chen and Qiu [16], Jayaweera et al. [36], Fu and van der Schaar [37], and Xiao et al. [38]
Internal self-learning model
This model enables an agent to exchange its virtualactions continuously with rewards generated by a simulated internal environment within the agent itself in order to expedite the learning process
This model enables an agent to collaborate with its neighbor agents and subsequently make local decisions independently in distributed networks. A local decision is part of an optimal joint action, which is comprised of the actions taken by all the agents in a network
This model enables an agent to compete with its neighbor agents and subsequently make local decisions independently in worst-case scenarios in the presence of competitor agents, which attempt to minimize the accumulated rewards of the agent