Central to the design of many engineering systems and social networks is to solve the underlying resource sharing and exchange problems, in which multiple decentralized agents make sequential decisions over time to optimize some long-term performance metrics. It is challenging for the decentralized agents to make optimal sequential decisions because of the complicated coupling among the agents and across time. In this dissertation, we mainly focus on three important classes of multi-agent sequential resource sharing and exchange problems and derive optimal solutions to them.
First, we study multi-agent resource sharing with imperfect monitoring, in which self-interested agents have imperfect monitoring of the resource usage and inflict strong negative externality (i.e. strong interference and congestion) among each other. Despite of the imperfect monitoring, the strong negative externality, and the self-interested agents, we propose an optimal, distributed, easy-to-implement resource sharing policy that achieves Pareto optimal outcomes at the equilibrium. A key feature of the optimal resource sharing policy is that it is nonstationary, namely it makes decisions based on the history of past (imperfect)
monitoring of the resource usages. The applications of our proposed design in wireless spectrum sharing problems enable us to improve the spectrum efficiency by up to 200% and achieve up to 90% energy saving, compared to state-of-the-art (stationary) spectrum sharing policies.
Second, we study multi-agent resource sharing with decentralized information, in which each agent has a private, independently and stochastically changing state (whose transition may depend on the agent's action), and the agents' actions are coupled through resource sharing constraints. Despite of the decentralized information (i.e. private states), we propose distributed resource sharing policies that achieve the social optimum, and apply the proposed policies to demand-side management in smart grids, and joint resource allocation and packet scheduling in wireless video transmissions. The proposed policies demonstrate significant performance gains over existing myopic policies that do not take into account the state dynamics and the policies based on Lyapunov optimization that were proposed for single-agent problems.
Finally, we study multi-agent resource exchange with imperfect monitoring, in which self-interested, anonymous agents exchange services (e.g. task solving in crowdsourcing platforms, file sharing in peer-to-peer networks, answering in question-and-answer forums). Due to the anonymity of the agents and the lack of fixed partners, free-riding is prevalent, and can be addressed by rating protocols.
We propose the first rating protocol that can achieve the social optimum at the equilibrium under imperfect monitoring of the service quality. A key feature of the optimal rating protocol is again that it is nonstationary, namely it recommends desirable behaviors based on the history of past rating distributions of the agents.