×

Multi-agent team cooperation: a game theory approach. (English) Zbl 1179.93025

Summary: The main goal of this work is to design a team of agents that can accomplish consensus over a common value for the agents’ output using cooperative game theory approach. A semi-decentralized optimal control strategy that was recently introduced by the authors is utilized that is based on minimization of individual cost using local information. Cooperative game theory is then used to ensure team cooperation by considering a combination of individual cost as a team cost function. Minimization of this cost function results in a set of Pareto-efficient solutions. Among the Pareto-efficient solutions the Nash-bargaining solution is chosen. The Nash-bargaining solution is obtained by maximizing the product of the difference between the costs achieved through the optimal control strategy and the one obtained through the Pareto-efficient solution. The latter solution results in a lower cost for each agent at the expense of requiring full information set. To avoid this drawback some constraints are added to the structure of the controller that is suggested for the entire team using the Linear Matrix Inequality (LMI) formulation of the minimization problem. Consequently, although the controller is designed to minimize a unique team cost function, it only uses the available information set for each agent. A comparison between the average cost that is obtained by using the above two methods is conducted to illustrate the performance capabilities of our proposed solutions.

MSC:

93A14 Decentralized systems
91A12 Cooperative games
93C15 Control/observation systems governed by ordinary differential equations
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Ait-Rami, M.; Zhou, X. Y., Linear matrix inequalities, Riccati equations, and indefinite stochastic linear quadratic controls, IEEE Transactions on Automatic Control, 45, 6, 1131-1142 (2000) · Zbl 0981.93080
[2] Anderson, B. D.O.; Moore, J. B., (Optimal control: Linear quadratic methods. Optimal control: Linear quadratic methods, Prentice Hall information and system sciences series (1990))
[3] Arcak, M. (2006). Passivity as a design tool for group coordination. In Proc. American control conference; Arcak, M. (2006). Passivity as a design tool for group coordination. In Proc. American control conference
[4] Bauso, D., Giarre, L., & Pesenti, R. (2006). Mechanism design for optimal consensus problems. In Proc. conference on decision and control; Bauso, D., Giarre, L., & Pesenti, R. (2006). Mechanism design for optimal consensus problems. In Proc. conference on decision and control · Zbl 1111.68009
[5] Bošković, J.D., Li, S. M., & Mehra, R. K. (2002). Formation flight control design in the presence of unknown leader commands. In: Proc. American control conference; Bošković, J.D., Li, S. M., & Mehra, R. K. (2002). Formation flight control design in the presence of unknown leader commands. In: Proc. American control conference
[6] Boyd, S.; Ghaoui, L. E.; Feron, E.; Balakrishnan, V., Linear matrix inequalities in system and control theory (1994), SIAM · Zbl 0816.93004
[7] Engwerda, J. C., LQ dynamic optimization and differential games (2005), John Wiley & Sons
[8] Fax, J. A. (2002). Optimal and cooperative control of vehicle formations. Ph.D. thesis; Fax, J. A. (2002). Optimal and cooperative control of vehicle formations. Ph.D. thesis
[9] Fax, J. A.; Murray, R. M., Information flow and cooperative control of vehicle formations, IEEE Transactions on Automatic Control, 49, 9, 1465-1476 (2004) · Zbl 1365.90056
[10] Gazi, V. (2002). Stability analysis of swarms. Ph.D. thesis; Gazi, V. (2002). Stability analysis of swarms. Ph.D. thesis · Zbl 1365.92143
[11] Godsil, C.; Royle, G., Algebraic graph theory (2001), Springer · Zbl 0968.05002
[12] Inalhan, G., Stipanović, D. M., & Tomlin, C. J. (2002). Decentralized optimization, with application to multiple aircraft coordination. In Proc. conference on decision and control; Inalhan, G., Stipanović, D. M., & Tomlin, C. J. (2002). Decentralized optimization, with application to multiple aircraft coordination. In Proc. conference on decision and control
[13] Jadbabaie, A. (1997). Robust, non-fragile controller synthesis using model-based fuzzy systems: A linear matrix inequality approach. Master’s thesis; Jadbabaie, A. (1997). Robust, non-fragile controller synthesis using model-based fuzzy systems: A linear matrix inequality approach. Master’s thesis
[14] Jadbabaie, A.; Lin, J.; Morse, S., Coordination of groups of mobile autonomous agents using nearest neighbor rules, IEEE Transactions on Automatic Control, 48, 6, 988-1000 (2003) · Zbl 1364.93514
[15] Lee, D., & Spong, M. W. (2006). Agreement with non-uniform information delays. In Proc. American control conference; Lee, D., & Spong, M. W. (2006). Agreement with non-uniform information delays. In Proc. American control conference
[16] Olfati-Saber, R., Flocking for multi-agent dynamic systems: Algorithms and theory, IEEE Transactions on Automatic Control, 51, 3, 401-420 (2006) · Zbl 1366.93391
[17] Olfati-Saber, R., & Murray, R.M. (2002). Distributed structural stabilization and tracking for formations of dynamic multi-agents. In: Proc. conference on decision and control; Olfati-Saber, R., & Murray, R.M. (2002). Distributed structural stabilization and tracking for formations of dynamic multi-agents. In: Proc. conference on decision and control
[18] Olfati-Saber, R., & Murray, R. M. (2003a). Agreement problems in networks with directed graphs and switching topology. In Proc. conference on decision and control; Olfati-Saber, R., & Murray, R. M. (2003a). Agreement problems in networks with directed graphs and switching topology. In Proc. conference on decision and control
[19] Olfati-Saber, R., & Murray, R. M. (2003b). Consensus protocols for networks of dynamic agents. In: Proc. American control conference; Olfati-Saber, R., & Murray, R. M. (2003b). Consensus protocols for networks of dynamic agents. In: Proc. American control conference
[20] Olfati-Saber, R., & Murray, R. M. (2003c). Flocking with obstacle avoidance: Cooperation with limited communication in mobile networks. In Proc. conference on decision and control; Olfati-Saber, R., & Murray, R. M. (2003c). Flocking with obstacle avoidance: Cooperation with limited communication in mobile networks. In Proc. conference on decision and control
[21] Olfati-Saber, R.; Murray, R. M., Consensus problems in networks of agents with switching topology and time-delays, IEEE Transactions on Automatic Control, 49, 9, 1520-1533 (2004) · Zbl 1365.93301
[22] Paley, D., Leonard, N. E., & Sepulchre, R. (2004). Collective motion: Bistability and trajectory tracking. In Proc. conference on decision and control; Paley, D., Leonard, N. E., & Sepulchre, R. (2004). Collective motion: Bistability and trajectory tracking. In Proc. conference on decision and control
[23] Raffard, R. L., Tomlin, C. J., & Boyd, S. P. (2004). Distributed optimization for cooperative agents: Application to formation flight. In Proc. conference on decision and control; Raffard, R. L., Tomlin, C. J., & Boyd, S. P. (2004). Distributed optimization for cooperative agents: Application to formation flight. In Proc. conference on decision and control
[24] Ren, W. (2007). On consensus algorithms for double-integrator dynamics. In Proc. conference on decision and control; Ren, W. (2007). On consensus algorithms for double-integrator dynamics. In Proc. conference on decision and control
[25] Ren, W.; Beard, R. W., Formation feedback control for multiple spacecraft via virtual structures, IEE Proceedings Control Theory and Applications, 151, 3, 357-368 (2004)
[26] Ren, W.; Beard, R. W., Consensus seeking in multiagent systems under dynamically changing interaction topologies, IEEE Transactions on Automatic Control, 50, 5, 655-661 (2005) · Zbl 1365.93302
[27] Semsar, E., & Khorasani, K. (2006). Adaptive formation control of UAVs in the presence of unknown vortex forces and leader commands. In Proc. American control conference; Semsar, E., & Khorasani, K. (2006). Adaptive formation control of UAVs in the presence of unknown vortex forces and leader commands. In Proc. American control conference
[28] Semsar, E., & Khorasani, K. (2007). Optimal control and game theoretic approaches to cooperative control of a team of multi-vehicle unmanned systems. In Proc. IEEE international conference on networking, sensing and control; Semsar, E., & Khorasani, K. (2007). Optimal control and game theoretic approaches to cooperative control of a team of multi-vehicle unmanned systems. In Proc. IEEE international conference on networking, sensing and control
[29] Semsar-Kazerooni, E., & Khorasani, K. (2007a). Optimal cooperation in a modified leader-follower team of agents with partial availability of leader command. In Proc. IEEE international conference on systems, man, and cybernetics; Semsar-Kazerooni, E., & Khorasani, K. (2007a). Optimal cooperation in a modified leader-follower team of agents with partial availability of leader command. In Proc. IEEE international conference on systems, man, and cybernetics
[30] Semsar-Kazerooni, E., & Khorasani, K. (2007b). Semi-decentralized optimal control of a cooperative team of agents. In Proc. IEEE international conference on system of systems engineering; Semsar-Kazerooni, E., & Khorasani, K. (2007b). Semi-decentralized optimal control of a cooperative team of agents. In Proc. IEEE international conference on system of systems engineering · Zbl 1194.49056
[31] Semsar-Kazerooni, E., & Khorasani, K. (2007c). Semi-decentralized optimal control technique for a leader-follower team of unmanned systems with partial availability of the leader command. In Proc. IEEE international conference on control and automation; Semsar-Kazerooni, E., & Khorasani, K. (2007c). Semi-decentralized optimal control technique for a leader-follower team of unmanned systems with partial availability of the leader command. In Proc. IEEE international conference on control and automation
[32] Semsar-Kazerooni, E.; Khorasani, K., Optimal consensus algorithms for cooperative team of agents subject to partial information, Automatica, 44, 11, 2766-2777 (2008) · Zbl 1194.49056
[33] Semsar-Kazerooni, E.; Khorasani, K., An optimal cooperation in a team of agents subject to partial information, International Journal of Control, 82, 3, 571-583 (2009) · Zbl 1168.93309
[34] Sinopoli, B.; Sharp, C.; Schenato, L.; Schafferthim, S.; Sastry, S., Distributed control applications within sensor networks, Proceedings of the IEEE, 91, 8, 1235-1246 (2003)
[35] Stipanović, D. M.; Inalhan, G.; Teo, R.; Tomlin, C. J., Decentralized overlapping control of a formation of unmanned aerial vehicles, Automatica, 40, 1285-1296 (2004) · Zbl 1073.93556
[36] Tanner, H. G.; Jadbabaie, A.; Pappas, G. J., Flocking in fixed and switching networks, IEEE Transactions on Automatic Control, 52, 5, 863-868 (2007) · Zbl 1366.93414
[37] Willems, J. C., Least squares stationary optimal control and the algebraic Riccati equation, IEEE Transactions on Automatic Control, 16, 6, 621-634 (1971)
[38] Xiao, F.; Wang, L., Consensus problems for high-dimensional multi-agent systems, IET Control Theory Applications, 1, 3, 830-837 (2007)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.