DOI: 10.1145/1160633.1160681

Security in multiagent systems by policy randomization

Published: 08 May 2006

ABSTRACT

Security in multiagent systems is commonly defined as the ability of the system to deal with intentional threats from other agents. This paper focuses on domains where such intentional threats come from unseen adversaries whose actions and payoffs are unknown. In such domains, action randomization can effectively degrade an adversary's ability to predict and exploit an agent's or agent team's actions. Unfortunately, little attention has been paid to intentionally randomizing agents' policies in single-agent or decentralized (PO)MDPs without significantly sacrificing reward or breaking down team coordination. This paper provides two key contributions to remedy this situation. First, it presents three novel algorithms, one based on a non-linear program and two based on linear programs (LPs), that randomize single-agent policies while attaining a given level of expected reward. Second, it presents Rolling Down Randomization (RDR), a new algorithm that efficiently generates randomized policies for decentralized POMDPs via the single-agent LP method.
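The paper's algorithms solve this entropy-versus-reward trade-off over full (PO)MDP policies. As a toy illustration only (not the paper's method), the trade-off has a closed form in a single-state, two-action case: maximize the entropy of the action distribution subject to a floor on expected reward. All function names below are hypothetical; this is a minimal sketch of the idea, not an implementation of the paper's LPs.

```python
import math

def entropy(p):
    """Shannon entropy (in nats) of a Bernoulli action distribution."""
    if p in (0.0, 1.0):
        return 0.0
    return -(p * math.log(p) + (1 - p) * math.log(1 - p))

def max_entropy_policy(r_hi, r_lo, reward_floor):
    """Probability of playing the higher-reward action that maximizes
    entropy subject to an expected-reward floor.

    The constraint p*r_hi + (1-p)*r_lo >= reward_floor rearranges to
    p >= (reward_floor - r_lo) / (r_hi - r_lo).  Entropy peaks at
    p = 0.5 and decreases monotonically toward p = 1, so the optimum
    is whichever is larger: 0.5 or that lower bound.
    """
    if reward_floor > r_hi:
        raise ValueError("reward floor exceeds the best achievable reward")
    if r_hi == r_lo:
        p_min = 0.0
    else:
        p_min = max(0.0, (reward_floor - r_lo) / (r_hi - r_lo))
    return max(p_min, 0.5)

# With rewards 10 and 4, the maximally random (uniform) policy earns 7 in
# expectation; demanding at least 8 forces the policy partway toward
# determinism (p = 2/3), sacrificing some entropy for reward.
p = max_entropy_policy(10, 4, 8)   # -> 2/3
```

In the paper's setting the same tension appears over multi-step policies, where the feasible set is defined by the MDP's flow constraints rather than a single inequality, hence the need for LP-based methods.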


  • Published in

    AAMAS '06: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
    May 2006
    1631 pages
    ISBN: 1595933034
    DOI: 10.1145/1160633

    Copyright © 2006 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States



    Acceptance Rates

    Overall Acceptance Rate: 1,155 of 5,036 submissions, 23%
