{"data":{"id":"10.48550/arxiv.1408.1484","type":"dois","attributes":{"doi":"10.48550/arxiv.1408.1484","prefix":"10.48550","suffix":"arxiv.1408.1484","identifiers":[{"identifier":"1408.1484","identifierType":"arXiv"}],"alternateIdentifiers":[{"alternateIdentifierType":"arXiv","alternateIdentifier":"1408.1484"}],"creators":[{"name":"Peshkin, Leonid","nameType":"Personal","givenName":"Leonid","familyName":"Peshkin","affiliation":[],"nameIdentifiers":[]},{"name":"Kim, Kee-Eung","nameType":"Personal","givenName":"Kee-Eung","familyName":"Kim","affiliation":[],"nameIdentifiers":[]},{"name":"Meuleau, Nicolas","nameType":"Personal","givenName":"Nicolas","familyName":"Meuleau","affiliation":[],"nameIdentifiers":[]},{"name":"Kaelbling, Leslie Pack","nameType":"Personal","givenName":"Leslie Pack","familyName":"Kaelbling","affiliation":[],"nameIdentifiers":[]}],"titles":[{"title":"Learning to Cooperate via Policy Search"}],"publisher":"arXiv","container":{},"publicationYear":2014,"subjects":[{"lang":"en","subject":"Artificial Intelligence (cs.AI)","subjectScheme":"arXiv"},{"subject":"FOS: Computer and information sciences","subjectScheme":"Fields of Science and Technology (FOS)"},{"subject":"FOS: Computer and information sciences","schemeUri":"http://www.oecd.org/science/inno/38235147.pdf","subjectScheme":"Fields of Science and Technology (FOS)"}],"contributors":[],"dates":[{"date":"2014-08-07T06:25:37Z","dateType":"Submitted","dateInformation":"v1"},{"date":"2014-08-08T00:06:05Z","dateType":"Updated","dateInformation":"v1"},{"date":"2014-08","dateType":"Available","dateInformation":"v1"},{"date":"2014","dateType":"Issued"}],"language":null,"types":{"ris":"GEN","bibtex":"misc","citeproc":"article","schemaOrg":"CreativeWork","resourceType":"Article","resourceTypeGeneral":"Preprint"},"relatedIdentifiers":[],"relatedItems":[],"sizes":[],"formats":[],"version":"1","rightsList":[{"rights":"arXiv.org perpetual, non-exclusive license","rightsUri":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/"}],"descriptions":[{"description":"Cooperative games are those in which both agents share the same payoff structure. Value-based reinforcement-learning algorithms, such as variants of Q-learning, have been applied to learning cooperative games, but they only apply when the game state is completely observable to both agents. Policy search methods are a reasonable alternative to value-based methods for partially observable environments. In this paper, we provide a gradient-based distributed policy-search method for cooperative games and compare the notion of local optimum to that of Nash equilibrium. We demonstrate the effectiveness of this method experimentally in a small, partially observable simulated soccer domain.","descriptionType":"Abstract"},{"description":"Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)","descriptionType":"Other"}],"geoLocations":[],"fundingReferences":[],"xml":"PD94bWwgdmVyc2lvbj0iMS4wIiBlbmNvZGluZz0idXRmLTgiPz4KPHJlc291cmNlIHhtbG5zPSJodHRwOi8vZGF0YWNpdGUub3JnL3NjaGVtYS9rZXJuZWwtNCIgeG1sbnM6eHNpPSJodHRwOi8vd3d3LnczLm9yZy8yMDAxL1hNTFNjaGVtYS1pbnN0YW5jZSIgeHNpOnNjaGVtYUxvY2F0aW9uPSJodHRwOi8vZGF0YWNpdGUub3JnL3NjaGVtYS9rZXJuZWwtNCBodHRwOi8vc2NoZW1hLmRhdGFjaXRlLm9yZy9tZXRhL2tlcm5lbC00LjMvbWV0YWRhdGEueHNkIj4KICA8aWRlbnRpZmllciBpZGVudGlmaWVyVHlwZT0iRE9JIj4xMC40ODU1MC9BUlhJVi4xNDA4LjE0ODQ8L2lkZW50aWZpZXI+CiAgPGFsdGVybmF0ZUlkZW50aWZpZXJzPgogICAgPGFsdGVybmF0ZUlkZW50aWZpZXIgYWx0ZXJuYXRlSWRlbnRpZmllclR5cGU9ImFyWGl2Ij4xNDA4LjE0ODQ8L2FsdGVybmF0ZUlkZW50aWZpZXI+CiAgPC9hbHRlcm5hdGVJZGVudGlmaWVycz4KICA8Y3JlYXRvcnM+CiAgICA8Y3JlYXRvcj4KICAgICAgPGNyZWF0b3JOYW1lIG5hbWVUeXBlPSJQZXJzb25hbCI+UGVzaGtpbiwgTGVvbmlkPC9jcmVhdG9yTmFtZT4KICAgICAgPGdpdmVuTmFtZT5MZW9uaWQ8L2dpdmVuTmFtZT4KICAgICAgPGZhbWlseU5hbWU+UGVzaGtpbjwvZmFtaWx5TmFtZT4KICAgIDwvY3JlYXRvcj4KICAgIDxjcmVhdG9yPgogICAgICA8Y3JlYXRvck5hbWUgbmFtZVR5cGU9IlBlcnNvbmFsIj5LaW0sIEtlZS1FdW5nPC9jcmVhdG9yTmFtZT4KICAgICAgPGdpdmVuTmFtZT5LZWUtRXVuZzwvZ2l2ZW5OYW1lPgogICAgICA8ZmFtaWx5TmFtZT5LaW08L2ZhbWlseU5hbWU+CiAgICA8L2NyZWF0b3I+CiAgICA8Y3JlYXRvcj4KICAgICAgPGNyZWF0b3JOYW1lIG5hbWVUeXBlPSJQZXJzb25hbCI+TWV1bGVhdSwgTmljb2xhczwvY3JlYXRvck5hbWU+CiAgICAgIDxnaXZlbk5hbWU+Tmljb2xhczwvZ2l2ZW5OYW1lPgogICAgICA8ZmFtaWx5TmFtZT5NZXVsZWF1PC9mYW1pbHlOYW1lPgogICAgPC9jcmVhdG9yPgogICAgPGNyZWF0b3I+CiAgICAgIDxjcmVhdG9yTmFtZSBuYW1lVHlwZT0iUGVyc29uYWwiPkthZWxibGluZywgTGVzbGllIFBhY2s8L2NyZWF0b3JOYW1lPgogICAgICA8Z2l2ZW5OYW1lPkxlc2xpZSBQYWNrPC9naXZlbk5hbWU+CiAgICAgIDxmYW1pbHlOYW1lPkthZWxibGluZzwvZmFtaWx5TmFtZT4KICAgIDwvY3JlYXRvcj4KICA8L2NyZWF0b3JzPgogIDx0aXRsZXM+CiAgICA8dGl0bGU+TGVhcm5pbmcgdG8gQ29vcGVyYXRlIHZpYSBQb2xpY3kgU2VhcmNoPC90aXRsZT4KICA8L3RpdGxlcz4KICA8cHVibGlzaGVyPmFyWGl2PC9wdWJsaXNoZXI+CiAgPHB1YmxpY2F0aW9uWWVhcj4yMDE0PC9wdWJsaWNhdGlvblllYXI+CiAgPHN1YmplY3RzPgogICAgPHN1YmplY3QgeG1sOmxhbmc9ImVuIiBzdWJqZWN0U2NoZW1lPSJhclhpdiI+QXJ0aWZpY2lhbCBJbnRlbGxpZ2VuY2UgKGNzLkFJKTwvc3ViamVjdD4KICAgIDxzdWJqZWN0IHN1YmplY3RTY2hlbWU9IkZpZWxkcyBvZiBTY2llbmNlIGFuZCBUZWNobm9sb2d5IChGT1MpIj5GT1M6IENvbXB1dGVyIGFuZCBpbmZvcm1hdGlvbiBzY2llbmNlczwvc3ViamVjdD4KICA8L3N1YmplY3RzPgogIDxkYXRlcz4KICAgIDxkYXRlIGRhdGVUeXBlPSJTdWJtaXR0ZWQiIGRhdGVJbmZvcm1hdGlvbj0idjEiPjIwMTQtMDgtMDdUMDY6MjU6MzdaPC9kYXRlPgogICAgPGRhdGUgZGF0ZVR5cGU9IlVwZGF0ZWQiIGRhdGVJbmZvcm1hdGlvbj0idjEiPjIwMTQtMDgtMDhUMDA6MDY6MDVaPC9kYXRlPgogICAgPGRhdGUgZGF0ZVR5cGU9IkF2YWlsYWJsZSIgZGF0ZUluZm9ybWF0aW9uPSJ2MSI+MjAxNC0wODwvZGF0ZT4KICA8L2RhdGVzPgogIDxyZXNvdXJjZVR5cGUgcmVzb3VyY2VUeXBlR2VuZXJhbD0iUHJlcHJpbnQiPkFydGljbGU8L3Jlc291cmNlVHlwZT4KICA8dmVyc2lvbj4xPC92ZXJzaW9uPgogIDxyaWdodHNMaXN0PgogICAgPHJpZ2h0cyByaWdodHNVUkk9Imh0dHA6Ly9hcnhpdi5vcmcvbGljZW5zZXMvbm9uZXhjbHVzaXZlLWRpc3RyaWIvMS4wLyI+YXJYaXYub3JnIHBlcnBldHVhbCwgbm9uLWV4Y2x1c2l2ZSBsaWNlbnNlPC9yaWdodHM+CiAgPC9yaWdodHNMaXN0PgogIDxkZXNjcmlwdGlvbnM+CiAgICA8ZGVzY3JpcHRpb24gZGVzY3JpcHRpb25UeXBlPSJBYnN0cmFjdCI+Q29vcGVyYXRpdmUgZ2FtZXMgYXJlIHRob3NlIGluIHdoaWNoIGJvdGggYWdlbnRzIHNoYXJlIHRoZSBzYW1lIHBheW9mZiBzdHJ1Y3R1cmUuIFZhbHVlLWJhc2VkIHJlaW5mb3JjZW1lbnQtbGVhcm5pbmcgYWxnb3JpdGhtcywgc3VjaCBhcyB2YXJpYW50cyBvZiBRLWxlYXJuaW5nLCBoYXZlIGJlZW4gYXBwbGllZCB0byBsZWFybmluZyBjb29wZXJhdGl2ZSBnYW1lcywgYnV0IHRoZXkgb25seSBhcHBseSB3aGVuIHRoZSBnYW1lIHN0YXRlIGlzIGNvbXBsZXRlbHkgb2JzZXJ2YWJsZSB0byBib3RoIGFnZW50cy4gUG9saWN5IHNlYXJjaCBtZXRob2RzIGFyZSBhIHJlYXNvbmFibGUgYWx0ZXJuYXRpdmUgdG8gdmFsdWUtYmFzZWQgbWV0aG9kcyBmb3IgcGFydGlhbGx5IG9ic2VydmFibGUgZW52aXJvbm1lbnRzLiBJbiB0aGlzIHBhcGVyLCB3ZSBwcm92aWRlIGEgZ3JhZGllbnQtYmFzZWQgZGlzdHJpYnV0ZWQgcG9saWN5LXNlYXJjaCBtZXRob2QgZm9yIGNvb3BlcmF0aXZlIGdhbWVzIGFuZCBjb21wYXJlIHRoZSBub3Rpb24gb2YgbG9jYWwgb3B0aW11bSB0byB0aGF0IG9mIE5hc2ggZXF1aWxpYnJpdW0uIFdlIGRlbW9uc3RyYXRlIHRoZSBlZmZlY3RpdmVuZXNzIG9mIHRoaXMgbWV0aG9kIGV4cGVyaW1lbnRhbGx5IGluIGEgc21hbGwsIHBhcnRpYWxseSBvYnNlcnZhYmxlIHNpbXVsYXRlZCBzb2NjZXIgZG9tYWluLjwvZGVzY3JpcHRpb24+CiAgICA8ZGVzY3JpcHRpb24gZGVzY3JpcHRpb25UeXBlPSJPdGhlciI+QXBwZWFycyBpbiBQcm9jZWVkaW5ncyBvZiB0aGUgU2l4dGVlbnRoIENvbmZlcmVuY2Ugb24gVW5jZXJ0YWludHkgaW4gQXJ0aWZpY2lhbCBJbnRlbGxpZ2VuY2UgKFVBSTIwMDApPC9kZXNjcmlwdGlvbj4KICA8L2Rlc2NyaXB0aW9ucz4KPC9yZXNvdXJjZT4=","url":"https://arxiv.org/abs/1408.1484","contentUrl":null,"metadataVersion":0,"schemaVersion":"http://datacite.org/schema/kernel-4","source":"mds","isActive":true,"state":"findable","reason":null,"viewCount":0,"viewsOverTime":[],"downloadCount":0,"downloadsOverTime":[],"referenceCount":0,"citationCount":0,"citationsOverTime":[],"partCount":0,"partOfCount":0,"versionCount":0,"versionOfCount":0,"created":"2022-03-09T15:24:24.000Z","registered":"2022-03-09T15:24:25.000Z","published":"2014","updated":"2022-03-09T15:24:25.000Z"},"relationships":{"client":{"data":{"id":"arxiv.content","type":"clients"}},"provider":{"data":{"id":"arxiv","type":"providers"}},"media":{"data":{"id":"10.48550/arxiv.1408.1484","type":"media"}},"references":{"data":[]},"citations":{"data":[]},"parts":{"data":[]},"partOf":{"data":[]},"versions":{"data":[]},"versionOf":{"data":[]}}}}