The cross-entropy method for policy search in decentralized POMDPs