"Mori, S.,Tangkaratt, V.,Zhao, T.,Morimoto, J.,&Sugiyama, M.","Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation","IBISML2012-95","IEICE Technical Report",,,,"pp. 17-24",2013,