Hajime Kimura: List of Research Achievements | T2R2 Institute of Science Tokyo Research Repository

Hajime Kimura: List of Research Achievements (26 items)

Papers

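Hajime Kimura. Acquisition of Walking Behavior for a Four-Legged Robot by Reinforcement Learning, IEEJ Transactions on Electronics, Information and Systems, Vol. 122, No. 3, pp. 330-337, 2002.
佐藤 誠, Hajime Kimura, Shigenobu Kobayashi. A TD Algorithm for Estimating the Variance of Reward and a Proposal of a Mean-Variance Reinforcement Learning Method, Transactions of the Japanese Society for Artificial Intelligence, Vol. 16, No. 3, pp. 353-362, 2001.
荒牧 岳志, Hajime Kimura, 小俣 透, Shigenobu Kobayashi. Acquisition of Locomotion for a Two-Actuator, Five-Link Ring-Shaped Robot by Reinforcement Learning, 19th Annual Conference of the Robotics Society of Japan, 2001.
後藤 正徳, Hajime Kimura, Shigenobu Kobayashi. Learning Timeout Intervals in Transaction Processing, SICE 28th Intelligent Systems Symposium, pp. 257-262, 2001.
青木 圭, Hajime Kimura, Shigenobu Kobayashi. Control of Water Supply and Sewerage Systems by Reinforcement Learning, SICE 28th Intelligent Systems Symposium, pp. 191-196, 2001.
Hajime Kimura, Shigenobu Kobayashi. An Actor-Critic Algorithm Using Stochastic Binary-Tree Action Selection: Reinforcement Learning with a Large Number of Actions, Transactions of the Society of Instrument and Control Engineers, Vol. 37, No. 12, 2001.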
Hajime Kimura, Toru Yamashita, Shigenobu Kobayashi. Reinforcement learning of walking behavior for a four-legged robot, Proceedings of the 40th IEEE Conference on Decision and Control, 2001.
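Hajime Kimura, Shigenobu Kobayashi. Reinforcement Learning of a Two-Degree-of-Freedom Robot by a Binary-Tree-Structured Actor-Critic, SICE 13th Symposium on Decentralized Autonomous Systems, pp. 117-122, 2001.
Toru Yamashita, Hajime Kimura, Shigenobu Kobayashi. Realization of a Multi-Legged Walking Robot by Reinforcement Learning, SICE 13th Symposium on Decentralized Autonomous Systems, pp. 111-116, 2001.
井口 圭一, Hajime Kimura, Shigenobu Kobayashi. Swing-Up and Stabilization Control of a Parallel Double Inverted Pendulum by GA, SICE 13th Symposium on Decentralized Autonomous Systems, pp. 277-282, 2001.
Hajime Kimura, Shigenobu Kobayashi. An Actor-Critic Algorithm Using Eligibility Traces in the Actor: Reinforcement Learning with an Imperfect Value Function, Journal of the Japanese Society for Artificial Intelligence, Vol. 5, No. 2, pp. 267-275, 2000.
Hajime Kimura. On Handling a Large Number of Similar Actions by Reinforcement Learning Using Stochastic Binary-Tree Action Selection, SICE 27th Intelligent Systems Symposium, pp. 111-116, 2000.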
Hajime Kimura. Reinforcement Learning for Crawling Robot Motion Using Stochastic Gradient Ascent, Journal of Japanese Society for Artificial Intelligence, Vol. 14, No. 1, pp. 122-130, 1999.
Hajime Kimura, Shigenobu Kobayashi. Stochastic Real-Valued Reinforcement Learning to Solve a Non-Linear Control Problem, Proc. 1999 Int. Conf. on Systems, Man and Cybernetics, pp. V-510-515, 1999.
Hajime Kimura, Shigenobu Kobayashi. Stochastic Real-Valued Reinforcement Learning to Solve a Non-Linear Control Problem, 1999 IEEE International Conference on Systems, Man, and Cybernetics, 1999.
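Hajime Kimura, Shigenobu Kobayashi. Reinforcement Learning Using a Stochastic Gradient Method and Its Application to Robots, IEEJ Transactions on Electronics, Information and Systems, Vol. 119, No. 9, pp. 931-934, 1999.
Hajime Kimura, Shigenobu Kobayashi. Efficient Non-Linear Control by Combining Q-learning with Local Linear Controllers, Proceedings of the 16th International Conference on Machine Learning, Morgan Kaufmann Publishers, pp. 210-219, 1999.
Hajime Kimura, Kazuteru Miyazaki, Shigenobu Kobayashi. Design Guidelines for Reinforcement Learning Systems, Journal of the Society of Instrument and Control Engineers, Vol. 38, No. 10, pp. 618-623, 1999.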
Hajime Kimura, Kazuteru Miyazaki, Shigenobu Kobayashi. An Analysis of Actor/Critic Algorithms using Eligibility Traces : Reinforcement Learning with Imperfect Value Function, Proceedings of the 15th International Conference on Machine Learning, pp. 278-286, 1998.
Hajime Kimura. Reinforcement Learning for Partially Observable Markov Decision Processes, Journal of Japanese Society for Artificial Intelligence, Vol. 12, No. 6, pp. 822-830, 1997.
Hajime Kimura. Reinforcement Learning in POMDPs with Function Approximation, Proceedings of the 14th International Conference on Machine Learning, pp. 152-160, 1997.
Hajime Kimura. Reinforcement Learning in Partially Observable Markov Decision Processes: A Stochastic Gradient Method, Journal of Japanese Society for Artificial Intelligence, Vol. 11, No. 5, pp. 761-768, 1996.
Hajime Kimura. Reinforcement Learning by Stochastic Hill Climbing on Discounted Reward, Proceedings of the 12th International Conference on Machine Learning, pp. 295-303, 1995.
Hajime Kimura. Reinforcement Learning with Delayed Rewards on Continuous State Space, Proceedings of the 3rd International Conference on Fuzzy Logic, Neural Nets and Soft Computing, pp. 289-292, 1994.

Books

Hajime Kimura. Reinforcement Learning for Continuous Action using Stochastic Gradient Ascent, Intelligent Autonomous Systems, Vol. 5, pp. 288-295, 1998.

Dissertation

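Reinforcement Learning in Partially Observable Markov Decision Processes: A Stochastic Gradient Approach, Doctor of Engineering, Tokyo Institute of Technology, 1997/03/26.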
