RSS Feed

Link: http://direct.ecency.com
Image: http://direct.ecency.com/logo512.png
Generator: RSS for Node
Last build date: Tue, 21 Apr 2026 18:51:24 GMT

AI Study Notes: The Exploration-Exploitation Dilemma in Reinforcement Learning
http://direct.ecency.com/ai/@hongtao/ai-exploration-exploitation
Thu, 17 Jan 2019 17:14:42 GMT

AI Study Notes: Model-Based Reinforcement Learning
http://direct.ecency.com/ai/@hongtao/ai-model-based
Fri, 11 Jan 2019 15:56:00 GMT

AI Study Notes: Actor-Critic Reinforcement Learning
http://direct.ecency.com/ai/@hongtao/ai-actor-critic
Fri, 11 Jan 2019 14:30:33 GMT

AI Study Notes: Policy-Based Reinforcement Learning
http://direct.ecency.com/ai/@hongtao/44w57n-ai
Fri, 11 Jan 2019 14:00:54 GMT

AI Study Notes: Value Function Approximation in Reinforcement Learning (3)
http://direct.ecency.com/ai/@hongtao/ai-value-function-approximation-3
Thu, 06 Dec 2018 23:44:18 GMT

AI Study Notes: Value Function Approximation in Reinforcement Learning (2)
http://direct.ecency.com/ai/@hongtao/ai-value-function-approximation-2
Thu, 06 Dec 2018 21:57:24 GMT

AI Study Notes: Value Function Approximation in Reinforcement Learning (1)
http://direct.ecency.com/ai/@hongtao/ai-value-function-approximation-1
Wed, 05 Dec 2018 12:25:03 GMT

AI Study Notes: Model-Free Prediction in Reinforcement Learning, Solving the Prediction Problem in an Unknown Environment
http://direct.ecency.com/ai/@hongtao/ai-model-free-prediction
Wed, 17 Oct 2018 15:09:30 GMT