For my taste, the article is a bit too high-level to build a meaningful intuitio... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		TheCabin on May 20, 2018 \| parent \| context \| favorite \| on: The Markov Property, Chain, Reward Process and Dec... For my taste, the article is a bit too high-level to build a meaningful intuition. I think a more complex example would be beneficial.

thebillkidy on May 20, 2018 [–]

Thanks a lot for your feedback! Currently I am working on the high-level definitions of Reinforcement Learning, so that i can build on that to go deeper in the more advanced ones. I'd like to have a series that explains the OpenAI examples :)

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact