Hacker News
TheCabin
on May 20, 2018
on:
The Markov Property, Chain, Reward Process and Dec...
For my taste, the article is a bit too high-level to build a meaningful intuition. I think a more complex example would be beneficial.
thebillkidy
on May 20, 2018
Thanks a lot for your feedback! I am currently working through the high-level definitions of Reinforcement Learning so that I can build on them and go deeper into the more advanced topics. I'd like to write a series that explains the OpenAI examples :)