(2018). Deep reinforcement learning hands-on: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Maxim Lapan.
Chicago citation (17th ed.): Deep Reinforcement Learning Hands-on: Apply Modern RL Methods, with Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More / Maxim Lapan. 2018.
MLA citation (9th ed.): Deep Reinforcement Learning Hands-on: Apply Modern RL Methods, with Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More / Maxim Lapan. 2018.
Warning: These citations may not be 100% correct.