Cita APA (7th ed.)

(2018). Deep reinforcement learning hands-on: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / Maxim Lapan.

Cita Chicago (17th ed.)

Deep Reinforcement Learning Hands-on: Apply Modern RL Methods, with Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More / Maxim Lapan. 2018.

Cita MLA (9th ed.)

Deep Reinforcement Learning Hands-on: Apply Modern RL Methods, with Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More / Maxim Lapan. 2018.

Atenció: Aquestes cites poden no estar 100% correctes.