Quantum Engineering

Research Article

Investigating the Effects of Hyperparameters in Quantum-Enhanced Deep Reinforcement Learning

Quantum enhanced deep Q-learning.

	Set replay memory M to state size N
	Initialize action-value function quantum circuit Q with arbitrary parameters θ
	For episode e = 1, 2, 3, 4, ……. E do
	Initialize State s1 from the set state S and encode it into
	the quantum state using basis encoding
	for the time step t = 1, 2, 3, …. T do
	With probability ε, select a random action a_t
	otherwise, select the optimal action at from the result of quantum circuit
	Execute the selected action a_t and see the reward r_t and the next state s_t+1
	Store transition in replay memory M
	Sample a random minibatch of transitions from the replay memory M

	Perform a gradient descent step on
	end for
	end for