The goal of reinforcement learning is to find out a policy, which happens to be a mapping from states to actions, that maximizes the expected cumulative reward with time.It will allow us to lessen the dimension of your data without the need of A great deal loss of knowledge. PCA lowers the dimension by getting a handful of orthogonal linear combina