RL processes such as Q-learning are based on mapping outputs to a set of input states I. If I is computable, would it be possible to train an agent by iterating through all the possible states systematically, as opposed to repeatedly running simulations and waiting for a sufficient number of states to be reached through happenstance?
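For concreteness, here is a sketch of what I mean by iterating systematically. It assumes a tiny hypothetical MDP with a known transition table `P` and reward table `R` (both made up for illustration), and sweeps every (state, action) pair instead of sampling trajectories; with a known model this amounts to value iteration on the Q-table:

```python
import numpy as np

# Hypothetical tiny MDP: 3 states, 2 actions, known deterministic dynamics.
n_states, n_actions = 3, 2
gamma = 0.9

# P[s, a] -> next state; R[s, a] -> reward. Both tables are invented
# purely for illustration.
P = np.array([[1, 2],
              [2, 0],
              [0, 1]])
R = np.array([[0.0, 1.0],
              [0.5, 0.0],
              [1.0, 0.2]])

Q = np.zeros((n_states, n_actions))

# Instead of running simulations and waiting for states to be visited
# by chance, sweep all (state, action) pairs on every pass until the
# Q-table stops changing.
for _ in range(1000):
    Q_new = R + gamma * Q[P].max(axis=2)
    if np.abs(Q_new - Q).max() < 1e-8:
        Q = Q_new
        break
    Q = Q_new

print(Q)
```

Is this kind of exhaustive sweep a legitimate substitute for simulation-based training whenever the state set is small enough to enumerate?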