Nice article. I'd like to point out that in the Pacman example the agent is only receiving a partial picture of the environment (details are in the video description), so it's unfair to criticize it for lack of planning.

As to why this is the case, you'd have to ask the researcher, but my guess is that the full observation space would be too large, in both memory and run time, for the machine running the agent.