This page covers the basics which are pretty timeless. You'd want to understand this before learning about any more recent advances.
It doesn't get into state of the art algorithms, for example proximal policy optimisation isn't mentioned although the paper on this was published in 2017 and is probably considered the best algorithm for at least some applications.
It doesn't get into state of the art algorithms, for example proximal policy optimisation isn't mentioned although the paper on this was published in 2017 and is probably considered the best algorithm for at least some applications.