Localized excitons play a vital role in the optical response of monolayers of transition metal dichalcogenides and can be exploited as single photon sources for quantum information technology. While the optical properties of such localized excitons are vastly studied, the ultrafast capture process of delocalized excitons into localized potentials is largely unexplored. We perform quantum kinetic calculations of exciton capture via acoustic and optical phonons showing that efficient capture takes place on an ultrafast time scale. The polaron formation in the low-temperature limit leads to higher-energy excitons which can then be efficiently trapped. We demonstrate that the interplay of acoustic and optical phonons leads to an efficient broadening of energy-selection rules. Our studies provide a deep understanding of the carrier trapping from two-dimensional materials into zero-dimensional potentials.