A critical question in the search for extraterrestrial life is whether exoEarths are Earth-like, in that they host life that progressively oxygenates their atmospheres roughly following Earths oxygenation history. This question could be answered statistically by searching for O$_2$ and O$_3$ on exoEarths detected by HabEx or LUVOIR. The point of this paper is to compare the ability of HabEx and LUVOIR to prevent a false negative answer to this question, in which we do not detect O$_2$ or O$_3$ on any planet even if all exoEarths are Earth-like. Our approach is to assign O$_2$ and O$_3$ values drawn from Earths history to a distribution of detectable exoEarths and determine whether O$_2$ and O$_3$ would be detectable using the Planetary Spectrum Generator. We find that if exoEarths tend to be Earth-like, we expect to detect O$_3$ with a LUVOIR-sized instrument. We also find that LUVOIR is unlikely to have a false negative scenario in the context of searching for Earth-like life on its targeted exoEarths. Because of that, if LUVOIR does not detect O$_2$ or O$_3$ on any exoEarths, we will be able to constrain the maximum number of exoEarths that could be Earth-like. In contrast, we find that even if all exoEarths are Earth-like, HabEx has up to a 22% chance of not detecting O$_2$ or O$_3$ on any of them. This is because HabEx will detect less planets and cannot reliably detect O$_2$ and O$_3$ at all potential Proterozoic levels. This is a strong argument for building a larger telescope such as LUVOIR if we want to determine whether exoEarths tend to be Earth-like.