We formulate the concept of non-linear and stochastic galaxy biasing in the framework of halo occupation statistics. Using two-point statistics in projection, we define the galaxy bias function, b_g(r_p), and the galaxy-dark matter cross-correlation function, R_{gm}(r_p), where r_p is the projected distance. We use the analytical halo model to predict how the scale dependence of b_g and R_{gm}, over the range 0.1 Mpc/h < r_p < 30 Mpc/h, depends on the non-linearity and stochasticity in halo occupation models. In particular we quantify the effect due to the presence of central galaxies, the assumption for the radial distribution of satellite galaxies, the richness of the halo, and the Poisson character of the probability to have a certain number of satellite galaxies in a halo of a certain mass. Overall, brighter galaxies reveal a stronger scale dependence, and out to a larger radius. In real-space, we find that galaxy bias becomes scale independent, with R_{gm} = 1, for radii r > 1 - 5 Mpc/h, depending on luminosity. However, galaxy bias is scale-dependent out to much larger radii when one uses the projected quantities defined in this paper. These projected bias functions have the advantage that they are more easily accessible observationally and that their scale dependence carries a wealth of information regarding the properties of galaxy biasing. To observationally constrain the parameters of the halo occupation statistics and to unveil the origin of galaxy biasing we propose the use of the bias function Gamma_{gm}(r_p)=b_g(r_p)/R_{gm}(r_p). This function is obtained via a combination of weak gravitational lensing and galaxy clustering, and it can be measured using existing and forthcoming imaging and spectroscopic galaxy surveys.