No Arabic abstract
The Sloan Digital Sky Survey (SDSS) automated spectroscopic reduction pipeline provides >1.5 million intermediate resolution, R~2000, moderate signal-to-noise ratio (SNR), SNR~15, astronomical spectra of unprecedented homogeneity that cover the wavelength range 3800-9200AA. However, there remain significant systematic residuals in many spectra due to the sub-optimal subtraction of the strong OH sky emission lines longward of 6700AA. The OH sky lines extend over almost half the wavelength range of the SDSS spectra, and the SNR over substantial wavelength regions in many spectra is reduced by more than a factor two over that expected from photon counting statistics. Following the OH line subtraction procedure presented in Wild & Hewett (2005), we make available to the community sky-residual subtracted spectra for the Sloan Digital Sky Survey Data Relase 7. Here we summarise briefly the method, including minor changes in the implementation of the procedure with respect to WH05. The spectra are suitable for many science applications but we highlight some limitations for certain investigations. Details of the data model for the sky-residual subtracted spectra and instructions on how to access the spectra are provided.
Using the k-means cluster analysis algorithm, we carry out an unsupervised classification of all galaxy spectra in the seventh and final Sloan Digital Sky Survey data release (SDSS/DR7). Except for the shift to restframe wavelengths, and the normalization to the g-band flux, no manipulation is applied to the original spectra. The algorithm guarantees that galaxies with similar spectra belong to the same class. We find that 99 % of the galaxies can be assigned to only 17 major classes, with 11 additional minor classes including the remaining 1%. The classification is not unique since many galaxies appear in between classes, however, our rendering of the algorithm overcomes this weakness with a tool to identify borderline galaxies. Each class is characterized by a template spectrum, which is the average of all the spectra of the galaxies in the class. These low noise template spectra vary smoothly and continuously along a sequence labeled from 0 to 27, from the reddest class to the bluest class. Our Automatic Spectroscopic K-means-based (ASK) classification separates galaxies in colors, with classes characteristic of the red sequence, the blue cloud, as well as the green valley. When red sequence galaxies and green valley galaxies present emission lines, they are characteristic of AGN activity. Blue galaxy classes have emission lines corresponding to star formation regions. We find the expected correlation between spectroscopic class and Hubble type, but this relationship exhibits a high intrinsic scatter. Several potential uses of the ASK classification are identified and sketched, including fast determination of physical properties by interpolation, classes as templates in redshift determinations, and target selection in follow-up works (we find classes of Seyfert galaxies, green valley galaxies, as well as a significant number of outliers). The ASK classification is publicly accessible through various websites.
We present the BOSS Lyman-alpha (Lya) Forest Sample from SDSS Data Release 9, comprising 54,468 quasar spectra with zqso > 2.15 suitable for Lya forest analysis. This data set probes the intergalactic medium with absorption redshifts 2.0 < z_alpha < 5.7 over an area of 3275 square degrees, and encompasses an approximate comoving volume of 20 h^-3 Gpc^3. With each spectrum, we have included several products designed to aid in Lya forest analysis: improved sky masks that flag pixels where data may be unreliable, corrections for known biases in the pipeline estimated noise, masks for the cores of damped Lya systems and corrections for their wings, and estimates of the unabsorbed continua so that the observed flux can be converted to a fractional transmission. The continua are derived using a principal component fit to the quasar spectrum redwards of restframe Lya (lambda > 1216 Ang), extrapolated into the forest region and normalized by a linear function to fit the expected evolution of the Lya forest mean-flux. The estimated continuum errors are ~5% rms. We also discuss possible systematics arising from uncertain spectrophotometry and artifacts in the flux calibration; global corrections for the latter are provided. Our sample provides a convenient starting point for users to analyze clustering in BOSS Lya forest data, and it provides a fiducial data set that can be used to compare results from different analyses of baryon acoustic oscillations in the Lya forest. The full data set is available from the SDSS-III DR9 web site.
The Dark Sky Simulations are an ongoing series of cosmological N-body simulations designed to provide a quantitative and accessible model of the evolution of the large-scale Universe. Such models are essential for many aspects of the study of dark matter and dark energy, since we lack a sufficiently accurate analytic model of non-linear gravitational clustering. In July 2014, we made available to the general community our early data release, consisting of over 55 Terabytes of simulation data products, including our largest simulation to date, which used $1.07 times 10^{12}~(10240^3)$ particles in a volume $8h^{-1}mathrm{Gpc}$ across. Our simulations were performed with 2HOT, a purely tree-based adaptive N-body method, running on 200,000 processors of the Titan supercomputer, with data analysis enabled by yt. We provide an overview of the derived halo catalogs, mass function, power spectra and light cone data. We show self-consistency in the mass function and mass power spectrum at the 1% level over a range of more than 1000 in particle mass. We also present a novel method to distribute and access very large datasets, based on an abstraction of the World Wide Web (WWW) as a file system, remote memory-mapped file access semantics, and a space-filling curve index. This method has been implemented for our data release, and provides a means to not only query stored results such as halo catalogs, but also to design and deploy new analysis techniques on large distributed datasets.
The Millennium N-body simulation and the Sloan Digital Sky Survey seventh data release (SDSS DR7) galaxy and galaxy group catalogues are compared to study the properties of galaxy groups and the distribution of galaxies in groups. We construct mock galaxy group catalogues for a Millennium semi-analytical galaxy catalogue by using the same friends-of-friends method, which was used by Tago et al to analyse the SDSS data. We analyse in detail the group luminosities, group richnesses, virial radii, sizes of groups and their rms velocities for four volume-limited samples from observations and simulations. Our results show that the spatial densities of groups agree within one order of magnitude in all samples with a rather good agreement between the mock catalogues and observations. All group property distributions have similar shapes and amplitudes for richer groups. For galaxy pairs and small groups, the group properties for observations and simulations are clearly different. In addition, the spatial distribution of galaxies in small groups is different: at the outskirts of the groups the galaxy number distributions do not agree, although the agreement is relatively good in the inner regions. Differences in the distributions are mainly due to the observational limitations in the SDSS sample and to the problems in the semi-analytical methods that produce too compact and luminous groups.
Based on galaxies from the Sloan Digital Sky Survey (SDSS) and subhalos in the corresponding reconstructed region from the constrained simulation of ELUCID, we study the alignment of central galaxies relative to their host groups in the group catalog, as well as the alignment relative to the corresponding subhalos in the ELUCID simulation. Galaxies in observation are matched to dark matter subhalos in the ELUCID simulation using a novel neighborhood abundance matching method. In observation, the major axes of galaxies are found to be preferentially aligned to the major axes of their host groups. There is a color dependence of galaxy-group alignment that red centrals have a stronger alignment along the major axes of their host groups than blue centrals. Combining galaxies in observation and subhalos in the ELUCID simulation, we also find that central galaxies have their major axes to be aligned to the major axes of their corresponding subhalos in the ELUCID simulation. We find that the galaxy-group and galaxy-subhalo alignment signals are stronger for galaxies in more massive halos. We find that the alignments between main subhalos and the SDSS matched subhalo systems in simulation are slightly stronger than the galaxy-group alignments in observation.