ترغب بنشر مسار تعليمي؟ اضغط هنا

Differential Expression Analysis for A Mouse p53KO Microarray Dataset

184   0   0.0 ( 0 )
 نشر من قبل Wanting Xu
 تاريخ النشر 2012
  مجال البحث علم الأحياء
والبحث باللغة English
 تأليف Wanting Xu




اسأل ChatGPT حول البحث

Affymetrix GeneChip technology is used to detect gene expression levels in samples of cells under different conditions. In this project, we analyzed the gene expression profiling data for mouse induced pluripotent stem cell (iPSCs) (Takahashi, 2006) on Affymetrix Mouse 430 2.0 GeneChip. Three biological conditions were present: p53KO, microRNA mir34aKO, and wild type, each with three biological replicates. The first part was devoted to identifying differentially expressed genes from around 45,000 of them, and looking into their biological meanings by pathway analysis. The second part dealt with repetitive elements represented in the pool of mRNAs. We identified repetitive elements that show a significant difference between two biological conditions. Both the comparison of p53KO versus WT and mir34aKO versus WT were done. However, the emphasis was on the former. Laboratory validation with qPCR confirmed our findings. This work was done under the Overseas Research Fellowship (ORF) Scheme 2012 for Science Students by the Faculty of Science, The University of Hong Kong. Many thanks are due to the University for the fellowship, and to Professors Terry Speed and Lin He and Drs Chao-po Lin and Anne Biton of the University of California at Berkeley for their supervision and generous support.

قيم البحث

اقرأ أيضاً

Stratifying cancer patients based on their gene expression levels allows improving diagnosis, survival analysis and treatment planning. However, such data is extremely highly dimensional as it contains expression values for over 20000 genes per patie nt, and the number of samples in the datasets is low. To deal with such settings, we propose to incorporate prior biological knowledge about genes from ontologies into the machine learning system for the task of patient classification given their gene expression data. We use ontology embeddings that capture the semantic similarities between the genes to direct a Graph Convolutional Network, and therefore sparsify the network connections. We show this approach provides an advantage for predicting clinical targets from high-dimensional low-sample data.
77 - Olga Zolotareva 2020
Aggregating transcriptomics data across hospitals can increase sensitivity and robustness of differential expression analyses, yielding deeper clinical insights. As data exchange is often restricted by privacy legislation, meta-analyses are frequentl y employed to pool local results. However, if class labels are inhomogeneously distributed between cohorts, their accuracy may drop. Flimma (https://exbio.wzw.tum.de/flimma/) addresses this issue by implementing the state-of-the-art workflow limma voom in a privacy-preserving manner, i.e. patient data never leaves its source site. Flimma results are identical to those generated by limma voom on combined datasets even in imbalanced scenarios where meta-analysis approaches fail.
We analyze publicly available data on Affymetrix microarrays spike-in experiments on the human HGU133 chipset in which sequences are added in solution at known concentrations. The spike-in set contains sequences of bacterial, human and artificial ori gin. Our analysis is based on a recently introduced molecular-based model [E. Carlon and T. Heim, Physica A 362, 433 (2006)] which takes into account both probe-target hybridization and target-target partial hybridization in solution. The hybridization free energies are obtained from the nearest-neighbor model with experimentally determined parameters. The molecular-based model suggests a rescaling that should result in a collapse of the data at different concentrations into a single universal curve. We indeed find such a collapse, with the same parameters as obtained before for the older HGU95 chip set. The quality of the collapse varies according to the probe set considered. Artificial sequences, chosen by Affymetrix to be as different as possible from any other human genome sequence, generally show a much better collapse and thus a better agreement with the model than all other sequences. This suggests that the observed deviations from the predicted collapse are related to the choice of probes or have a biological origin, rather than being a problem with the proposed model.
89 - Erik Andries 2006
Linear discrimination, from the point of view of numerical linear algebra, can be treated as solving an ill-posed system of linear equations. In order to generate a solution that is robust in the presence of noise, these problems require regularizati on. Here, we examine the ill-posedness involved in the linear discrimination of cancer gene expression data with respect to outcome and tumor subclasses. We show that a filter factor representation, based upon Singular Value Decomposition, yields insight into the numerical ill-posedness of the hyperplane-based separation when applied to gene expression data. We also show that this representation yields useful diagnostic tools for guiding the selection of classifier parameters, thus leading to improved performance.
We train a neural network to predict chemical toxicity based on gene expression data. The input to the network is a full expression profile collected either in vitro from cultured cells or in vivo from live animals. The output is a set of fine graine d predictions for the presence of a variety of pathological effects in treated animals. When trained on the Open TG-GATEs database it produces good results, outperforming classical models trained on the same data. This is a promising approach for efficiently screening chemicals for toxic effects, and for more accurately evaluating drug candidates based on preclinical data.
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا