It is possible that a high diversity of breast cancer cell subtypes could be associated with active chromatin regions on sellectchem 17q that are different from the ERBB2 amplicon region. We focused on a CSAGA located on 17q11. 2 composed of five Inhibitors,Modulators,Libraries genes and including the convergent SAGP TNFAIP1 POLDIP2. We assume that novel CSAGAs important in breast cancer development could be found in highly unstable regions of the genome and that these complex archi tectures could play significant roles in transcription control, resulting in cancer phenotypes and impacting patient survival. Methods Patients, tumor specimens, cell lines and microarray data Clinical characteristics of breast cancer patients and tumor samples from two independent cohorts have been published previously.
The Stockholm cohort comprised Ks 159 Inhibitors,Modulators,Libraries patients with breast cancer, who were operated on in the Karolinska Hospital from 1 January 1994 to 31 December 1996 and identified in the Stockholm Gotland breast Cancer registry. The Uppsala cohort involved Ku 251 patients representing approximately 60% of all breast cancers resections in Uppsala County, Sweden, from 1 January 1987 to 31 December 1989. Information on patients disease free survival times events and the expression patterns of approximately 30,000 gene transcripts in primary breast tumors was obtained from the National Center for Biotechnology Information Gene Expression Omnibus. The microarray intensities were MAS5. 0 calibrated and the probe Inhibitors,Modulators,Libraries set signal intensities log transformed and scaled by adjusting the mean signal to a target value of log500.
For association studies of DNA copy number and gene expression we utilized single nucleotide polymorphism copy number microarray data for 46 breast cancer cell lines as well as gene expression profiling Inhibitors,Modulators,Libraries of 51 human breast cancer cell lines down where N is the sample size, p is the number of variables and R is the determinant Inhibitors,Modulators,Libraries of the sample correlation matrix. This quantity is distributed approximately as c2 with 1 2 p degrees of freedom. To test the significance of the statistic, we draw B 5,000 samples of p neighbouring genes at random from the set of 44,928 genes and estimate Bartletts t test, Tb, for each of the B 5,000 draws. The corresponding bootstrap P value is estimated as loaded from the GEO GSE12777. Correlation analysis Our primary goal is to identify whether the set of genes composing the TNFAIP1 POLDIP2 CSAGA forms a significant cluster.
First, we estimate Pearson correlation coefficients among these genes download catalog in the two large cohorts and subsequently test whether their matrices are significant at level a 1%. Using Pearson correlation coefficient requires that the data are normally distrib uted. To show that our data satisfy this assumption, we run Kolmogorov Smirnov test for Normality with Lilliefors P value correction.