WGCNA identifies modules of densely interconnected probes by correlating probes with high topological overlap (TO), a biologically meaningful measure of similarity that is highly effective at filtering spurious or isolated connections (Yip and Horvath, 2007). The TO matrix was computed based on the adjacency matrix (Supplemental Experimental Procedures) and average linkage hierarchical clustering was performed using 1 – TO as the distance metric.
Modules were defined using a dynamic tree cutting algorithm to prune the resulting dendrogram (Supplemental Experimental Procedures; Langfelder et al., 2008). Expression values within each module MDV3100 cost were summarized by computing module “eigengenes” (MEs): the first principal component
of each module obtained via singular value decomposition. We defined the module membership (MM) of individual probes as their correlations to the MEs, such Cabozantinib that every probe had a MM value in each module. To discover any significant relationships between gene expression perturbations within modules and traits, we computed the correlations between MEs and phenotypic measures, including age, acoustic features, number of motifs sung, and whether the bird sang or not (Figure 3B). p values were obtained via the Fisher transformation of each correlation; modules with correlations to singing traits that had p values below the Bonferroni corrected significance threshold (α = 1.7e-4) are referred to as the three “song modules” throughout
the text. We also performed the less conservative Benjamini and Hochberg (1995) FDR procedure and found significant correlations to singing for the black and salmon modules. p value corrections were performed using most the results from all phenotypic measures listed above, not just those highlighted in Figure 3B. Lists of unique gene annotations from each module were used for all module enrichment calculations using Fisher’s exact test, functional annotation studies in DAVID and Ingenuity, and when generating VisANT visualizations (Figures 6D–6F and S6, Supplemental Experimental Procedures; Hu et al., 2004). We thank Peter Langfelder and Michael Oldham for advice on microarray preprocessing and network analysis; Jason Howard and Erich Jarvis for the arrays through a partnership with Agilent Technologies; Patty Phelps, Sarah Bottjer, and Erica Sloan for material support; Felix Schweizer and Grace Xiao for statistical advice; and four anonymous reviewers for insightful commentary. This work was supported by NIH grants F31 MH082533 (ATH) and R01 MH070712 (SAW). Author contributions: J.E.M., A.T.H., and S.A.W. designed the experiments; J.E.M. collected the animals and tissue punches, analyzed the song, and, together with E.F., performed the biological validation; A.T.H.