times higher than the six libraries sequenced by Sanger combined. Compared with public turbot resources, our strategy increased by 34,400 the number of novel sequences identified for the first time in turbot.

Annotation of the Turbot 3 database

Nearly half of the sequences (23,661 of 52,427) were automatically annotated by AutoFact and produced a significant BLAST hit against at least one of the public databases. A Venn diagram showing the number of sequences that matched some of the commonly used databases is shown in Figure 2A. A total of 14,194 sequences shared significant BLAST hits against all databases, including UniRef90, KEGG, PFam and others, while 8,556 contigs shared BLAST hits against UniRef90, KEGG and other databases, and 885 against PFam and other databases.

About 2/3 of the contigs were successfully classified into Gene Ontology (GO) categories: 12,111 according to Biological Process (BP), 8,445 to Cellular Component (CC) and 14,116 to Molecular Function (MF). The number of sequences exclusively assigned to each functional category was 2,417 for BP, 828 for CC and 4,328 for MF. The most significant BLAST hits were obtained against a small number of species represented in public databases, including model fish species, cultured fish species and two mammalian species. G. aculeatus was the most highly represented species, followed by a group including T. rubripes, O. latipes and T. nigroviridis, all of which, like turbot, belong to the superorder Acanthopterygii. Figure 4 summarizes the number of sequences representing the different second-level GO terms in the Turbot 3 database.

Cellular process and Metabolic process were the most represented categories within BP terms, but categories related to immune function were also highly represented, namely Response to stimulus, Viral reproduction, Immune system process and Death. The reproductive system was also represented by the Reproduction and Reproductive process categories and, to a lesser extent, by Growth and Cell proliferation. Within CC terms, the Cell and Cell parts categories, followed by Organelle, were the most highly represented. Finally, within MF terms, Binding and Catalytic activity were the most represented categories, followed by Transporter activity and Structural molecule activity.

Identification of genes related to the immune response

The knowledge of the immune system of fish has greatly increased recently.

However, many fish diseases still cause important losses to the industry because no effective control strategy, including vaccines, is yet available. The immune system of fish comprises non-specific (innate) and specific (adaptive) immune defenses, the former being more important than in higher vertebrates. Examples of innate immunity include anatomic barriers, mechanical removal of pathogens, bacterial antagonism, pattern recognition receptors, antigen-nonspecific defense compounds, the complement pathway, phagocytosis, and inflammation. In the present study, the main organs of the immune system of fish such as head kidney

