Tag Archive | ftp

Access to N.gaditana B-31 data through the NCBI

After manual review by the NCBI experts, the genomic data of Nannochloropsis gaditana B-31 are now indexed in the NCBI databases and are accessible through the NCBI web interface and through the NCBI search tools (e.g. blast).

NCBI banner

The genomes of the nucleus and of the organelles and the complete annotation of the genomic sequences are registered as bioproject PRJNA170989 ID: 170989, and can be accessed through the following links:

http://www.ncbi.nlm.nih.gov/bioproject/170989 ;

http://www.ncbi.nlm.nih.gov/nuccore/585113370

The sequencing data used to assembly of the genomes were also submitted to the NCBI SRA database and are available for consulting and download. You can find the data of: a fragment library of Nannochloropsis gaditana B-31 whole genomic DNA (i.e. includes DNA from the nucleus and from the organelles) sequenced using 454FLX Titanium XL  sequencing kit, 2 half plates (http://www.ncbi.nlm.nih.gov/sra/SRX390591); a mate pair library of Nannochloropsis gaditana B-31 whole genomic DNA with an insert size of 1.5-3Kb sequenced using the SOLiD 3 Plus sequencing kit, half plate (http://www.ncbi.nlm.nih.gov/sra/SRX390674); a mate pair library of Nannochloropsis gaditana B-31 whole genomic DNA with an insert size of 3-5Kb sequenced using the SOLiD 3 Plus sequencing kit, half plate (http://www.ncbi.nlm.nih.gov/sra/SRX390681). Note that details about the biosamples and about the experiments are linked o the data.
Read More…

FTP area updated

Misc-Download-iconWe updated our FTP area, now including datasets from other Nannochloropsis species and strains and the list of families of orthologous proteins obtained from the comparison of N. gaditana and N. oceeanica predicted proteins.

Families of orthologous proteins: file type

venn_protein_clusters
Comparing the protein families of N. gaditana and N. oceanica, we produced the lists of exclusive proteins of each species. These lists are available for download in our FTP area.
Each file is a list of families of orthologous proteins. All the families of each file are populated only by proteins belonging to one species of Nannochloropsis. The families may contain proteins of one or more strains of the same Nannochloropsis species.
In the .txt files there is one family per line, described by the name of the organism (species and strain), then a “|”, the list of the proteins of that organism (indicated by the protein ID) the name of the following organism, another “|”, the proteins of the following organism and finally the annotation of the listed proteins.
 
example:
N.gaditanaB-31|Naga_100019g47.1 N.gaditanaCCMP526|Nga20827 nudix hydrolase;

Even though the differences and the imprecisions of the various gene predictions probably play a major role in the determination of the differences among the two species, a close look at the lists of proteins that are putatively assigned as characteristic of each species my reserve interesting surprises!

References

  1. Corteggiani Carpinelli, E. et al. “Chromosome scale genome assembly and transcriptome profiling of Nannochloropsis gaditana in nitrogen depletion.Molecular Plant (2014) 7 (2): 323-335.doi: 10.1093/mp/sst120