Save time and resources with the local CGSB repository of commonly used genomic data sets. Data is obtained from Ensembl and NCBI. New versions/releases will be added periodically or upon request. Previous versions/releases will be preserved.

All files are readable from within the shared genome resource. There is no need to copy the file(s) to your local directory. The second table below shows all available data types.

All data are stored in a common location with the following naming convention:

/scratch/work/cgsb/genomes/Public/<KINGDOM>/<SPECIES>/<SOURCE>/<VERSION>/

Search and find your genome of interest in the table below. Clicking on the row containing your genome will generate the specific path (see below table) to the data on the HPC.

Explore

Note: The original WordPress site featured an interactive search table here that allowed users to browse and filter available species by Kingdom, Species, Source, and Version. This interactive tool is not included in this static port of the site.

/scratch/work/cgsb/genomes/Public/<KINGDOM>/<SPECIES>/<SOURCE>/<VERSION>/

Note: The original WordPress site included an interactive table here listing all available data types (Name, Extensions, Built With), including Reference Sequence (.fa, .fna) and other formats. This interactive tool is not included in this static port of the site.

Request Data

To request a new organism, version, or data type, please email mk5636@nyu.edu.

1 Comment

Darach Miller · July 6, 2017

So clean.