Page tree
Skip to end of metadata
Go to start of metadata

Publicly available data resources in the Research Environment can be accessed from the desktop by clicking on the Home icon and then the 'public_data_resources' folder. This folder is also mounted on the HPC under: /public_data_resources.

If there are additional public data resources that you would like to be made available, please submit a service desk ticket and the additional data can be imported. Note that the public data resources folder is read-only. You will not be able to write any data into the folders, or perform any operations (such as unzipping) that use the same directory as the output directory. Any attempt to do so will be met with a 'permission denied' error.

The list of publicly available data are shown below: 

/public_data_resources/

├── 1000-genomes
│   ├── 20130502_GRCh37
│   ├── 20130502_GRCh38
│   ├── ALL_100G_phase1integrated_v3_impute
│   └── high_coverage
├── annovar_databases -> /tools/apps/annovar/humandb
├── Battenberg
├── bigzips
├── blast
│   └── db-v2.7.1
├── CADD
│   ├── v1.4
│   └── v1.5 -> /public_data_resources/krishna.gs.washington.edu/download/CADD/v1.5
├── centrifuge_databases
├── clinvar
│   └── 20190219
├── COSMIC
│   └── v85
├── data_repo [error opening dir]
├── deepvariant
│   └── models
├── ensembl-data
│   └── ensembl-regulation
├── expansion_hunter_3.0.0 -> /tools/apps/ExpansionHunter/3.0.0/variant_catalog/
├── FannsDB
├── ftp.ncbi.nlm.nih.gov
│   └── snp
├── GEL-40234 [error opening dir]
├── GENCODE
│   └── v29
├── genes_and_regions
├── gnomad
│   ├── 2.1.1
│   ├── GRCh37
│   ├── GRCh37_writeable
│   └── GRCh38
├── grch37
│   └── RegulatoryFeatureActivity
├── gs_genomics-public-data
│   └── resources
├── IGV
│   ├── backup
│   ├── hg19
│   ├── hg38
│   └── igv
├── INFRA-12661
├── kraken1_databases
│   └── gpfs
├── krishna.gs.washington.edu
│   └── download
├── liftover
├── mhcflurry [error opening dir]
├── mutsigcv_resources
│   └── chr_files_hg19
├── NanoPlot_dependency [error opening dir]
├── nanopore-wgs-consortium
│   └── dataset
├── nextflow_db
├── normalised_disease_terms
├── PathSeq
├── phylop100way
├── picard
│   └── GRCh38
├── RDF
│   ├── Default RDF
│   ├── HRG4+ 201718 Reference Costs Grouper Test Data and Expected Results v1.0
│   └── Sample Data
├── reference
│   ├── GRCh37
│   ├── GRCh38
│   └── GRCh38DeAlt_HLA
├── reference-GEL-37086
│   ├── GRCh37
│   ├── GRCh38
│   └── GRCh38DeAlt_HLA
├── RemoteDesktop [error opening dir]
├── Rpackages
│   ├── 3.3.3
│   ├── 3.4.0
│   ├── 3.4.3
│   ├── 3.5.0
│   └── 3.5.1
├── rvtests_ref
├── SpliceAI
│   ├── Predicting_splicing_from_primary_sequence-66029966
│   └── pyfaidx_packages
├── TOPMed
│   └── allele_frequencies
└── vep_resources
    ├── Build-37
    └── Build-38
  • No labels