Welcome to CRISPRBank. The current version of this database contains an analysis of all genomes from RefSeq release 95 (July 2019). All 151,845 bacterial and 855 archaeal genomes were analysed using CRISPRDetect 2.4 (also available through GitHub). These included genomes marked as complete as well as those at lower levels of assembly (e.g. scaffold or contig). CRISPRDetect and CRISPRBank are part of CRISPRSuite, described here:

To cite this data use: Biswas A, Staals RH, Morales SE, Fineran PC, Brown CM: CRISPRDetect: A flexible algorithm to define CRISPR arrays. BMC Genomics 2016, 17(1):356.

Other recent analyses of the CRISPR array and cas gene content of complete genome sequences can be found at the following links:

CRISPRCasdb (16,990 complete GenBank genomes, 12/6/2019)

Makarova et al. 2019, CRISPRclass19 (13,116 complete genomes, 1/3/2019). We intend to include a comparison of this data with our CRISPRBank data in a later release of this interface (3/2020).

Archaeal or bacterial genomes with CRISPRDetect-defined repeats >= 3, score >= 4.0 and DR length >= 23.

Kingdom   CRISPR array-containing genomes of all RefSeq genomes (%)   Number of CRISPR arrays   Number of CRISPR spacers
Archaea   699 of 855 (81.8%)                                          2,086                     71,252
Bacteria  70,531 of 151,845 (46.4%)                                   130,293                   1,923,258

Notes: 1. This table only considers arrays; some array-containing genomes (5-10%) will lack functional cas genes. 2. Each RefSeq assembly (GCF..) is counted, and there are many assemblies for some species, e.g. E. coli.
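
As an illustration of how the thresholds and percentages in the table fit together, the following Python sketch filters a list of array records and counts the assemblies that retain at least one array. The ArrayRecord class and its field names are hypothetical and are not the CRISPRDetect output format; only the three thresholds and the genome counts are taken from the table above.

```python
from dataclasses import dataclass

# Thresholds quoted in the table caption.
MIN_REPEATS = 3
MIN_SCORE = 4.0
MIN_DR_LENGTH = 23


@dataclass
class ArrayRecord:
    """Hypothetical record for one predicted array (illustrative field names)."""
    assembly: str    # RefSeq assembly ID; each assembly is counted separately
    n_repeats: int   # number of repeats in the array
    score: float     # array score
    dr_length: int   # direct repeat (DR) length in bases


def passes_thresholds(rec):
    """True if the array meets all three table criteria."""
    return (
        rec.n_repeats >= MIN_REPEATS
        and rec.score >= MIN_SCORE
        and rec.dr_length >= MIN_DR_LENGTH
    )


def assemblies_with_arrays(records):
    """Set of assembly IDs that have at least one array passing the thresholds."""
    return {rec.assembly for rec in records if passes_thresholds(rec)}


def percent(part, total):
    """Percentage rounded to one decimal place, as shown in the table."""
    return round(100.0 * part / total, 1)


if __name__ == "__main__":
    # Made-up records, for demonstration only.
    demo = [
        ArrayRecord("asm_A", n_repeats=12, score=6.5, dr_length=29),
        ArrayRecord("asm_A", n_repeats=2, score=5.0, dr_length=29),   # too few repeats
        ArrayRecord("asm_B", n_repeats=8, score=3.1, dr_length=36),   # score too low
        ArrayRecord("asm_C", n_repeats=5, score=4.0, dr_length=23),   # exactly on the limits
    ]
    print(sorted(assemblies_with_arrays(demo)))
    # Genome counts from the table above.
    print(percent(699, 855), percent(70531, 151845))
```

Counting per assembly rather than per species mirrors note 2: a species with many assemblies contributes once for each assembly.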

Please key in the organism name (or accession NC_...) or click on the down arrow [NEW 2020: now you can also search arrays by RefSeq assembly ID i.e. GCF...]: 
Query database using one or more of these fields :

 Family: determined by:
 Minimum likelihood score:
 Maximum likelihood score:
 Minimum number of repeats:
 Maximum number of repeats:
 Minimum repeat length:
 Maximum repeat length:

 Please paste a repeat sequence and click on the exact match search button:


 Please paste a spacer sequence and click on the exact match search button:
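
For readers working with a local list of sequences, an exact-match lookup of the kind offered by the repeat and spacer search boxes can be sketched in Python as follows. This is not the database's own implementation: the function names are hypothetical, and ignoring case and whitespace in pasted input is an assumption of this sketch.

```python
def normalise(seq):
    """Remove whitespace and upper-case the sequence (an assumption of this sketch)."""
    return "".join(seq.split()).upper()


def exact_match(query, sequences):
    """Return the IDs whose stored sequence equals the query exactly.

    `sequences` maps a hypothetical identifier to a repeat or spacer sequence.
    No reverse-complement or partial matching is attempted.
    """
    wanted = normalise(query)
    return [seq_id for seq_id, seq in sequences.items() if normalise(seq) == wanted]


if __name__ == "__main__":
    # Made-up sequences, for demonstration only.
    stored = {
        "example_1": "GTTTTAGAGCTATGCTGTTTTG",
        "example_2": "GTTTTAGAGCTATGCTGTTTTGA",
    }
    print(exact_match("gttttagagc tatgctgttttg", stored))
```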