Alan__Collins Profile Banner
Alan Collins Profile
Alan Collins

@Alan__Collins

Followers
102
Following
174
Media
6
Statuses
99

Senior Bioinformatician at IHRC Inc.

Georgia, USA
Joined November 2017
Don't wanna be here? Send us removal request.
@Alan__Collins
Alan Collins
2 years
The CRISPR Comparison Toolkit (CCTK) is now published in @CRISPRjournal with @RachelJWhitaker! Github Conda Read the docs 1/3.
1
10
30
@Alan__Collins
Alan Collins
2 years
RT @LCSuttenfield: Anybody out there working in comparative CRIPSR biology/informatics? Check out this tool - produces incredible graphics….
0
1
0
@grok
Grok
1 day
Join millions who have switched to Grok.
86
153
1K
@Alan__Collins
Alan Collins
2 years
Thank you to reviewers and users who submitted bug reports for helping to improve CCTK over the last year! We hope to continue to improve it moving forward!.3/3.
0
0
1
@Alan__Collins
Alan Collins
2 years
See the thread associated with the preprint for a description of what CCTK can do and check out the releases page to see the new features and bug fixes since CCTK was first released almost a year ago. 2/3.
@Alan__Collins
Alan Collins
3 years
I’m excited to announce a new preprint with @RachelJWhitaker! We developed a command line toolkit for the identification and analysis of CRISPR arrays, called the CRISPR Comparison ToolKit (CCTK). A thread about what it can be used for: 1/15.
1
0
1
@Alan__Collins
Alan Collins
3 years
We hope that CCTK will be useful for your research! CCTK can be installed via conda (, source code is available on GitHub (, and documentation is available on RTD ( 15/15.
github.com
Tools to analyze the differences and similarities between CRISPR arrays - Alan-Collins/CRISPR_comparison_toolkit
0
0
2
@Alan__Collins
Alan Collins
3 years
You can visualize how many targets each spacer has (or other data), by using custom colour schemes for CCTK tools. Here is the CRISPRdiff plot with spacers coloured according to the number of targets we found. 14/15
Tweet media one
1
0
0
@Alan__Collins
Alan Collins
3 years
Spacerblast can identify targets with mismatches by extending truncated BLAST hits and can search for PAMs defined using either IUPAC nucleotide codes or regex. It can also mask regions from the search. 13/15.
1
0
0
@Alan__Collins
Alan Collins
3 years
Identification of matches can be done using BLAST. However it is laborious for large numbers of spacers or matches. We therefore included Spacerblast in CCTK, which is a BLASTn wrapper with a few extra steps. 12/15.
1
0
0
@Alan__Collins
Alan Collins
3 years
Finally, CRISPR systems are adaptive immune systems that provide protection via sequence-specific targeting MGEs. Targets can be identified by finding sequence matches to spacers where the matches are flanked by a motif called a PAM. 11/15.
1
0
0
@Alan__Collins
Alan Collins
3 years
Constrain infers how arrays may evolved if their history were *constrained* by a certain topology. In the case of these arrays, Constrain indicates that horizontal gene transfer may have occurred (indicated by red boxes). 10/15
Tweet media one
1
0
0
@Alan__Collins
Alan Collins
3 years
The tree inferred by CRISPRtree represents a hypothesis of the history of these arrays. However, the CRISPRtree tree differs from a tree based on a core genome alignment. The tool Constrain can be used to explore this difference. 9/15.
1
0
0
@Alan__Collins
Alan Collins
3 years
Looking at the CRISPRdiff plot, you can see that some arrays are very similar, and some are more distinct. CRISPRtree can infer a maximum parsimony tree to describe those relationships. It also produces a hypothesis of how the arrays evolved. 8/15
Tweet media one
1
0
2
@Alan__Collins
Alan Collins
3 years
There are other great tools for visualizing CRISPR arrays: CRISPRViz ( and CRISPRStudio (. We discuss key differences in the manuscript. 7/15.
mdpi.com
The CRISPR-Cas system biologically serves as an adaptive defense mechanism against phages. However, there is growing interest in exploiting the hypervariable nature of the CRISPR locus, often of...
1
0
0
@Alan__Collins
Alan Collins
3 years
Each cluster of homologous arrays can then be explored in more detail. First, which parts of each array are shared with, or distinct from, other arrays in the cluster? This is the CRISPRdiff visualization of the largest cluster. 6/15
Tweet media one
1
0
1
@Alan__Collins
Alan Collins
3 years
CCTK represents array homology using a network in which arrays (nodes) are connected by an edge if they have more than 1 identical spacer in common. Edge weight reflects the degree of array similarity. 5/15
Tweet media one
1
0
0
@Alan__Collins
Alan Collins
3 years
CCTK includes two utilities to identify CRISPR arrays. One uses MinCED ( and the other uses BLASTn. CCTK then orients the arrays relative to the leader end and identifies homologous clusters of arrays. 4/15.
github.com
Mining CRISPRs in Environmental Datasets. Contribute to ctSkennerton/minced development by creating an account on GitHub.
1
0
0
@Alan__Collins
Alan Collins
3 years
CCTK is therefore designed to identify CRISPR arrays in large numbers (100s to 1000s) of assemblies. CCTK then allows you to explore groups of arrays in more detail to visualize arrays and to infer their evolutionary history. 3/15.
1
0
1
@Alan__Collins
Alan Collins
3 years
CCTK was born out of a desire to analyze CRISPR arrays from large numbers of isolates. We wanted to know which arrays were homologous to one another (i.e., contain the same spacers), but also the nature of their relationships. 2/15.
1
0
1
@Alan__Collins
Alan Collins
3 years
I’m excited to announce a new preprint with @RachelJWhitaker! We developed a command line toolkit for the identification and analysis of CRISPR arrays, called the CRISPR Comparison ToolKit (CCTK). A thread about what it can be used for: 1/15.
@biorxivpreprint
bioRxiv
3 years
CRISPR comparison toolkit (CCTK): Rapid identification, visualization, and analysis of CRISPR array diversity #bioRxiv.
3
18
71