Protein Analysis Toolkit

CDHIT
Description Clustering of similar protein sequences
URL http://bioinformatics.org/cd-hit/
Help http://bioinformatics.org/cd-hit/manual.php
Information CDHIT clusters similar protein sequences from the input protein set. Protein sequences whose similarity exceeds the chosen percent identity cutoff are grouped in the same cluster, unless their sequence length difference is higher than the chosen maximum. By default, the result lists one representative sequence per cluster. If the "Cluster lists output" option is checked, the result instead lists the protein names, one line per cluster.
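
The grouping rule described above can be illustrated with a small sketch. This is a simplified illustration, not the actual CD-HIT implementation: the `identity` function below is an assumption that compares residues position by position without any alignment, and the names `cluster`, `identity_cutoff`, and `max_length_diff` are hypothetical. Sequences are processed longest first, and the first member of each cluster serves as its representative.

```python
def identity(a, b):
    """Crude percent identity (assumption): matching positions over the
    shorter length, with no alignment. Real tools align the sequences."""
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / n


def cluster(seqs, identity_cutoff=90.0, max_length_diff=None):
    """Greedy clustering sketch.

    seqs: dict mapping protein name -> sequence.
    Returns a list of clusters; each cluster is a list of names whose
    first element is the representative.
    """
    # Longest sequences first, so each representative is the longest member.
    ordered = sorted(seqs, key=lambda name: len(seqs[name]), reverse=True)
    clusters = []
    for name in ordered:
        seq = seqs[name]
        for members in clusters:
            rep = seqs[members[0]]
            # Skip this cluster if the length difference exceeds the maximum.
            if max_length_diff is not None and abs(len(rep) - len(seq)) > max_length_diff:
                continue
            if identity(rep, seq) >= identity_cutoff:
                members.append(name)
                break
        else:
            # No existing cluster accepted the sequence: start a new one.
            clusters.append([name])
    return clusters


# Hypothetical toy sequences, for illustration only.
example = {
    "a": "MKTAYIAKQR",
    "b": "MKTAYIAKQK",
    "c": "GGGGGGGGGG",
    "d": "MKTAY",
}

cluster_lists = cluster(example, identity_cutoff=90.0, max_length_diff=3)
representatives = [members[0] for members in cluster_lists]
```

Here `representatives` corresponds to the default output (one sequence per cluster), while `cluster_lists` corresponds to the "Cluster lists output" option (one line of protein names per cluster).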

Input
Type your input query in the box below:
(Input should correspond to at least two protein sequences)
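
As a rough pre-submission check for the two-sequence requirement, the sketch below counts sequence records in a query. It assumes the input is in FASTA format (header lines starting with `>`), which this page does not state explicitly; the function names and the sample sequences are hypothetical.

```python
def count_fasta_records(text):
    """Count FASTA records, assuming each record starts with a '>' header line."""
    return sum(1 for line in text.splitlines() if line.lstrip().startswith(">"))


def has_enough_sequences(text, minimum=2):
    """Return True if the query holds at least `minimum` sequences."""
    return count_fasta_records(text) >= minimum


# Hypothetical query with two made-up protein sequences.
query = """>protein_1
MKTAYIAKQR
>protein_2
MKTAYIAKQK
"""

ok = has_enough_sequences(query)
```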

(Click here to get an input example)

or upload your input query from a file:

or use previous input/output compatible with CDHIT:

CDHIT options
Percent identity cutoff
Max sequence length difference
Cluster lists output
Additional options

Output redirection
You may redirect the CDHIT output to one selected tool: