PAT: Protein Analysis Toolkit

CDHIT
Description	Exhaustive database homology search
URL	http://bioinformatics.org/cd-hit/
Help	http://bioinformatics.org/cd-hit/manual.php
Information	CDHIT clusters similar protein sequences from the input protein set. Protein sequences more similar than the chosen percent identity cutoff are grouped in the same cluster unless their sequence length difference is higher the chosen maximum. By default, the result lists one representative sequence per cluster unless the "Cluster lists ouptut" option is checked (in this case, the protein names are listed, one line per cluster).

Input

Type your input query in the box below:
(Input should correspond to at least two protein sequences)
(Click here to get an input example)

or upload your input query from a file :

or use previous input/output compatible with CDHIT:

CDHIT options
Percent identity cutoff		Cluster lists output	No Yes
Max sequence length difference		Additional options

Output redirection
You may redirect the CDHIT output to one selected tool: