Information | CDHIT clusters similar protein sequences from the input protein set. Sequences whose similarity exceeds the chosen percent identity cutoff are grouped into the same cluster, unless their lengths differ by more than the chosen maximum. By default, the result lists one representative sequence per cluster. If the "Cluster lists output" option is checked, the result instead lists the protein names, one line per cluster. |