CATH Documentation
Homologous Superfamily
This level of the CATH hierarchy groups together protein domains which are thought to share a common ancestor and can therefore be described as homologous. Similarities are identified either by high sequence identity or structure comparison using SSAP. Structures are clustered into the same homologous superfamily if they satisfy two or more of the following criteria:
- Sequence identity >= 35%, overlap >= 60% of larger structure equivalent to smaller.
- SSAP score >= 80.0, sequence identity >= 20%, overlap 60% of larger structure equivalent to smaller.
- SSAP score >= 70.0, overlap 60% of larger structure equivalent to smaller; domains which have related functions, which is informed by the literature and Pfam protein family database.
- Significant similarity from HMM-sequence searches and HMM-HMM comparisons using SAM, HMMER (http://hmmer.wustl.edu) and PRC (http://supfam.org/PRC).
References
The Pfam protein families database.
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR
Nucleic Acids Res32pD138-41(2004 Jan 1)
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR
Nucleic Acids Res32pD138-41(2004 Jan 1)
Hidden Markov models for sequence analysis: extension and analysis of the basic method.
Hughey R, Krogh A
Comput Appl Biosci12p95-107(1996 Apr)
Hughey R, Krogh A
Comput Appl Biosci12p95-107(1996 Apr)