Protein domain repetition is enriched in Streptococcal cell-surface proteins

I. Hsuan Lin, Ming Ta Hsu, Chuan Hsiung Chang

Research output: Contribution to journal › Article

6 Citations (Scopus)

Abstract

Tandem repetition of domains in protein sequences occurs in all three domains of life. It creates protein diversity and adds functional complexity to organisms. In this work, we analyzed 52 streptococcal genomes and found that 3748 proteins contain domain repeats. Proteins without domain repeats are significantly enriched in the cytoplasm, whereas proteins with domain repeats are significantly enriched in the cytoplasmic membrane, cell wall and extracellular locations. Domain repetition occurs most frequently in S. pneumoniae and least frequently in S. thermophilus and S. pyogenes. DUF1542 is the most highly repeated domain within a single protein, followed by Rib, CW_binding_1, G5 and HemolysinCabind. The 3D structures of 24 repeat-containing proteins were predicted to investigate the structural and functional effects of domain repetition. Several repeat-containing streptococcal cell-surface proteins are known to be virulence-associated. Surface-associated proteins containing tandem domains that lack experimental functional characterization may be involved in the pathogenesis of streptococci and deserve further investigation.

Original language: English
Pages (from-to): 370-379
Number of pages: 10
Journal: Genomics
Volume: 100
Issue number: 6
DOI: 10.1016/j.ygeno.2012.08.001
Publication status: Published - Dec 1 2012
Externally published: Yes

Fingerprint

Membrane Proteins
Proteins
Ribs
Streptococcus
Cell Wall
Virulence
Pneumonia
Cytoplasm
Cell Membrane
Genome
Protein Domains

Keywords

  • Domain repeats
  • Domain repetition
  • Protein structure modeling
  • Protein subcellular localization
  • Streptococcus
  • Virulence

ASJC Scopus subject areas

  • Genetics

Cite this

Protein domain repetition is enriched in Streptococcal cell-surface proteins. / Lin, I. Hsuan; Hsu, Ming Ta; Chang, Chuan Hsiung.

In: Genomics, Vol. 100, No. 6, 01.12.2012, p. 370-379.

@article{6b0f5db0513d459a816cab7487f017df,
title = "Protein domain repetition is enriched in Streptococcal cell-surface proteins",
keywords = "Domain repeats, Domain repetition, Protein structure modeling, Protein subcellular localization, Streptococcus, Virulence",
author = "Lin, {I. Hsuan} and Hsu, {Ming Ta} and Chang, {Chuan Hsiung}",
year = "2012",
month = "12",
day = "1",
doi = "10.1016/j.ygeno.2012.08.001",
language = "English",
volume = "100",
pages = "370--379",
journal = "Genomics",
issn = "0888-7543",
publisher = "Academic Press Inc.",
number = "6",
}

TY - JOUR

T1 - Protein domain repetition is enriched in Streptococcal cell-surface proteins

AU - Lin, I. Hsuan

AU - Hsu, Ming Ta

AU - Chang, Chuan Hsiung

PY - 2012/12/1

Y1 - 2012/12/1

KW - Domain repeats

KW - Domain repetition

KW - Protein structure modeling

KW - Protein subcellular localization

KW - Streptococcus

KW - Virulence

UR - http://www.scopus.com/inward/record.url?scp=84869883870&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84869883870&partnerID=8YFLogxK

U2 - 10.1016/j.ygeno.2012.08.001

DO - 10.1016/j.ygeno.2012.08.001

M3 - Article

C2 - 22921469

AN - SCOPUS:84869883870

VL - 100

SP - 370

EP - 379

JO - Genomics

JF - Genomics

SN - 0888-7543

IS - 6

ER -