Diverse CRISPRs evolving in human microbiomes

Mina Rho, Yu Wei Wu, Haixu Tang, Thomas G. Doak, Yuzhen Ye

研究成果: 雜誌貢獻文章

75 引文 (Scopus)

摘要

CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with cas (CRISPR-associated) genes, form the CRISPR/Cas adaptive immune system, a primary defense strategy that eubacteria and archaea mobilize against foreign nucleic acids, including phages and conjugative plasmids. Short spacer sequences separated by the repeats are derived from foreign DNA and direct interference to future infections. The availability of hundreds of shotgun metagenomic datasets from the Human Microbiome Project (HMP) enables us to explore the distribution and diversity of known CRISPRs in human-associated microbial communities and to discover new CRISPRs. We propose a targeted assembly strategy to reconstruct CRISPR arrays, which whole-metagenome assemblies fail to identify. For each known CRISPR type (identified from reference genomes), we use its direct repeat consensus sequence to recruit reads from each HMP dataset and then assemble the recruited reads into CRISPR loci; the unique spacer sequences can then be extracted for analysis. We also identified novel CRISPRs or new CRISPR variants in contigs from whole-metagenome assemblies and used targeted assembly to more comprehensively identify these CRISPRs across samples. We observed that the distributions of CRISPRs (including 64 known and 86 novel ones) are largely body-site specific. We provide detailed analysis of several CRISPR loci, including novel CRISPRs. For example, known streptococcal CRISPRs were identified in most oral microbiomes, totaling ~8,000 unique spacers: samples resampled from the same individual and oral site shared the most spacers; different oral sites from the same individual shared significantly fewer, while different individuals had almost no common spacers, indicating the impact of subtle niche differences on the evolution of CRISPR defenses. We further demonstrate potential applications of CRISPRs to the tracing of rare species and the virus exposure of individuals. This work indicates the importance of effective identification and characterization of CRISPR loci to the study of the dynamic ecology of microbiomes.
原文英語
文章編號e1002441
期刊PLoS Genetics
8
發行號6
DOIs
出版狀態已發佈 - 六月 2012
對外發佈Yes

指紋

Clustered Regularly Interspaced Short Palindromic Repeats
Microbiota
mouth
loci
immune system
nucleic acid
rare species
plasmid
microbial community
niche
virus
genome
ecology
DNA
Eubacteria
gene
consensus sequence
Archaea
crossover interference
bacteriophages

ASJC Scopus subject areas

  • Genetics
  • Molecular Biology
  • Ecology, Evolution, Behavior and Systematics
  • Cancer Research
  • Genetics(clinical)

引用此文

Diverse CRISPRs evolving in human microbiomes. / Rho, Mina; Wu, Yu Wei; Tang, Haixu; Doak, Thomas G.; Ye, Yuzhen.

於: PLoS Genetics, 卷 8, 編號 6, e1002441, 06.2012.

研究成果: 雜誌貢獻文章

Rho, Mina ; Wu, Yu Wei ; Tang, Haixu ; Doak, Thomas G. ; Ye, Yuzhen. / Diverse CRISPRs evolving in human microbiomes. 於: PLoS Genetics. 2012 ; 卷 8, 編號 6.
@article{4bf022c0717443a181f18901604168ae,
title = "Diverse CRISPRs evolving in human microbiomes",
abstract = "CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with cas (CRISPR-associated) genes, form the CRISPR/Cas adaptive immune system, a primary defense strategy that eubacteria and archaea mobilize against foreign nucleic acids, including phages and conjugative plasmids. Short spacer sequences separated by the repeats are derived from foreign DNA and direct interference to future infections. The availability of hundreds of shotgun metagenomic datasets from the Human Microbiome Project (HMP) enables us to explore the distribution and diversity of known CRISPRs in human-associated microbial communities and to discover new CRISPRs. We propose a targeted assembly strategy to reconstruct CRISPR arrays, which whole-metagenome assemblies fail to identify. For each known CRISPR type (identified from reference genomes), we use its direct repeat consensus sequence to recruit reads from each HMP dataset and then assemble the recruited reads into CRISPR loci; the unique spacer sequences can then be extracted for analysis. We also identified novel CRISPRs or new CRISPR variants in contigs from whole-metagenome assemblies and used targeted assembly to more comprehensively identify these CRISPRs across samples. We observed that the distributions of CRISPRs (including 64 known and 86 novel ones) are largely body-site specific. We provide detailed analysis of several CRISPR loci, including novel CRISPRs. For example, known streptococcal CRISPRs were identified in most oral microbiomes, totaling ~8,000 unique spacers: samples resampled from the same individual and oral site shared the most spacers; different oral sites from the same individual shared significantly fewer, while different individuals had almost no common spacers, indicating the impact of subtle niche differences on the evolution of CRISPR defenses. We further demonstrate potential applications of CRISPRs to the tracing of rare species and the virus exposure of individuals. This work indicates the importance of effective identification and characterization of CRISPR loci to the study of the dynamic ecology of microbiomes.",
author = "Mina Rho and Wu, {Yu Wei} and Haixu Tang and Doak, {Thomas G.} and Yuzhen Ye",
year = "2012",
month = "6",
doi = "10.1371/journal.pgen.1002441",
language = "English",
volume = "8",
journal = "PLoS Genetics",
issn = "1553-7390",
publisher = "Public Library of Science",
number = "6",

}

TY - JOUR

T1 - Diverse CRISPRs evolving in human microbiomes

AU - Rho, Mina

AU - Wu, Yu Wei

AU - Tang, Haixu

AU - Doak, Thomas G.

AU - Ye, Yuzhen

PY - 2012/6

Y1 - 2012/6

N2 - CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with cas (CRISPR-associated) genes, form the CRISPR/Cas adaptive immune system, a primary defense strategy that eubacteria and archaea mobilize against foreign nucleic acids, including phages and conjugative plasmids. Short spacer sequences separated by the repeats are derived from foreign DNA and direct interference to future infections. The availability of hundreds of shotgun metagenomic datasets from the Human Microbiome Project (HMP) enables us to explore the distribution and diversity of known CRISPRs in human-associated microbial communities and to discover new CRISPRs. We propose a targeted assembly strategy to reconstruct CRISPR arrays, which whole-metagenome assemblies fail to identify. For each known CRISPR type (identified from reference genomes), we use its direct repeat consensus sequence to recruit reads from each HMP dataset and then assemble the recruited reads into CRISPR loci; the unique spacer sequences can then be extracted for analysis. We also identified novel CRISPRs or new CRISPR variants in contigs from whole-metagenome assemblies and used targeted assembly to more comprehensively identify these CRISPRs across samples. We observed that the distributions of CRISPRs (including 64 known and 86 novel ones) are largely body-site specific. We provide detailed analysis of several CRISPR loci, including novel CRISPRs. For example, known streptococcal CRISPRs were identified in most oral microbiomes, totaling ~8,000 unique spacers: samples resampled from the same individual and oral site shared the most spacers; different oral sites from the same individual shared significantly fewer, while different individuals had almost no common spacers, indicating the impact of subtle niche differences on the evolution of CRISPR defenses. We further demonstrate potential applications of CRISPRs to the tracing of rare species and the virus exposure of individuals. This work indicates the importance of effective identification and characterization of CRISPR loci to the study of the dynamic ecology of microbiomes.

AB - CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with cas (CRISPR-associated) genes, form the CRISPR/Cas adaptive immune system, a primary defense strategy that eubacteria and archaea mobilize against foreign nucleic acids, including phages and conjugative plasmids. Short spacer sequences separated by the repeats are derived from foreign DNA and direct interference to future infections. The availability of hundreds of shotgun metagenomic datasets from the Human Microbiome Project (HMP) enables us to explore the distribution and diversity of known CRISPRs in human-associated microbial communities and to discover new CRISPRs. We propose a targeted assembly strategy to reconstruct CRISPR arrays, which whole-metagenome assemblies fail to identify. For each known CRISPR type (identified from reference genomes), we use its direct repeat consensus sequence to recruit reads from each HMP dataset and then assemble the recruited reads into CRISPR loci; the unique spacer sequences can then be extracted for analysis. We also identified novel CRISPRs or new CRISPR variants in contigs from whole-metagenome assemblies and used targeted assembly to more comprehensively identify these CRISPRs across samples. We observed that the distributions of CRISPRs (including 64 known and 86 novel ones) are largely body-site specific. We provide detailed analysis of several CRISPR loci, including novel CRISPRs. For example, known streptococcal CRISPRs were identified in most oral microbiomes, totaling ~8,000 unique spacers: samples resampled from the same individual and oral site shared the most spacers; different oral sites from the same individual shared significantly fewer, while different individuals had almost no common spacers, indicating the impact of subtle niche differences on the evolution of CRISPR defenses. We further demonstrate potential applications of CRISPRs to the tracing of rare species and the virus exposure of individuals. This work indicates the importance of effective identification and characterization of CRISPR loci to the study of the dynamic ecology of microbiomes.

UR - http://www.scopus.com/inward/record.url?scp=84864042533&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84864042533&partnerID=8YFLogxK

U2 - 10.1371/journal.pgen.1002441

DO - 10.1371/journal.pgen.1002441

M3 - Article

C2 - 22719260

AN - SCOPUS:84864042533

VL - 8

JO - PLoS Genetics

JF - PLoS Genetics

SN - 1553-7390

IS - 6

M1 - e1002441

ER -