A genome-wide survey of human pseudogenes

David Torrents, Mikita Suyama, Evgeny Zdobnov, Peer Bork

Research output: Contribution to journalArticle

197 Citations (Scopus)

Abstract

We screened all intergenic regions in the human genome to identify pseudogenes with a combination of homology searches and a functionality test using the ratio of silent to replacement nucleotide substitutions (KA/KS). We identified 19,724 regions of which 95% ± 3% are estimated to evolve neutrally and thus are likely to encode pseudogenes. Half of these have no detectable truncation in their pseudocoding regions and therefore are not identifiable by methods that require the presence of truncations to prove nonfunctionality. A comparative analysis with the mouse genome showed that 70% of these pseudogenes have a retrotranspositional origin (processed), and the rest arose by segmental duplication (nonprocessed). Although the spread of both types of pseudogenes correlates with chromosome size, nonprocessed pseudogenes appear to be enriched in regions with high gene density. It is likely that the human pseudogenes identified here represent only a small fraction of the total, which probably exceeds the number of genes.

Original languageEnglish
Pages (from-to)2559-2567
Number of pages9
JournalGenome Research
Volume13
Issue number12
DOIs
Publication statusPublished - Dec 1 2003
Externally publishedYes

Fingerprint

Pseudogenes
Genome
Genomic Segmental Duplications
Intergenic DNA
Human Genome
Genes
Surveys and Questionnaires
Nucleotides
Chromosomes

All Science Journal Classification (ASJC) codes

  • Genetics

Cite this

Torrents, D., Suyama, M., Zdobnov, E., & Bork, P. (2003). A genome-wide survey of human pseudogenes. Genome Research, 13(12), 2559-2567. https://doi.org/10.1101/gr.1455503

A genome-wide survey of human pseudogenes. / Torrents, David; Suyama, Mikita; Zdobnov, Evgeny; Bork, Peer.

In: Genome Research, Vol. 13, No. 12, 01.12.2003, p. 2559-2567.

Research output: Contribution to journalArticle

Torrents, D, Suyama, M, Zdobnov, E & Bork, P 2003, 'A genome-wide survey of human pseudogenes', Genome Research, vol. 13, no. 12, pp. 2559-2567. https://doi.org/10.1101/gr.1455503
Torrents, David ; Suyama, Mikita ; Zdobnov, Evgeny ; Bork, Peer. / A genome-wide survey of human pseudogenes. In: Genome Research. 2003 ; Vol. 13, No. 12. pp. 2559-2567.
@article{197d92f94e3941daa06f2146c55eb754,
title = "A genome-wide survey of human pseudogenes",
abstract = "We screened all intergenic regions in the human genome to identify pseudogenes with a combination of homology searches and a functionality test using the ratio of silent to replacement nucleotide substitutions (KA/KS). We identified 19,724 regions of which 95{\%} ± 3{\%} are estimated to evolve neutrally and thus are likely to encode pseudogenes. Half of these have no detectable truncation in their pseudocoding regions and therefore are not identifiable by methods that require the presence of truncations to prove nonfunctionality. A comparative analysis with the mouse genome showed that 70{\%} of these pseudogenes have a retrotranspositional origin (processed), and the rest arose by segmental duplication (nonprocessed). Although the spread of both types of pseudogenes correlates with chromosome size, nonprocessed pseudogenes appear to be enriched in regions with high gene density. It is likely that the human pseudogenes identified here represent only a small fraction of the total, which probably exceeds the number of genes.",
author = "David Torrents and Mikita Suyama and Evgeny Zdobnov and Peer Bork",
year = "2003",
month = "12",
day = "1",
doi = "10.1101/gr.1455503",
language = "English",
volume = "13",
pages = "2559--2567",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "12",

}

TY - JOUR

T1 - A genome-wide survey of human pseudogenes

AU - Torrents, David

AU - Suyama, Mikita

AU - Zdobnov, Evgeny

AU - Bork, Peer

PY - 2003/12/1

Y1 - 2003/12/1

N2 - We screened all intergenic regions in the human genome to identify pseudogenes with a combination of homology searches and a functionality test using the ratio of silent to replacement nucleotide substitutions (KA/KS). We identified 19,724 regions of which 95% ± 3% are estimated to evolve neutrally and thus are likely to encode pseudogenes. Half of these have no detectable truncation in their pseudocoding regions and therefore are not identifiable by methods that require the presence of truncations to prove nonfunctionality. A comparative analysis with the mouse genome showed that 70% of these pseudogenes have a retrotranspositional origin (processed), and the rest arose by segmental duplication (nonprocessed). Although the spread of both types of pseudogenes correlates with chromosome size, nonprocessed pseudogenes appear to be enriched in regions with high gene density. It is likely that the human pseudogenes identified here represent only a small fraction of the total, which probably exceeds the number of genes.

AB - We screened all intergenic regions in the human genome to identify pseudogenes with a combination of homology searches and a functionality test using the ratio of silent to replacement nucleotide substitutions (KA/KS). We identified 19,724 regions of which 95% ± 3% are estimated to evolve neutrally and thus are likely to encode pseudogenes. Half of these have no detectable truncation in their pseudocoding regions and therefore are not identifiable by methods that require the presence of truncations to prove nonfunctionality. A comparative analysis with the mouse genome showed that 70% of these pseudogenes have a retrotranspositional origin (processed), and the rest arose by segmental duplication (nonprocessed). Although the spread of both types of pseudogenes correlates with chromosome size, nonprocessed pseudogenes appear to be enriched in regions with high gene density. It is likely that the human pseudogenes identified here represent only a small fraction of the total, which probably exceeds the number of genes.

UR - http://www.scopus.com/inward/record.url?scp=0348013073&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0348013073&partnerID=8YFLogxK

U2 - 10.1101/gr.1455503

DO - 10.1101/gr.1455503

M3 - Article

C2 - 14656963

AN - SCOPUS:0348013073

VL - 13

SP - 2559

EP - 2567

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 12

ER -