We present a database, dbQSNP (http://qsnp.gen.kyushu-u.ac.jp/), that provides sequence and allele frequency information for single-nucleotide polymorphisms (SNPs) located in the promoter regions of human genes, with the promoter regions defined by the 5′ ends of full-length cDNA clones. We searched for SNPs in these regions by sequencing or single-strand conformation polymorphism (SSCP) analysis. The allele frequencies of the identified SNPs in two ethnic groups were quantified by SSCP analysis of pooled DNA samples. The accuracy of our estimates is supported by strong correlations between the frequencies in our data and those in other databases for the same ethnic groups. The frequencies vary considerably between the two ethnic groups studied, suggesting the need for population-based collection and allele frequency determination of SNPs, for example in disease association studies. We show profiles of SNP densities that are characteristic of transcription start site regions. A fraction of the SNPs showed significantly different allele frequencies between the groups, suggesting differential selection of the genes involved.