%0 Journal Article %J Science %D 2012 %T A systematic survey of loss-of-function variants in human protein-coding genes. %A MacArthur, Daniel G %A Balasubramanian, Suganthi %A Frankish, Adam %A Huang, Ni %A Morris, James %A Walter, Klaudia %A Jostins, Luke %A Habegger, Lukas %A Pickrell, Joseph K %A Montgomery, Stephen B %A Albers, Cornelis A %A Zhang, Zhengdong D %A Conrad, Donald F %A Lunter, Gerton %A Zheng, Hancheng %A Ayub, Qasim %A DePristo, Mark A %A Banks, Eric %A Hu, Min %A Handsaker, Robert E %A Rosenfeld, Jeffrey A %A Fromer, Menachem %A Jin, Mike %A Mu, Xinmeng Jasmine %A Khurana, Ekta %A Ye, Kai %A Kay, Mike %A Saunders, Gary Ian %A Suner, Marie-Marthe %A Hunt, Toby %A Barnes, If H A %A Amid, Clara %A Carvalho-Silva, Denise R %A Bignell, Alexandra H %A Snow, Catherine %A Yngvadottir, Bryndis %A Bumpstead, Suzannah %A Cooper, David N %A Xue, Yali %A Romero, Irene Gallego %A Wang, Jun %A Li, Yingrui %A Gibbs, Richard A %A McCarroll, Steven A %A Dermitzakis, Emmanouil T %A Pritchard, Jonathan K %A Barrett, Jeffrey C %A Harrow, Jennifer %A Hurles, Matthew E %A Gerstein, Mark B %A Tyler-Smith, Chris %K Disease %K Gene Expression %K Gene Frequency %K Genetic Variation %K Genome, Human %K Humans %K Phenotype %K Polymorphism, Single Nucleotide %K Proteins %K Selection, Genetic %X

Genome-sequencing studies indicate that all humans carry many genetic variants predicted to cause loss of function (LoF) of protein-coding genes, suggesting unexpected redundancy in the human genome. Here we apply stringent filters to 2951 putative LoF variants obtained from 185 human genomes to determine their true prevalence and properties. We estimate that human genomes typically contain ~100 genuine LoF variants with ~20 genes completely inactivated. We identify rare and likely deleterious LoF alleles, including 26 known and 21 predicted severe disease-causing variants, as well as common LoF variants in nonessential genes. We describe functional and evolutionary differences between LoF-tolerant and recessive disease genes and a method for using these differences to prioritize candidate genes found in clinical sequencing studies.

%B Science %V 335 %P 823-8 %8 2012 Feb 17 %G eng %N 6070 %1 https://www.ncbi.nlm.nih.gov/pubmed/22344438?dopt=Abstract %R 10.1126/science.1215040