Browse the corpus

Walk the Even Hospital Database by book and chapter — the raw source passages that ground Ask, DDx, and the rest.

37 passages

abstractpubmed· Abstract· item 34758253

100,000 Genomes Pilot on Rare-Disease Diagnosis in Health Care - Preliminary Report. BACKGROUND: The U.K. 100,000 Genomes Project is in the process of investigating the role of genome sequencing in patients with undiagnosed rare diseases after usual care and the alignment of this research with health care implementation in the U.K. National Health Service. Other parts of this project focus on patients with cancer and infection. METHODS: We conducted a pilot study involving 4660 participants from 2183 families, among whom 161 disorders covering a broad spectrum of rare diseases were present. We collected data on clinical features with the use of Human Phenotype Ontology terms, undertook genome sequencing, applied automated variant prioritization on the basis of applied virtual gene panels and phenotypes, and identified novel pathogenic variants through research analysis. RESULTS: Diagnostic yields varied among family structures and were highest in family trios (both parents and a proband) and families with larger pedigrees. Diagnostic yields were much higher for disorders likely to have a monogenic cause (35%) than for disorders likely to have a complex cause (11%). Diagnostic yields for intellectual disability, hearing disorders, and vision disorders ranged from 40 to 55%. We made genetic diagnoses in 25% of the probands. A total of 14% of the diagnoses were made by means of the combination of research and automated approaches, which was critical for cases in which we found etiologic noncoding, structural, and mitochondrial genome variants and coding variants poorly covered by exome sequencing. Cohortwide burden testing across 57,000 genomes enabled the discovery of three new disease genes and 19 new associations. Of the genetic diagnoses that we made, 25% had immediate ramifications for clinical decision making for the patients or their relatives. CONCLUSIONS: Our pilot study of genome sequencing in a national health care system showed an increase in diagnostic yield across a range of rare diseases. (Funded by the National Institute for Health Research and others.).

fulltextpubmed· METHODS· item 34758253

After approval from the national research ethics committee was obtained, we recruited participants who had been identified by health care professionals and researchers as having rare diseases (across a broad range of categories) that had not been diagnosed after receipt of usual care in the NHS, which included either no diagnostic tests (because none were available) or approved diagnostic tests that did not include genome sequencing. The participants were recruited at nine English hospitals, and written informed consent was obtained from the participants by the National Institute for Health Research (NIHR) BioResource for Rare Diseases.

fulltextpubmed· METHODS· item 34758253

agnostic tests (because none were available) or approved diagnostic tests that did not include genome sequencing. The participants were recruited at nine English hospitals, and written informed consent was obtained from the participants by the National Institute for Health Research (NIHR) BioResource for Rare Diseases. To test the broad applicability of genome sequencing, we determined that participants were eligible if they had a rare disease (as defined in the United Kingdom as a disorder affecting ≤1 in 2000 persons), were likely to have a single-gene or oligogenic cause, and had not received a genomic diagnosis. Data on previous testing in probands were collected when possible; testing included single-gene tests, karyotyping, single-nucleotide polymorphism arrays, next-generation sequencing panels, and exome sequencing. Probands and, when feasible, parents or other family members were enrolled across multiple clinical specialties in the NHS. Standardized baseline clinical data were recorded with the use of Human Phenotype Ontology (HPO) terms7 guided by disease-specific data models,8 and whole blood samples were obtained for DNA extraction. In the 100,000 Genomes Project, participants are followed over their life course with the use of electronic health records (all hospital episodes, registry entries, and cause of death).

fulltextpubmed· METHODS· item 34758253

man Phenotype Ontology (HPO) terms7 guided by disease-specific data models,8 and whole blood samples were obtained for DNA extraction. In the 100,000 Genomes Project, participants are followed over their life course with the use of electronic health records (all hospital episodes, registry entries, and cause of death). This pilot study was undertaken in partnership with the NIHR BioResource and is part of the portfolio of translational research at the NIHR Biomedical Research Centres at Barts, Cambridge University Hospitals NHS Foundation Trust, Great Ormond Street Hospital for Children NHS Foundation Trust, Manchester University NHS Foundation Trust, Moorfields Eye Hospital NHS Foundation Trust, Newcastle upon Tyne Hospitals NHS Foundation Trust, Oxford University Hospitals NHS Foundation Trust, and University College London Hospitals NHS Foundation Trust. Clinical data from the NHS and NHS Digital were used in this work.

fulltextpubmed· METHODS· item 34758253

Manchester University NHS Foundation Trust, Moorfields Eye Hospital NHS Foundation Trust, Newcastle upon Tyne Hospitals NHS Foundation Trust, Oxford University Hospitals NHS Foundation Trust, and University College London Hospitals NHS Foundation Trust. Clinical data from the NHS and NHS Digital were used in this work. Genome sequencing9 was performed with the use of the TruSeq DNA polymerase-chain-reaction (PCR)–free sample preparation kit (Illumina) on a HiSeq 2500 sequencer, which generates a mean depth of 32× (range, 27 to 54) and a depth greater than 15× for at least 95% of the reference human genome. Whole-genome sequencing reads were aligned to the Genome Reference Consortium human genome build 37 (GRCh37) with the use of Isaac Genome Alignment Software. Family-based variant calling of single-nucleotide variants (SNVs) and insertion or deletions (indels) for chromosomes 1 to 22, the X chromosome, and the mitochondrial genome (mean coverage, 2814×; range, 142 to 16,581) was performed with the use of the Platypus variant caller.10

fulltextpubmed· METHODS· item 34758253

use of Isaac Genome Alignment Software. Family-based variant calling of single-nucleotide variants (SNVs) and insertion or deletions (indels) for chromosomes 1 to 22, the X chromosome, and the mitochondrial genome (mean coverage, 2814×; range, 142 to 16,581) was performed with the use of the Platypus variant caller.10 We constructed an automated analytic pipeline to filter the genome down to rare, segregating, and predicted damaging candidate variants in coding regions. To limit the possibility of overlooking or inefficiently prioritizing diagnoses, we focused initially on applied virtual gene panels (applied panels) that were based on both the recruited clinical indication or disease and the submitted HPO terms. To address the issue of which genes have sufficient evidence to show causation and be included in these applied panels, we used our PanelApp software to enable expert, crowd-sourced review and curation of genes with diagnostic-grade evidence for each of our disease categories (e.g., evidence in at least three unrelated families).11 Loss-of-function or de novo protein-altering variants affecting genes in the applied panels were classified as tier 1, other variant types such as missense variants affecting these genes were classified as tier 2, and all other filtered variants were classified as tier 3 (Fig. S1 in the Supplementary Appendix, available with the full text of this article at NEJM.org). To further reduce the possibility of missing or inefficiently prioritized diagnoses, we used a phenotype-based approach with the Exomiser application12 to search across all genes in the genome for a diagnosis. Exomiser prioritizes rare, segregating, and predicted pathogenic variants in genes in which the patient phenotypes match previously referenced knowledge from human disease or model organism databases. The ontology-driven phenotype matching can identify patients who have an atypical profile for a disease. Additional details regarding the Exomiser are provided in the Diagnostic Pipeline section in the Supplementary Appendix.

fulltextpubmed· METHODS· item 34758253

enotypes match previously referenced knowledge from human disease or model organism databases. The ontology-driven phenotype matching can identify patients who have an atypical profile for a disease. Additional details regarding the Exomiser are provided in the Diagnostic Pipeline section in the Supplementary Appendix. Prioritization of variants and return of candidate variants for presentation to the 13 NHS Genomic Medicine Centres (GMCs) were performed with the use of decision-support systems and with assistance from clinical genetics teams from Congenica and Fabric Genomics.13,14 These variants were reviewed by NHS clinical scientists and clinicians using the guidelines of the American College of Medical Genetics and Genomics, and a diagnostic report was issued for each proband.15 Final clinical outcomes included whether a genetic diagnosis was obtained, identification of the variant or variants involved, whether the variant or variants explained all or some of the phenotypes, and whether an intervention was used. Recruitment of the participants in the pilot study and sequencing were performed during the period from January 2014 through December 2016, while the infrastructure to collect, quality check, process, and return data was being established. Results were returned to the GMCs from May 2016 through April 2019. Now that the information pipeline has been established (post-pilot phase), results are returned to the GMCs within 6 weeks after the sample is obtained.

fulltextpubmed· METHODS· item 34758253

while the infrastructure to collect, quality check, process, and return data was being established. Results were returned to the GMCs from May 2016 through April 2019. Now that the information pipeline has been established (post-pilot phase), results are returned to the GMCs within 6 weeks after the sample is obtained. Researchers investigated coding and noncoding regions to detect novel diagnostic variants in genes matching the patients’ phenotypes, including the presence of de novo variants in highly constrained coding regions16 in the 95th percentile. We use the term novel to describe diagnostic variants we have detected that have not previously been described in the literature as causative. This is distinct from de novo variants, which are present for the first time in a family member due to either a new variant in an egg or sperm or a new mutation at conception. The variant may have been previously described. We used a new method described by Wei et al.17 to analyze mitochondrial DNA that accounts for heteroplasmy, the Genomiser to detect noncoding pathogenic variants,18 and the ExpansionHunter software tool to detect simple tandem repeat expansions.19 Finally we used a new random forest method to analyze Canvas20 and Manta21 calls and to identify potentially pathogenic copy-number and structural variants.

fulltextpubmed· METHODS· item 34758253

ounts for heteroplasmy, the Genomiser to detect noncoding pathogenic variants,18 and the ExpansionHunter software tool to detect simple tandem repeat expansions.19 Finally we used a new random forest method to analyze Canvas20 and Manta21 calls and to identify potentially pathogenic copy-number and structural variants. Gene-based burden testing to detect enrichment of rare, predicted pathogenic, and segregating variants in novel genes in specific disease cohorts relative to controls was performed on the genomes in the pilot study as well as on additional genomes from the rest of the 100,000 Genomes Project to increase power (57,002 genomes; see the Supplementary Methods in the Supplementary Appendix). The genomic and clinical data from the pilot study are freely accessible to members of a Genomics England Clinical Interpretation Partnership domain (https://www.genomicsengland.co.uk/about-gecip/). Testing was performed with the use of the R software, version 3.6.0 (R Foundation for Statistical Computing), and Stata software, version 16 (StataCorp). Further details on the individual methods used in the study are provided in the Supplementary Appendix.

fulltextpubmed· GENOME SEQUENCING· item 34758253

Genome sequencing9 was performed with the use of the TruSeq DNA polymerase-chain-reaction (PCR)–free sample preparation kit (Illumina) on a HiSeq 2500 sequencer, which generates a mean depth of 32× (range, 27 to 54) and a depth greater than 15× for at least 95% of the reference human genome. Whole-genome sequencing reads were aligned to the Genome Reference Consortium human genome build 37 (GRCh37) with the use of Isaac Genome Alignment Software. Family-based variant calling of single-nucleotide variants (SNVs) and insertion or deletions (indels) for chromosomes 1 to 22, the X chromosome, and the mitochondrial genome (mean coverage, 2814×; range, 142 to 16,581) was performed with the use of the Platypus variant caller.10

fulltextpubmed· DIAGNOSTIC PIPELINE· item 34758253

We constructed an automated analytic pipeline to filter the genome down to rare, segregating, and predicted damaging candidate variants in coding regions. To limit the possibility of overlooking or inefficiently prioritizing diagnoses, we focused initially on applied virtual gene panels (applied panels) that were based on both the recruited clinical indication or disease and the submitted HPO terms. To address the issue of which genes have sufficient evidence to show causation and be included in these applied panels, we used our PanelApp software to enable expert, crowd-sourced review and curation of genes with diagnostic-grade evidence for each of our disease categories (e.g., evidence in at least three unrelated families).11 Loss-of-function or de novo protein-altering variants affecting genes in the applied panels were classified as tier 1, other variant types such as missense variants affecting these genes were classified as tier 2, and all other filtered variants were classified as tier 3 (Fig. S1 in the Supplementary Appendix, available with the full text of this article at NEJM.org). To further reduce the possibility of missing or inefficiently prioritized diagnoses, we used a phenotype-based approach with the Exomiser application12 to search across all genes in the genome for a diagnosis. Exomiser prioritizes rare, segregating, and predicted pathogenic variants in genes in which the patient phenotypes match previously referenced knowledge from human disease or model organism databases. The ontology-driven phenotype matching can identify patients who have an atypical profile for a disease. Additional details regarding the Exomiser are provided in the Diagnostic Pipeline section in the Supplementary Appendix.

fulltextpubmed· NOVEL PATHOGENIC VARIANTS· item 34758253

Researchers investigated coding and noncoding regions to detect novel diagnostic variants in genes matching the patients’ phenotypes, including the presence of de novo variants in highly constrained coding regions16 in the 95th percentile. We use the term novel to describe diagnostic variants we have detected that have not previously been described in the literature as causative. This is distinct from de novo variants, which are present for the first time in a family member due to either a new variant in an egg or sperm or a new mutation at conception. The variant may have been previously described. We used a new method described by Wei et al.17 to analyze mitochondrial DNA that accounts for heteroplasmy, the Genomiser to detect noncoding pathogenic variants,18 and the ExpansionHunter software tool to detect simple tandem repeat expansions.19 Finally we used a new random forest method to analyze Canvas20 and Manta21 calls and to identify potentially pathogenic copy-number and structural variants. Gene-based burden testing to detect enrichment of rare, predicted pathogenic, and segregating variants in novel genes in specific disease cohorts relative to controls was performed on the genomes in the pilot study as well as on additional genomes from the rest of the 100,000 Genomes Project to increase power (57,002 genomes; see the Supplementary Methods in the Supplementary Appendix). The genomic and clinical data from the pilot study are freely accessible to members of a Genomics England Clinical Interpretation Partnership domain (https://www.genomicsengland.co.uk/about-gecip/).

fulltextpubmed· STATISTICAL ANALYSIS· item 34758253

Testing was performed with the use of the R software, version 3.6.0 (R Foundation for Statistical Computing), and Stata software, version 16 (StataCorp). Further details on the individual methods used in the study are provided in the Supplementary Appendix.