Skip to Main Navigation Skip to Search Skip to Main Content
Children's National Logo
  • Careers
  • Clinical Trials Search
  • Visit Us
  • News & Events
  • Our Labs & Teams
    Back
    • Close Sub-Menu
      • Center for Cancer & Immunology Research
        • Bollard Lab
        • Cellular Therapy Lab
        • Cruz Lab
        • D. Haydar Lab
        • Lin Lab
        • Pei Lab
        • Rood Lab
        • Zhang Lab
      • Center for Genetic Medicine Research
        • Caldovic Lab
        • Dauber Lab
        • Hahn Lab
        • Jaiswal Lab
        • The K-Labs
        • Koutroulis Lab
        • Morizono Lab
        • Nino Lab
        • Novak Lab
        • Pediatric Mendelian Genomics Research Center
        • Zohn Lab
      • Center for Translational Research
        • ADAPT Lab
        • Coburn Lab
        • Behavioral Diabetes Research Lab
        • HIV Research Lab
        • Mindful AI Lab
      • Center for Neuroscience Research
        • Corbin Lab
        • DC-IDDRC
        • Hashimoto Lab
        • Hsiao Lab
        • POLER Lab
        • Sidorov Lab
        • Torii Lab
        • Triplett Lab
        • Prenatal Infection and Neurodevelopmental Genetics Consortium
      • Center for Prenatal, Neonatal & Maternal Health Research
        • Developing Brain Institute
      • Sheikh Zayed Institute for Pediatric Surgical Innovation
        • 3D Cardiac Visualization Lab
        • Cha Lab
        • Medical Robotics Lab
        • Posnack Lab
        • Precision Medical Imaging Lab
      • Search Labs by Research Area
      • Find a Researcher or Innovator (A-Z)
Research & Innovation
  • Explore Our Research
    Close Sub-Menu Back
    • Our Centers & Institutes
      • Cancer and Immunology Research
      • Genetic Medicine Research
      • Neuroscience Research
      • Prenatal, Neonatal and Maternal Health Research
      • Translational Research
      • Sheikh Zayed Institute
      • Clinical and Translational Science Institute
      • Rare Disease Institute
    • Our Labs
    • Our People
    • Research Areas
    • Platforms & Facilities
    • Research, Education & Innovation Week
  • Innovation
  • Training
  • Research & Innovation Campus
  • Partnerships
    Close Sub-Menu Back
    • Children's National Hospital - Virginia Tech Initiatives
  • About Us
    Close Sub-Menu Back
    • Our Vision & Mission
    • Our History
    • Our Leadership
    • Administration
    • Reports & Statements
    • Support Us
    • Visit Us
    • Contact Us
  • Explore Our Research
    Close Sub-Menu Back
    • Explore Our Research }
    • Our Centers & Institutes }
      • Cancer and Immunology Research
      • Genetic Medicine Research
      • Neuroscience Research
      • Prenatal, Neonatal and Maternal Health Research
      • Translational Research
      • Sheikh Zayed Institute
      • Clinical and Translational Science Institute
      • Rare Disease Institute
    • Our Labs }
    • Our People }
    • Research Areas }
    • Platforms & Facilities }
    • Research, Education & Innovation Week }
  • Innovation
  • Training
  • Research & Innovation Campus
  • Partnerships
    Close Sub-Menu Back
    • Partnerships }
    • Children's National Hospital - Virginia Tech Initiatives }
  • About Us
    Close Sub-Menu Back
    • About Us }
    • Our Vision & Mission }
    • Our History }
    • Our Leadership }
    • Administration }
    • Reports & Statements }
    • Support Us }
    • Visit Us }
    • Contact Us }
  • Careers
  • Clinical Trials Search
  • Visit Us
  • News & Events
  • Our Labs & Teams
    Back
    • Close Sub-Menu
      • Center for Cancer & Immunology Research
        • Bollard Lab
        • Cellular Therapy Lab
        • Cruz Lab
        • D. Haydar Lab
        • Lin Lab
          • Pei Lab
          • Rood Lab
          • Zhang Lab
      • Center for Genetic Medicine Research
        • Caldovic Lab
        • Dauber Lab
        • Hahn Lab
        • Jaiswal Lab
        • The K-Labs
          • Koutroulis Lab
          • Morizono Lab
          • Nino Lab
          • Novak Lab
          • Pediatric Mendelian Genomics Research Center
          • Zohn Lab
      • Center for Translational Research
        • ADAPT Lab
        • Coburn Lab
        • Behavioral Diabetes Research Lab
        • HIV Research Lab
        • Mindful AI Lab
      • Center for Neuroscience Research
        • Corbin Lab
        • DC-IDDRC
        • Hashimoto Lab
        • Hsiao Lab
        • POLER Lab
          • Sidorov Lab
          • Torii Lab
          • Triplett Lab
          • Prenatal Infection and Neurodevelopmental Genetics Consortium
      • Center for Prenatal, Neonatal & Maternal Health Research
        • Developing Brain Institute
      • Sheikh Zayed Institute for Pediatric Surgical Innovation
        • 3D Cardiac Visualization Lab
        • Cha Lab
        • Medical Robotics Lab
        • Posnack Lab
        • Precision Medical Imaging Lab
      • Search Labs by Research Area
      • Find a Researcher or Innovator (A-Z)
CNRI Bioinformatics Unit

Pipelines

  • Overview
  • Authorship Policy
  • Pipelines
  • Meet the Team
  • Workshops/Seminars
  • Research and Innovation
  • Center for Translational Research
  • Resources
  • Informatics
  • CNRI Bioinformatics Unit
  • Pipelines
  • FastQ files containing the raw RNA-Seq reads output from the sequencers are first checked for quality using FastQC.

    It is followed by trimming of poor quality reads using any of the following tools:

    • Trimmomatic
    • BBDuk
    • cutadapt

    An aligner is used to align the reads to the reference genome (Human/Mouse etc) using any of the following tools:

    • Tophat2
    • Hisat2
    • STAR

    The aligned Sequence Alignment Map (SAM) files are sorted and converted to BAM using samtools in some cases.

    Next, the Binary Alignment Map (BAM) files are checked for their quality using either of the following tools:

    • Picard RNASeqmetrics
    • RSeQC

    In cases for fusion gene detection, we specifically use either the Tophat-fusion or STAR-Fusion tool. Splicing events are identified using SGSe tool part of the biocondutor package in R.

    Quantification of reads to Transcripts per million (TPM) is calculated using RSEM.

    For cases of high sequence duplication, we tag the duplicates using Picard MarkDuplicates and count the duplicated reads using featureCounts. In many cases of overduplication we also remove the duplicated reads using Picard MarkDuplicates.

    For raw unnormalized counts, we use either of the following tools:

    • Htseq Counts
    • featureCounts

    For Differential Gene expression, we use either the following packages in R:

    • Deseq2
    • EdgeR
    • Ebseq2

    The final data can be represented in the form of heatmaps, volcano plots, MA plots and PCA plots using basic R packages.

  • The raw sequencing data in fastqc files is first quality checked with FastQC and depending upon the quality it is trimmed using the following tools.

    • Trimmomatic
    • BBDuk
    • cutadapt

    An aligner is used to align the reads to the reference genome (Human/Mouse etc) using any of the following tools:

    • Bwa-mem
    • Bowtie2

    The aligned Sequence Alignment Map (SAM) files are processed to Binary Alignment Map (BAM) files:

    • samtools
    • Picard tools

    We follow the Genome Analysis Toolkit 4 (GATK 4), pipeline to do Variant Calling. The BAM files are further processed for using Base Quality Score Recalibration (BQSR) step to detect and correct base calling errors from sequencers:

    • BaseRecalibrator
    • ApplyBQSR

    Variant calling is performed on the aligned BAM files, producing genome variant call format (gvcf) files using HaplotypeCaller function.

    The gvcfs are combined using CombineGVCFs function, followed by running the GenotypeGVCFs function, to increase the sensitivity of the variants called. Output of this step is a vcf file.

    The vcf file is then further processed using Variant Quality Score Recalibration (VQSR) tools to increase the quality of variants identified, i.e. decrease the number of false positives.

    • VariantRecalibrator
    • ApplyVQSR

    Annotation of variant is done using Annovar, followed by manual filtration.

  • The raw sequencing data in fastqc files is first quality checked with FastQC and depending upon the quality it is trimmed using the following tools:

    • Trimmomatic
    • BBDuk
    • cutadapt

    Extraction and alignment of fragments of target molecules is performed followed by assembly of overlapping fragmented sequencing reads into long-enough CDR3 containing contigs using the MiXCR pipeline to analyze TCR or Ig repertoire from sequencing data.

  • The raw sequencing data in fastqc files is first quality checked with FastQC and depending upon the quality it is trimmed using the following tools.

    • Trimmomatic
    • BBDuk
    • cutadapt

    We perform demultiplexing and quality filtering, OTU picking, taxonomic assignment, and phylogenetic reconstruction, and diversity analyses and visualizations using QIIME1 pipeline.

Model is not null: True
Taxonomies:
Articles are not null: True
Article Count: 0
Error:
Publish Time: 5/9/2025 5:05:14 AM

Innovation District newsletter

Stay up-to-date on all the latest innovation & research news
  • facebook
  • twitter
  • instagram
  • linkedin
  • youtube
Research & Innovation at Children’s National
7144 13th Pl NW
Washington, DC 20012
202-555-3748 Contact Us
  • About Us
  • Partnerships
  • Training
  • Innovation
  • Labs
  • Research & Innovation Campus
Support Pediatric Research
Invest in our children to help make a difference
Donate
Main Hospital: 111 Michigan Avenue NW, Washington, DC, 20010 | 202-476-5000
  • Terms of Use
  • Patient Family Rights
  • Report a Compliance Concern
© Copyright Children's National Hospital 2025