Tamaño de fuente
  • A-
  • A
  • A+
Color del sitio
  • R
  • A
  • A
  • A
eCampus
  • INICIO
  • eCAMPUS
    AULA
  • FORO
  • English ‎(en)‎ Español - Internacional ‎(es)‎
  • Acceder
Salta al contenido principal

Tutorial RNASeq of comparative transcriptomics based on the species Sparus aurata

  1. Página Principal
  2. Nuestros cursos
  3. Tutorial RNASeq differential expression
  4. 1.2 - Tutorial material and case study
◄1. PRELIMINARY INFORMATION2. STEP-BY-STEP MODE TUTORIAL►
  • CONTENTS
  • 1. PRELIMINARY INFORMATION
    • 1.1 - Tutorial objective
    • 1.2 - Tutorial material and case study
    • 1.3 - Experiment design and support
    • 1.4 - Installing and activating RNASeq and the Server-Side
  • 2. STEP-BY-STEP MODE TUTORIAL
    • 2.1. - TOPHAT/HISAT2 & CUFFLINKS
      • 2.1.1 - Preparing your experiment
      • 2.1.2 - Quality analysis and preprocessing
      • 2.1.3 - Mapping
      • 2.1.4 - Transcriptome Assembly
      • 2.1.5 - Differential Expression analysis
      • 2.1.6 - GOSeq
    • 2.2. - MAPPING & COUNTING PROTOCOL
      • 2.2.1 - Preparing your experiment
      • 2.2.2 - Quality analysis and preprocessing
      • 2.2.3 - Mapping
      • 2.2.4 - Postprocessing
      • 2.2.5 - Differential Expression analysis
  • 3. PIPELINE MODE TUTORIAL
    • 3.1 - TopHat/Hisat2 & Cufflinks protocol
    • 3.2 - Mapping & Counting protocol
  • 4. BIBLIOGRAPHY
  • CITE US
  • 1.2 - Tutorial material and case study

    Within this tutorial, we use data from a case study of comparative transcriptomics based on the species Sparus aurata that was previously published in Pérez-Sánchez et al. (2019). The tutorial material consists of nine RNAseq samples from spleen biopsies from specimens of S. aurata. Specimens were separated into two groups: control (BC) (n = 4) and parasite-infected fishes (BI) (n = 5). In Table 1, we provide the nine fastq files with the following SRA Accessions, a summarization of each group and the assignation of samples per group.


                                                    Table 1: Samples and case study groups
    SRA accession Library Names Tags
    SRR8255970 ZFG-17-12_03_26333_S7_R1_001.fastq BC1
    SRR8255963   ZFG-17-12_06_26336_S10_R1_001.fastq BC2
    SRR8255962  ZFG-17-12_09_26339_S13_R1_001.fastq BC3
    SRR8255949  ZFG-17-12_12_26342_S16_R1_001.fastq BC4
    SRR8255945 ZFG-17-12_16_26346_S2_R1_001.fastq BI1
    SRR8255941 ZFG-17-12_20_26350_S6_R1_001.fastq BI2
    SRR8255956  ZFG-17-12_24_26354_S10_R1_001.fastq BI3
    SRR8255952  ZFG-17-12_28_26358_S14_R1_001.fastq BI4
    SRR8255939  ZFG-17-12_32_26362_S18_R1_001.fastq BI5

                                                    *BC = control; BI = Infected fish.


    The 9 fastq files can be downloaded from NCBI at https://www.ncbi.nlm.nih.gov/bioproject/PRJNA507368. If you need to help downloading this material from NCBI, contact us for support at https://forum.biotechvana.com.

    RefSeq material: To complete the tutorial you will need the following reference sequences:

    • The genome assembly draft of S. aurata (fSpaAur1.1 Torre de la Sal release) will be used as a reference genome sequence in the Tophat/Hisat2 & Cufflinks protocol.


    • The GTF file associated with the coding genes of the fSpaAur1.1 release will be used as a reference genome sequence in the Tophat/Hisat2 & Cufflinks protocol.


    • The RefSeq file of transcripts of S. aurata (fSpaAur1.1 release) will be used as a transcriptome reference sequence in the Mapping & counting protocol. 


    • A csv file with the functional descriptions and annotations for all gene features of S. aurata (fSpaAur1.1 release) that will be used to integrate functional information such as gene ontology (GO categories) descriptions or formal annotations to the results of differential expression. 

    You can download the RefSeq material from TorreLaSal CSIC Nutrigroup at https://nutrigroup-iats.org/welcome/request_file. For more details contact Professor Jaume Perez-Sanchez (jaime.perez.sanchez@csic.es). 

    Alternatively, you can also use the Refseq release provided by NBCI at https://www.ncbi.nlm.nih.gov/assembly/GCF_900880675.1. However, please note that NCBI release for S.aurata differ in size and annotations to the TorreLaSal release and so differential expression results would likely vary from the results presented in this tutorial (which is based on the TorreLaSal release.

    GOSeq input material: The tutorial demonstrates how to execute GOseq analyses for DE using either: Tophat/Hisat2 & Cufflinks” or “Mapping & Counting”. Enrichment analysis are performed with the software GOseq ( Young et al., 2010). As S. aurata is a customized species for this software, you need 4 input files for the analysis; 1) assayed genes; 2) differential expressed genes;  3) gene sizes, and 4) GO terms per gene.

    To facilitate this tutorial, we provide you with the following material:

    • “Assayed genes” → assayed_genes.txt
    • “Differentially expressed Genes” → diff_genes.csv
    • “Gene size” → length_genes.txt
    • “Go terms” → Go_final_saurata.txt

    The contents of these four files will differ if the analysis is performed via the “Tophat/Hisat2 & Cufflinks” or “Mapping & Counting” protocols. Nevertheless, the procedure is identical in both cases. For this reason, we provide you these four files pre-created for the “Tophat/Hisat2 & Cufflinks” simply to show you the format of each input file.

    Remember that these files are only valid for GOseq analyses performed in “Tophat/Hisat2 & Cufflinks”. If you want to complete a GOseq analysis under the “Mapping & Counting” protocol, you need to prepare these four files yourself. Similarly, the pre-prepared files for GOSeq analyses will not be valid if you use the NCBI release and so they must be prepared seperately.  

    ◄1.1 - Tutorial objective1.3 - Experiment design and support►
    • Página Principal
    • Calendario
    • Secciones del curso
      • CONTENTS
      • 1. PRELIMINARY INFORMATION
      • 1.1 - Tutorial objective
      • 1.2 - Tutorial material and case study
      • 1.3 - Experiment design and support
      • 1.4 - Installing and activating RNASeq and the Server-Side
      • 2. STEP-BY-STEP MODE TUTORIAL
      • 2.1. - TOPHAT/HISAT2 & CUFFLINKS
      • 2.1.1 - Preparing your experiment
      • 2.1.2 - Quality analysis and preprocessing
      • 2.1.3 - Mapping
      • 2.1.4 - Transcriptome Assembly
      • 2.1.5 - Differential Expression analysis
      • 2.1.6 - GOSeq
      • 2.2. - MAPPING & COUNTING PROTOCOL
      • 2.2.1 - Prepare your experiment for the mapping & counting analysis
      • 2.2.2 - Quality analysis and preprocessing
      • 2.2.3 - Mapping
      • 2.2.4 - Postprocessing
      • 2.2.5 - Differential Expression analysis
      • 3. PIPELINE MODE TUTORIAL
      • 3.1 - TopHat/Hisat2 & Cufflinks protocol
      • 3.2 - Mapping & Counting protocol
      • 4. BIBLIOGRAPHY
      • Pipeline mode: Tophat & Cufflinks protocol results
      • Pipeline mode: Mapping & Counting protocol results
      • CITE US
    Configuraciones de accesibilidad
    About us
    Team
    Publications
    R&D
    Patents & trademarks
    Announcements
    Careers
    Journal Sequencing Partners

    Biotechvana


    Valencia Lab
    Parc Cientific Universitat de Valencia
    Carrer del Catedràtic Agustín Escardino, 9. 46980 Paterna (Valencia) Spain
    Madrid Lab
    Parque Científico de Madrid
    Campus de Cantoblanco
    Calle Faraday 7, 28049 Madrid Spain
    Contact us
    Phone: +34 960 06 74 93
    Email: biotechvana@biotechvana.com

    Esta plataforma forma parte de: IVACE PROJECT IMDIGB/2020/56


    Projectes de Digitalizació de PIME (DIGITALIZA-CV TELETREBALL)2020
    IVACE PROJECT IMDIGB/2020/56

    Biotechvana © 2021
    eCampus Privacy policy    eCampus Terms of use