Simulation example data


Constant transcript coverage pool

This data set includes simulated reads for all transcripts at the same coverage, at several levels from 1X to 30X.

Example command line:

  spankisim_transcripts -o simcov_20intron_30 -g genemodels.gtf -f genome.fa -bp 76 -m flyheads -cov 30 -ends 2

Fastq files:
  • 1X coverage
  • 2X coverage
  • 3X coverage
  • 4X coverage
  • 5X coverage
  • 6X coverage
  • 7X coverage
  • 8X coverage
  • 9X coverage
  • 10X coverage
  • 15X coverage
  • 20X coverage
  • 30X coverage
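
The example command above covers only the 30X level. As a rough sketch, the loop below prints (without running) one command per listed coverage level, reusing only the flags shown in the example. The function name print_sim_commands is hypothetical, and the output naming pattern simcov_20intron_<coverage> is an assumption inferred from the single example name, not something the source states.

```shell
# Print, but do not execute, one spankisim_transcripts command per coverage level.
# Assumption: output directories are named simcov_20intron_<coverage>,
# extrapolated from the example name simcov_20intron_30.
print_sim_commands() {
  for cov in 1 2 3 4 5 6 7 8 9 10 15 20 30; do
    echo "spankisim_transcripts -o simcov_20intron_${cov} -g genemodels.gtf -f genome.fa -bp 76 -m flyheads -cov ${cov} -ends 2"
  done
}

print_sim_commands
```

Piping the output to sh would run the commands, provided genemodels.gtf and genome.fa are present in the working directory.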

Several pools at equal reads per kilobase (RPK)

This data set includes simulated reads for all transcripts at the same RPK (300 RPK, equivalent to ~23X coverage), generated in four independent trials.
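
The ~23X figure can be checked with a little arithmetic. The sketch below assumes that fold coverage is simply reads per kilobase times read length divided by 1000, with each read counted individually; the source does not state how the simulator counts mates, so treat this as an approximation. The helper name rpk_to_coverage is hypothetical and not part of Spanki.

```shell
# Approximate fold coverage from reads per kilobase (RPK) and read length in bp.
# Assumption: coverage = RPK * read_length / 1000, each read counted once.
rpk_to_coverage() {
  awk -v rpk="$1" -v len="$2" 'BEGIN { printf "%.1f\n", rpk * len / 1000 }'
}

rpk_to_coverage 300 76   # prints 22.8
```

With the 76 bp reads used in the example commands, 300 RPK works out to 22.8X, which matches the ~23X quoted above.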

Example command line:

  spankisim_transcripts -o simcov_20intron_rpk300_trial1 -g genemodels.gtf -f genome.fa -bp 76 -m flyheads -rpk 300 -ends 2
    
Fastq files:
  • trial 1
  • trial 2
  • trial 3
  • trial 4

Transcript models

The annotation used to generate these reads is available here: Drosophila Ensembl release 67 (edited)