QC of RNAseq files for Montipora capitata larval thermal tolerance 2023 project

This post details initial QC of the Montipora capitata 2023 larval thermal tolerance project RNAseq files. See my notebook posts and my GitHub repo for information on this project. Also see my notebook post on sequence download and SRA upload for these files.

1. Run MultiQC on raw data (untrimmed and unfiltered)

cd /data/putnamlab/ashuffmyer/mcap-2023-rnaseq
cd scripts

nano qc_raw.sh

#!/bin/bash
#SBATCH -t 120:00:00
#SBATCH --nodes=1 --ntasks-per-node=20
#SBATCH --mem=100GB
#SBATCH --account=putnamlab
#SBATCH --export=NONE
#SBATCH --output="%x_out.%j"
#SBATCH --error="%x_err.%j"
#SBATCH -D /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/raw-sequences

# load modules needed
module load fastp/0.19.7-foss-2018b
module load FastQC/0.11.8-Java-1.8
module load MultiQC/1.9-intel-2020a-Python-3.8.2

# fastqc of raw reads
fastqc *.fastq.gz

#generate multiqc report
multiqc ./ --filename multiqc_report_raw.html 

echo "Raw MultiQC report generated." $(date)

Run the script.

sbatch qc_raw.sh

Run as Job ID 303829 on 23 Feb 2024

Finised by 24 Feb 2024.

Move .fastqc files to a QC folder to keep folder organized.

mkdir raw_fastqc
mv *fastqc.html raw_fastqc
mv *fastqc.zip raw_fastqc
mv multiqc* raw_fastqc

Download the multiQC report to my desktop to view (in a new terminal not logged into Andromeda).

scp ashuffmyer@ssh3.hac.uri.edu:/data/putnamlab/ashuffmyer/mcap-2023-rnaseq/raw-sequences/raw_fastqc/multiqc_report_raw.html ~/MyProjects/larval_symbiont_TPC/data/rna_seq/QC

The raw sequence MultiQC report can be found on GitHub here.

Here are the things I noticed from the QC report:

See this previous post with statistics of samples provided by Azenta.

Samples with lowest sequence counts
- Samples R75, R99, R67, R83, R91, R107, R59 have <10M sequence counts. These are randomly distributed between treatments, so if they are removed it doesn’t cause a problem. A standard approach is to have samples with >5M mapped reads for DEG analysis. If we have <5M reads after mapping they will likely be removed or we will need to see how or if results from these samples differ from others using PCA’s, etc.
- I emailed Azenta on 20240226 to inquire about library prep QC, methods, and the option to resequence these low read depth libraries.
- R75: Bleached Cladocopium at 30°C; Batch 7; Qbit RNA 16.1; Nanodrop RNA 12.5
- R99: Nonbleached Mixed at 33°C; Batch 9; Qbit RNA 21.6; Nanodrop RNA 14.1
- R67: Wildtype at 27°C; Batch 8; Qbit RNA 20.7; Nanodrop RNA 18.6
- R83: Nonbleached Mixed at 33°C; Batch 7; Qbit RNA 18.15; Nanodrop RNA 13.5
- R91: Bleached Cladocopium at 33°C; Batch 7; Qbit RNA 25.2; Nanodrop RNA 18.6
- R107: Wildtype at 33°C; Batch 9; Qbit RNA 16.85; Nanodrop RNA 11.7
- R59: Bleached Cladocopium at 27°C; Batch 9; Qbit RNA 17.75; Nanodrop RNA 10.7
- All other samples had RNA concentrations ranging form Qbit 15-40 and Nanodrop 9-25. They are not all from one batch either. Low sequence counts do not appear to correlate with original RNA extraction or concentration and may be due to library prep or something that happened during sequencing.

Sample Name	% Dups	% GC	M Seqs
R77_R1_001	73.5%	44%	39.7
R77_R2_001	72.9%	44%	39.7
R69_R1_001	75.0%	44%	39.5
R69_R2_001	74.2%	44%	39.5
R101_R1_001	76.1%	43%	38.5
R101_R2_001	75.2%	44%	38.5
R85_R1_001	74.7%	43%	36.9
R85_R2_001	73.8%	44%	36.9
R80_R1_001	74.2%	44%	36.1
R80_R2_001	73.4%	44%	36.1
R96_R1_001	75.2%	45%	35.9
R96_R2_001	74.3%	45%	35.9
R98_R1_001	74.2%	44%	35.9
R98_R2_001	73.4%	44%	35.9
R82_R1_001	74.6%	43%	35.5
R82_R2_001	73.8%	44%	35.5
R78_R1_001	73.4%	43%	35.4
R78_R2_001	72.8%	44%	35.4
R73_R1_001	74.0%	44%	35.2
R73_R2_001	73.3%	44%	35.2
R106_R1_001	73.2%	43%	34.9
R106_R2_001	72.4%	44%	34.9
R74_R1_001	73.3%	44%	34.9
R74_R2_001	72.7%	44%	34.9
R100_R1_001	75.4%	44%	34.6
R100_R2_001	74.5%	44%	34.6
R70_R1_001	73.8%	43%	34.3
R70_R2_001	73.0%	44%	34.3
R72_R1_001	73.1%	43%	34.0
R72_R2_001	72.3%	44%	34.0
R102_R1_001	76.1%	44%	33.7
R102_R2_001	75.1%	44%	33.7
R81_R1_001	72.2%	44%	33.5
R81_R2_001	71.4%	45%	33.5
R97_R1_001	75.4%	43%	33.4
R97_R2_001	74.6%	44%	33.4
R89_R1_001	74.3%	43%	33.1
R89_R2_001	73.6%	44%	33.1
R76_R1_001	74.2%	44%	33.0
R76_R2_001	73.7%	44%	33.0
R86_R1_001	73.5%	43%	32.7
R86_R2_001	72.7%	44%	32.7
R94_R1_001	76.1%	44%	32.5
R94_R2_001	75.5%	44%	32.5
R64_R1_001	73.2%	44%	32.3
R64_R2_001	72.5%	44%	32.3
R66_R1_001	75.0%	43%	32.3
R66_R2_001	74.3%	44%	32.3
R84_R1_001	74.5%	43%	32.2
R84_R2_001	73.7%	44%	32.2
R65_R1_001	73.0%	43%	32.1
R65_R2_001	72.1%	44%	32.1
R90_R1_001	73.9%	43%	31.9
R90_R2_001	73.2%	44%	31.9
R93_R1_001	82.9%	46%	31.6
R93_R2_001	82.3%	46%	31.6
R68_R1_001	72.7%	43%	30.9
R68_R2_001	72.1%	44%	30.9
R104_R1_001	72.7%	44%	30.6
R104_R2_001	72.0%	44%	30.6
R88_R1_001	73.5%	44%	30.6
R88_R2_001	72.8%	45%	30.6
R105_R1_001	72.8%	43%	29.4
R105_R2_001	72.1%	44%	29.4
R108_R1_001	72.7%	43%	29.3
R108_R2_001	72.2%	44%	29.3
R92_R1_001	76.2%	43%	28.6
R92_R2_001	75.5%	44%	28.6
R71_R1_001	72.2%	43%	27.8
R71_R2_001	71.7%	44%	27.8
R95_R1_001	74.0%	43%	27.4
R95_R2_001	73.3%	44%	27.4
R79_R1_001	73.5%	44%	25.0
R79_R2_001	72.7%	45%	25.0
R87_R1_001	71.9%	43%	24.4
R87_R2_001	71.5%	43%	24.4
R63_R1_001	73.2%	43%	23.8
R63_R2_001	72.6%	44%	23.8
R103_R1_001	73.4%	44%	22.0
R103_R2_001	72.7%	44%	22.0
R61_R1_001	72.6%	44%	19.9
R61_R2_001	70.4%	44%	19.9
R58_R1_001	71.9%	43%	18.0
R58_R2_001	69.6%	44%	18.0
R56_R1_001	70.4%	44%	17.5
R56_R2_001	68.3%	45%	17.5
R62_R1_001	71.8%	44%	16.7
R62_R2_001	69.5%	45%	16.7
R60_R1_001	70.8%	44%	16.3
R60_R2_001	68.6%	45%	16.3
R55_R1_001	72.1%	43%	15.8
R55_R2_001	69.8%	44%	15.8
R57_R1_001	72.4%	43%	15.2
R57_R2_001	70.1%	44%	15.2
R75_R1_001	65.4%	43%	9.9
R75_R2_001	65.1%	44%	9.9
R99_R1_001	65.3%	44%	8.4
R99_R2_001	64.9%	44%	8.4
R67_R1_001	63.6%	43%	7.7
R67_R2_001	63.2%	44%	7.7
R83_R1_001	64.1%	43%	7.5
R83_R2_001	63.5%	44%	7.5
R91_R1_001	64.1%	43%	6.9
R91_R2_001	63.7%	44%	6.9
R107_R1_001	63.7%	43%	6.7
R107_R2_001	63.2%	44%	6.7
R59_R1_001	60.2%	44%	6.0
R59_R2_001	59.2%	44%	6.0

Adapter content present in sequences, as expected, because we have not yet removed adapters.

Some samples have warnings for GC content. We will revisit this after trimming.

There are a high proportion of overrepresented sequences. This is expected with RNAseq data as there are likely genes that are highly and consistently expressed.

All reads are the same length (150 bp).
High quality scores.

Low/no N content

2. Trimming adapters

I first ran a step to trim adapters from sequences. I will then generate another QC report to look at the results before making other trimming decisions.

I first moved the .md5 files to an md5 folder to keep only .fastq files in raw-sequences.

cd raw-sequences
mkdir md5_files

mv *.md5 md5_files

Make a new folder for trimmed sequences.

cd /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/
mkdir trimmed-sequences

cd scripts
nano trim_adapters.sh

Generate a script adapted from Jill Ashey’s script here.

I will use the following settings for trimming in fastp. Fastp documentation can be found here.

detect_adapter_for_pe \
- This enables auto detection of adapters for paired end data
qualified_quality_phred 30 \
- Filters reads based on phred score >=30
unqualified_percent_limit 10 \
- percents of bases are allowed to be unqualified, set here as 10%
length_required 100 \
- Removes reads shorter than 100 bp. We have read lengths of 150 bp.
cut_right cut_right_window_size 5 cut_right_mean_quality 20
- Jill used this sliding cut window in her script. I am going to leave it out for now and evaluate the QC to see if we need to implement cutting.

#!/bin/bash
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=1
#SBATCH --export=NONE
#SBATCH --mem=100GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=ashuffmyer@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH -D /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/scripts           
#SBATCH -o adapter-trim-%j.out
#SBATCH -e adapter-trim-%j.error

# Load modules needed 
module load fastp/0.19.7-foss-2018b
module load FastQC/0.11.8-Java-1.8
module load MultiQC/1.9-intel-2020a-Python-3.8.2

# Make an array of sequences to trim in raw data directory 

cd /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/raw-sequences/
array1=($(ls *R1_001.fastq.gz))

echo "Read trimming of adapters started." $(date)

# fastp and fastqc loop 
for i in ${array1[@]}; do
    fastp --in1 ${i} \
        --in2 $(echo ${i}|sed s/_R1/_R2/)\
        --out1 /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences/trim.${i} \
        --out2 /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences/trim.$(echo ${i}|sed s/_R1/_R2/) \
        --detect_adapter_for_pe \
        --qualified_quality_phred 30 \
        --unqualified_percent_limit 10 \
        --length_required 100 

done

echo "Read trimming of adapters completed." $(date)

sbatch trim_adapters.sh

Job ID 303864
Ran on Feb 24 2024

An example output from the error file looks like this:

Detecting adapter sequence for read1...
Illumina TruSeq Adapter Read 1: AGATCGGAAGAGCACACGTCTGAACTCCAGTCA

Detecting adapter sequence for read2...
No adapter detected for read2

Read1 before filtering:
total reads: 33729404
total bases: 5059410600
Q20 bases: 4895787090(96.766%)
Q30 bases: 4604173440(91.0022%)

Read1 after filtering:
total reads: 21719378
total bases: 3192545540
Q20 bases: 3162771852(99.0674%)
Q30 bases: 3094582490(96.9315%)

Read2 before filtering:
total reads: 33729404
total bases: 5059410600
Q20 bases: 4807178351(95.0146%)
Q30 bases: 4428591566(87.5318%)

Read2 aftering filtering:
total reads: 21719378
total bases: 3192740421
Q20 bases: 3160586478(98.9929%)
Q30 bases: 3088498530(96.735%)

Filtering result:
reads passed filter: 43438756
reads failed due to low quality: 23412708
reads failed due to too many N: 334
reads failed due to too short: 607010
reads with adapter trimmed: 8474961
bases trimmed due to adapters: 213620961

Duplication rate: 62.6758%

Insert size peak (evaluated by paired-end reads): 166

JSON report: fastp.json
HTML report: fastp.html

Completed early morning Feb 25 2024.

Move script out and error files and fastp files to the trimmed sequence folder to keep things organized.

cd raw-sequences
mv adapter-trim* ../trimmed-sequences/
mv fastp* ../trimmed-sequences/

I then downloaded the fastp.html report to look at trimming information.

scp ashuffmyer@ssh3.hac.uri.edu:/data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences/fastp.html ~/MyProjects/larval_symbiont_TPC/data/rna_seq/QC

This file can be found on GitHub here.

Here are the results:

General statistics

fastp version:	0.19.7 (https://github.com/OpenGene/fastp)
sequencing:	paired end (150 cycles + 150 cycles)
mean length before filtering:	150bp, 150bp
mean length after filtering:	147bp, 147bp
duplication rate:	20.022686%
Insert size peak:	133
Detected read1 adapter:	AGATCGGAAGAGCACACGTCTGAACTCCAGTCA

Before filtering

total reads:	16.841214 M
total bases:	2.526182 G
Q20 bases:	2.421432 G (95.853403%)
Q30 bases:	2.262788 G (89.573449%)
GC content:	44.519841%

After filtering

total reads:	11.487322 M
total bases:	1.692048 G
Q20 bases:	1.678653 G (99.208366%)
Q30 bases:	1.645756 G (97.264174%)
GC content:	43.779533%

Filtering results

reads passed filters:	11.487322 M (68.209584%)
reads with low quality:	5.219646 M (30.993288%)
reads with too many N:	150 (0.000891%)
reads too short:	134.096000 K (0.796237%)

These results show that filtering improved quality of reads and removed about 30% of reads due to length (a small proportion) and quality (most of reads removed). Average Q30 bases improved from 89% to 97%. Adapters were removed.

Next, I’ll run another round of fastqc and multiqc to see how this changed or improved our qc results.

2. FastQC and MultiQC on trimmed sequences

Make a script for running QC on trimmed sequences.

cd /data/putnamlab/ashuffmyer/mcap-2023-rnaseq
cd scripts

nano qc_trimmed.sh

#!/bin/bash
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=1
#SBATCH --export=NONE
#SBATCH --mem=100GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=ashuffmyer@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH -D /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences           
#SBATCH -o trimmed-qc-%j.out
#SBATCH -e trimmed-qc-%j.error

# load modules needed
module load fastp/0.19.7-foss-2018b
module load FastQC/0.11.8-Java-1.8
module load MultiQC/1.9-intel-2020a-Python-3.8.2

# fastqc of raw reads
fastqc *.fastq.gz 

#generate multiqc report
multiqc ./ --filename multiqc_report_trimmed.html 

echo "Trimmed MultiQC report generated." $(date)

Run the script.

sbatch qc_trimmed.sh

Job 303885 started on 25 Feb in the morning.

Job ended on 25 Feb in the evening.

Make folders for QC files to go into. Move QC files. I tried to set output directories but it was failing, so I’m just manually moving them for now.

cd /data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences
mkdir trimmed_qc_files 

mv *fastqc.html trimmed_qc_files
mv *fastqc.zip trimmed_qc_files
mv multiqc* trimmed_qc_files
mv trimmed-qc-303* trimmed_qc_files

Copy the multiqc html file to my computer.

scp ashuffmyer@ssh3.hac.uri.edu:/data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences/trimmed_qc_files/multiqc_report_trimmed.html ~/MyProjects/larval_symbiont_TPC/data/rna_seq/QC

The MultiQC report results are below. The seven samples that had <10M reads had reads reduced and now they are <5M reads. These will likely need to be removed from analyses. The remaining samples have 9M-25M reads. We will reevaluate which samples should be removed from analyses after the mapping step and after I hear back from Azenta.

This report also included the fastp stats, which was new to me!

Here are some of the main results:

Sample Name	% Duplication	GC content	% PF	% Adapter	% Dups	% GC	M Seqs
fastp	20.0%	43.8%	68.2%	12.0%
trim.R100_R1_001					75.3%	43%	22.6
trim.R100_R2_001					75.3%	43%	22.6
trim.R101_R1_001					75.7%	43%	24.7
trim.R101_R2_001					75.7%	43%	24.7
trim.R102_R1_001					75.8%	43%	21.7
trim.R102_R2_001					75.7%	43%	21.7
trim.R103_R1_001					73.1%	43%	15.0
trim.R103_R2_001					73.1%	43%	15.0
trim.R104_R1_001					72.3%	43%	20.3
trim.R104_R2_001					72.2%	43%	20.3
trim.R105_R1_001					72.5%	43%	19.3
trim.R105_R2_001					72.5%	43%	19.3
trim.R106_R1_001					72.8%	43%	23.3
trim.R106_R2_001					72.8%	43%	23.3
trim.R107_R1_001					50.2%	43%	4.8
trim.R107_R2_001					50.1%	43%	4.8
trim.R108_R1_001					72.5%	43%	19.6
trim.R108_R2_001					72.6%	43%	19.6
trim.R55_R1_001					72.4%	43%	10.4
trim.R55_R2_001					72.2%	43%	10.4
trim.R56_R1_001					71.0%	44%	11.0
trim.R56_R2_001					70.7%	44%	11.0
trim.R57_R1_001					72.9%	43%	9.8
trim.R57_R2_001					72.5%	43%	9.8
trim.R58_R1_001					72.7%	43%	11.6
trim.R58_R2_001					72.4%	43%	11.6
trim.R59_R1_001					39.1%	43%	4.2
trim.R59_R2_001					38.9%	43%	4.2
trim.R60_R1_001					71.8%	43%	10.4
trim.R60_R2_001					71.4%	43%	10.4
trim.R61_R1_001					73.3%	43%	12.8
trim.R61_R2_001					73.0%	43%	12.8
trim.R62_R1_001					72.6%	43%	10.6
trim.R62_R2_001					72.4%	43%	10.6
trim.R63_R1_001					72.8%	43%	16.4
trim.R63_R2_001					72.8%	43%	16.4
trim.R64_R1_001					72.7%	43%	21.6
trim.R64_R2_001					72.7%	43%	21.6
trim.R65_R1_001					72.6%	43%	20.9
trim.R65_R2_001					72.6%	43%	20.9
trim.R66_R1_001					74.6%	43%	21.7
trim.R66_R2_001					74.7%	43%	21.7
trim.R67_R1_001					52.6%	42%	5.4
trim.R67_R2_001					52.7%	43%	5.4
trim.R68_R1_001					72.4%	43%	20.7
trim.R68_R2_001					72.5%	43%	20.7
trim.R69_R1_001					74.7%	44%	26.1
trim.R69_R2_001					74.6%	44%	26.1
trim.R70_R1_001					73.5%	43%	22.7
trim.R70_R2_001					73.6%	43%	22.7
trim.R71_R1_001					71.8%	43%	19.0
trim.R71_R2_001					71.8%	43%	19.0
trim.R72_R1_001					72.6%	43%	22.0
trim.R72_R2_001					72.5%	43%	22.0
trim.R73_R1_001					73.6%	43%	23.2
trim.R73_R2_001					73.6%	43%	23.2
trim.R74_R1_001					72.9%	44%	23.4
trim.R74_R2_001					73.0%	44%	23.4
trim.R75_R1_001					55.7%	43%	7.0
trim.R75_R2_001					55.9%	43%	7.0
trim.R76_R1_001					74.0%	43%	22.1
trim.R76_R2_001					74.1%	43%	22.1
trim.R77_R1_001					73.1%	43%	26.7
trim.R77_R2_001					73.0%	43%	26.7
trim.R78_R1_001					73.1%	43%	23.2
trim.R78_R2_001					73.2%	43%	23.2
trim.R79_R1_001					73.0%	44%	16.7
trim.R79_R2_001					72.9%	44%	16.7
trim.R80_R1_001					73.7%	43%	23.5
trim.R80_R2_001					73.6%	43%	23.5
trim.R81_R1_001					71.7%	44%	21.5
trim.R81_R2_001					71.6%	44%	21.5
trim.R82_R1_001					74.1%	43%	23.4
trim.R82_R2_001					74.1%	43%	23.4
trim.R83_R1_001					51.3%	43%	5.3
trim.R83_R2_001					51.4%	43%	5.3
trim.R84_R1_001					74.2%	43%	21.3
trim.R84_R2_001					74.1%	43%	21.3
trim.R85_R1_001					74.2%	43%	23.8
trim.R85_R2_001					74.1%	43%	23.8
trim.R86_R1_001					73.0%	43%	21.0
trim.R86_R2_001					73.0%	43%	21.0
trim.R87_R1_001					71.5%	43%	17.0
trim.R87_R2_001					71.6%	43%	17.0
trim.R88_R1_001					73.0%	44%	20.0
trim.R88_R2_001					73.1%	44%	20.0
trim.R89_R1_001					73.9%	43%	21.8
trim.R89_R2_001					74.0%	43%	21.8
trim.R90_R1_001					73.5%	43%	21.6
trim.R90_R2_001					73.5%	43%	21.6
trim.R91_R1_001					51.1%	43%	4.8
trim.R91_R2_001					51.2%	43%	4.8
trim.R92_R1_001					76.0%	43%	19.0
trim.R92_R2_001					76.0%	43%	19.0
trim.R93_R1_001					83.0%	45%	21.3
trim.R93_R2_001					83.1%	45%	21.3
trim.R94_R1_001					76.0%	43%	21.3
trim.R94_R2_001					76.1%	43%	21.3
trim.R95_R1_001					73.7%	43%	18.3
trim.R95_R2_001					73.6%	43%	18.3
trim.R96_R1_001					74.8%	44%	23.0
trim.R96_R2_001					74.8%	44%	23.0
trim.R97_R1_001					75.1%	43%	21.5
trim.R97_R2_001					75.1%	43%	21.5
trim.R98_R1_001					73.8%	43%	23.5
trim.R98_R2_001					73.8%	43%	23.5
trim.R99_R1_001					54.7%	43%	5.7
trim.R99_R2_001					54.6%	43%	5.7

Fastp filtering: most reads filtered were due to low quality

Average insert size is 133 bp

You can see the seven samples here with low read counts (<5M)

There is some sequence duplication. Need to determine if this is expected or a problem.

GC content is better now. There are two samples with weird distributions. I’ll look at those in more detail in individual files.

Length is shorter now after adapter removal.

Quality is very high

Next I need to look at the individual raw and trimmed FastQC files for each sample.

Copy the individual fastqc files to my computer.

scp ashuffmyer@ssh3.hac.uri.edu:/data/putnamlab/ashuffmyer/mcap-2023-rnaseq/raw-sequences/raw_fastqc/\*fastqc.html ~/MyProjects/larval_symbiont_TPC/data/rna_seq/QC/fastqc_raw

scp ashuffmyer@ssh3.hac.uri.edu:/data/putnamlab/ashuffmyer/mcap-2023-rnaseq/trimmed-sequences/trimmed_qc_files/\*fastqc.html ~/MyProjects/larval_symbiont_TPC/data/rna_seq/QC/fastqc_trimmed 

Raw FastQC files can be found on GitHub here and trimmed FastQC files can be found on GitHub here.

I’ll next dig into each sample and see what they look like - especially those with low sequence counts and deviations from normal QC results.

Written on February 23, 2024