Download and QC of Moorea 2023 RNAseq files with NCBI upload
Today I downloaded RNAseq files from GenoHub for the 2023 Moorea Symbiotic Exchange project.
Overview
I submitted samples for RNA sequencing to examine differences in thermal responses between lifestages of Pocillopora and Acropora corals from the 2023 Moorea Symbiotic Exchange project.
Here are relevant links to past posts and data for this project:
Here are links to relevant metadata and sample inventories:
- Sequencing sample inventory
- Biological/experimental sample metadata
- Sequencer QC files
- Description of sequencing methods
- Sequencer Qubit data
Data download
UW Hyak
- Logged onto hyak and made a new directory here for raw sequences:
/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences
- Started an interactive job
salloc --partition=ckpt-all --cpus-per-task=1 --mem=10G --time=2:00:00
module load coenv/aws/2.17.32
aws configure
and added log in data from Genohub- Used Genohub commands to access S3 storage bucket
- Downloaded all files from Genohub bucket to
/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences
- The sequences are identified by sample name. Some samples had a second round of sequencing, identified by a different “S###” identifier.
Here is the complete list of files on Genohub’s server.
2025-09-26 15:26:50 202011072 10_S131_R1_001.fastq.gz
2025-09-26 15:26:59 57 10_S131_R1_001.fastq.gz.md5
2025-09-26 15:26:50 201982551 10_S131_R2_001.fastq.gz
2025-09-26 15:26:53 57 10_S131_R2_001.fastq.gz.md5
2025-09-23 09:28:24 653257028 10_S921_R1_001.fastq.gz
2025-09-23 09:28:35 57 10_S921_R1_001.fastq.gz.md5
2025-09-23 09:28:26 650495077 10_S921_R2_001.fastq.gz
2025-09-23 09:28:36 57 10_S921_R2_001.fastq.gz.md5
2025-09-26 15:28:52 1882339016 11_S132_R1_001.fastq.gz
2025-09-26 15:29:00 57 11_S132_R1_001.fastq.gz.md5
2025-09-26 15:28:52 1886049965 11_S132_R2_001.fastq.gz
2025-09-26 15:29:04 57 11_S132_R2_001.fastq.gz.md5
2025-09-23 09:31:35 750780 11_S922_R1_001.fastq.gz
2025-09-23 09:31:39 57 11_S922_R1_001.fastq.gz.md5
2025-09-23 09:31:38 745609 11_S922_R2_001.fastq.gz
2025-09-23 09:31:42 57 11_S922_R2_001.fastq.gz.md5
2025-09-26 15:28:30 266677507 12_S133_R1_001.fastq.gz
2025-09-26 15:28:40 57 12_S133_R1_001.fastq.gz.md5
2025-09-26 15:28:31 267506631 12_S133_R2_001.fastq.gz
2025-09-26 15:28:34 57 12_S133_R2_001.fastq.gz.md5
2025-09-23 09:30:59 878736005 12_S923_R1_001.fastq.gz
2025-09-23 09:31:09 57 12_S923_R1_001.fastq.gz.md5
2025-09-23 09:31:00 874927620 12_S923_R2_001.fastq.gz
2025-09-23 09:31:13 57 12_S923_R2_001.fastq.gz.md5
2025-09-26 15:28:41 266283704 13_S134_R1_001.fastq.gz
2025-09-26 15:28:46 57 13_S134_R1_001.fastq.gz.md5
2025-09-26 15:28:43 265393778 13_S134_R2_001.fastq.gz
2025-09-26 15:28:55 57 13_S134_R2_001.fastq.gz.md5
2025-09-23 09:31:23 863456553 13_S924_R1_001.fastq.gz
2025-09-23 09:31:33 57 13_S924_R1_001.fastq.gz.md5
2025-09-23 09:31:24 857992096 13_S924_R2_001.fastq.gz
2025-09-23 09:31:36 57 13_S924_R2_001.fastq.gz.md5
2025-09-26 15:28:52 356402661 14_S135_R1_001.fastq.gz
2025-09-26 15:29:04 57 14_S135_R1_001.fastq.gz.md5
2025-09-26 15:28:55 353909175 14_S135_R2_001.fastq.gz
2025-09-26 15:29:06 57 14_S135_R2_001.fastq.gz.md5
2025-09-23 09:31:39 1123658157 14_S925_R1_001.fastq.gz
2025-09-23 09:31:48 57 14_S925_R1_001.fastq.gz.md5
2025-09-23 09:31:41 1114020857 14_S925_R2_001.fastq.gz
2025-09-23 09:31:54 57 14_S925_R2_001.fastq.gz.md5
2025-09-26 15:28:49 287539135 15_S136_R1_001.fastq.gz
2025-09-26 15:28:52 57 15_S136_R1_001.fastq.gz.md5
2025-09-26 15:28:49 285499626 15_S136_R2_001.fastq.gz
2025-09-26 15:29:01 57 15_S136_R2_001.fastq.gz.md5
2025-09-23 09:31:27 885160874 15_S926_R1_001.fastq.gz
2025-09-23 09:31:38 57 15_S926_R1_001.fastq.gz.md5
2025-09-23 09:31:32 876632244 15_S926_R2_001.fastq.gz
2025-09-23 09:31:42 57 15_S926_R2_001.fastq.gz.md5
2025-09-26 15:27:49 279202783 16_S137_R1_001.fastq.gz
2025-09-26 15:27:58 57 16_S137_R1_001.fastq.gz.md5
2025-09-26 15:27:49 279103137 16_S137_R2_001.fastq.gz
2025-09-26 15:27:52 57 16_S137_R2_001.fastq.gz.md5
2025-09-23 09:29:53 884175492 16_S927_R1_001.fastq.gz
2025-09-23 09:30:05 57 16_S927_R1_001.fastq.gz.md5
2025-09-23 09:29:56 879388652 16_S927_R2_001.fastq.gz
2025-09-23 09:30:09 57 16_S927_R2_001.fastq.gz.md5
2025-09-26 15:28:46 315521999 17_S138_R1_001.fastq.gz
2025-09-26 15:28:49 57 17_S138_R1_001.fastq.gz.md5
2025-09-26 15:28:46 314173216 17_S138_R2_001.fastq.gz
2025-09-26 15:28:58 57 17_S138_R2_001.fastq.gz.md5
2025-09-23 09:31:24 988629472 17_S928_R1_001.fastq.gz
2025-09-23 09:31:35 57 17_S928_R1_001.fastq.gz.md5
2025-09-23 09:31:28 982460035 17_S928_R2_001.fastq.gz
2025-09-23 09:31:35 57 17_S928_R2_001.fastq.gz.md5
2025-09-26 15:26:18 233732621 18_S139_R1_001.fastq.gz
2025-09-26 15:26:20 57 18_S139_R1_001.fastq.gz.md5
2025-09-26 15:26:23 233198925 18_S139_R2_001.fastq.gz
2025-09-26 15:26:32 57 18_S139_R2_001.fastq.gz.md5
2025-09-23 09:27:31 713013275 18_S929_R1_001.fastq.gz
2025-09-23 09:27:42 57 18_S929_R1_001.fastq.gz.md5
2025-09-23 09:27:43 707972377 18_S929_R2_001.fastq.gz
2025-09-23 09:27:53 57 18_S929_R2_001.fastq.gz.md5
2025-09-26 15:27:30 291295888 19_S140_R1_001.fastq.gz
2025-09-26 15:27:39 57 19_S140_R1_001.fastq.gz.md5
2025-09-26 15:27:31 288288058 19_S140_R2_001.fastq.gz
2025-09-26 15:27:40 57 19_S140_R2_001.fastq.gz.md5
2025-09-23 09:29:36 813380202 19_S930_R1_001.fastq.gz
2025-09-23 09:29:45 57 19_S930_R1_001.fastq.gz.md5
2025-09-23 09:29:37 802567667 19_S930_R2_001.fastq.gz
2025-09-23 09:29:48 57 19_S930_R2_001.fastq.gz.md5
2025-09-26 15:28:22 268090339 1_S122_R1_001.fastq.gz
2025-09-26 15:28:34 56 1_S122_R1_001.fastq.gz.md5
2025-09-26 15:28:25 267398177 1_S122_R2_001.fastq.gz
2025-09-26 15:28:28 56 1_S122_R2_001.fastq.gz.md5
2025-09-23 09:30:44 802810789 1_S912_R1_001.fastq.gz
2025-09-23 09:30:57 56 1_S912_R1_001.fastq.gz.md5
2025-09-23 09:30:47 798365190 1_S912_R2_001.fastq.gz
2025-09-23 09:31:00 56 1_S912_R2_001.fastq.gz.md5
2025-09-26 15:27:55 349562101 20_S141_R1_001.fastq.gz
2025-09-26 15:28:04 57 20_S141_R1_001.fastq.gz.md5
2025-09-26 15:27:58 345927239 20_S141_R2_001.fastq.gz
2025-09-26 15:28:01 57 20_S141_R2_001.fastq.gz.md5
2025-09-23 09:29:59 1040531009 20_S931_R1_001.fastq.gz
2025-09-23 09:30:09 57 20_S931_R1_001.fastq.gz.md5
2025-09-23 09:29:59 1025659742 20_S931_R2_001.fastq.gz
2025-09-23 09:30:09 57 20_S931_R2_001.fastq.gz.md5
2025-09-26 15:27:40 256644872 21_S142_R1_001.fastq.gz
2025-09-26 15:27:44 57 21_S142_R1_001.fastq.gz.md5
2025-09-26 15:27:39 255983049 21_S142_R2_001.fastq.gz
2025-09-26 15:27:51 57 21_S142_R2_001.fastq.gz.md5
2025-09-23 09:29:50 799436215 21_S932_R1_001.fastq.gz
2025-09-23 09:29:59 57 21_S932_R1_001.fastq.gz.md5
2025-09-23 09:29:51 794774614 21_S932_R2_001.fastq.gz
2025-09-23 09:29:59 57 21_S932_R2_001.fastq.gz.md5
2025-09-26 15:27:21 240606837 22_S143_R1_001.fastq.gz
2025-09-26 15:27:23 57 22_S143_R1_001.fastq.gz.md5
2025-09-26 15:27:21 239686313 22_S143_R2_001.fastq.gz
2025-09-26 15:27:29 57 22_S143_R2_001.fastq.gz.md5
2025-09-23 09:29:18 750930588 22_S933_R1_001.fastq.gz
2025-09-23 09:29:31 57 22_S933_R1_001.fastq.gz.md5
2025-09-23 09:29:22 745635127 22_S933_R2_001.fastq.gz
2025-09-23 09:29:33 57 22_S933_R2_001.fastq.gz.md5
2025-09-26 15:28:34 224308588 23_S144_R1_001.fastq.gz
2025-09-26 15:28:36 57 23_S144_R1_001.fastq.gz.md5
2025-09-26 15:28:37 223237425 23_S144_R2_001.fastq.gz
2025-09-26 15:28:41 57 23_S144_R2_001.fastq.gz.md5
2025-09-23 09:31:05 797039603 23_S934_R1_001.fastq.gz
2025-09-23 09:31:18 57 23_S934_R1_001.fastq.gz.md5
2025-09-23 09:31:08 788952748 23_S934_R2_001.fastq.gz
2025-09-23 09:31:26 57 23_S934_R2_001.fastq.gz.md5
2025-09-26 15:28:58 306311416 24_S145_R1_001.fastq.gz
2025-09-26 15:29:07 57 24_S145_R1_001.fastq.gz.md5
2025-09-26 15:28:58 303526222 24_S145_R2_001.fastq.gz
2025-09-26 15:29:01 57 24_S145_R2_001.fastq.gz.md5
2025-09-23 09:31:41 936752300 24_S935_R1_001.fastq.gz
2025-09-23 09:31:53 57 24_S935_R1_001.fastq.gz.md5
2025-09-23 09:31:44 924052120 24_S935_R2_001.fastq.gz
2025-09-23 09:31:54 57 24_S935_R2_001.fastq.gz.md5
2025-09-26 15:27:34 259143906 25_S146_R1_001.fastq.gz
2025-09-26 15:27:43 57 25_S146_R1_001.fastq.gz.md5
2025-09-26 15:27:37 258337178 25_S146_R2_001.fastq.gz
2025-09-26 15:27:39 57 25_S146_R2_001.fastq.gz.md5
2025-09-23 09:29:38 862182039 25_S936_R1_001.fastq.gz
2025-09-23 09:29:47 57 25_S936_R1_001.fastq.gz.md5
2025-09-23 09:29:41 854528149 25_S936_R2_001.fastq.gz
2025-09-23 09:29:50 57 25_S936_R2_001.fastq.gz.md5
2025-09-26 15:28:10 299761772 26_S147_R1_001.fastq.gz
2025-09-26 15:28:13 57 26_S147_R1_001.fastq.gz.md5
2025-09-26 15:28:13 297416514 26_S147_R2_001.fastq.gz
2025-09-26 15:28:21 57 26_S147_R2_001.fastq.gz.md5
2025-09-23 09:30:24 918422035 26_S937_R1_001.fastq.gz
2025-09-23 09:30:36 57 26_S937_R1_001.fastq.gz.md5
2025-09-23 09:30:23 909286653 26_S937_R2_001.fastq.gz
2025-09-23 09:30:36 57 26_S937_R2_001.fastq.gz.md5
2025-09-26 15:27:19 317092560 27_S148_R1_001.fastq.gz
2025-09-26 15:27:28 57 27_S148_R1_001.fastq.gz.md5
2025-09-26 15:27:19 314588951 27_S148_R2_001.fastq.gz
2025-09-26 15:27:31 57 27_S148_R2_001.fastq.gz.md5
2025-09-23 09:29:18 957305963 27_S938_R1_001.fastq.gz
2025-09-23 09:29:27 57 27_S938_R1_001.fastq.gz.md5
2025-09-23 09:29:17 946780102 27_S938_R2_001.fastq.gz
2025-09-23 09:29:31 57 27_S938_R2_001.fastq.gz.md5
2025-09-26 15:26:50 286768653 28_S149_R1_001.fastq.gz
2025-09-26 15:26:59 57 28_S149_R1_001.fastq.gz.md5
2025-09-26 15:26:50 286349169 28_S149_R2_001.fastq.gz
2025-09-26 15:26:58 57 28_S149_R2_001.fastq.gz.md5
2025-09-23 09:28:16 923973131 28_S939_R1_001.fastq.gz
2025-09-23 09:28:23 57 28_S939_R1_001.fastq.gz.md5
2025-09-23 09:28:16 920574071 28_S939_R2_001.fastq.gz
2025-09-23 09:28:24 57 28_S939_R2_001.fastq.gz.md5
2025-09-26 15:26:09 334129360 29_S150_R1_001.fastq.gz
2025-09-26 15:26:09 57 29_S150_R1_001.fastq.gz.md5
2025-09-26 15:26:09 333229683 29_S150_R2_001.fastq.gz
2025-09-26 15:26:09 57 29_S150_R2_001.fastq.gz.md5
2025-09-23 09:27:12 1078548940 29_S940_R1_001.fastq.gz
2025-09-23 09:27:21 57 29_S940_R1_001.fastq.gz.md5
2025-09-23 09:27:12 1072744348 29_S940_R2_001.fastq.gz
2025-09-23 09:27:22 57 29_S940_R2_001.fastq.gz.md5
2025-09-26 15:26:50 312110698 2_S123_R1_001.fastq.gz
2025-09-26 15:26:59 56 2_S123_R1_001.fastq.gz.md5
2025-09-26 15:26:50 309261122 2_S123_R2_001.fastq.gz
2025-09-26 15:26:59 56 2_S123_R2_001.fastq.gz.md5
2025-09-23 09:28:06 960723270 2_S913_R1_001.fastq.gz
2025-09-23 09:28:23 56 2_S913_R1_001.fastq.gz.md5
2025-09-23 09:28:12 949954952 2_S913_R2_001.fastq.gz
2025-09-23 09:28:21 56 2_S913_R2_001.fastq.gz.md5
2025-09-26 15:26:32 631382195 30_S151_R1_001.fastq.gz
2025-09-26 15:26:44 57 30_S151_R1_001.fastq.gz.md5
2025-09-26 15:26:35 631177701 30_S151_R2_001.fastq.gz
2025-09-26 15:26:44 57 30_S151_R2_001.fastq.gz.md5
2025-09-23 09:27:53 1976118285 30_S941_R1_001.fastq.gz
2025-09-23 09:28:02 57 30_S941_R1_001.fastq.gz.md5
2025-09-23 09:27:53 1970809265 30_S941_R2_001.fastq.gz
2025-09-23 09:28:02 57 30_S941_R2_001.fastq.gz.md5
2025-09-26 15:27:09 329272825 31_S152_R1_001.fastq.gz
2025-09-26 15:27:19 57 31_S152_R1_001.fastq.gz.md5
2025-09-26 15:27:09 329234104 31_S152_R2_001.fastq.gz
2025-09-26 15:27:19 57 31_S152_R2_001.fastq.gz.md5
2025-09-23 09:28:48 1055454535 31_S942_R1_001.fastq.gz
2025-09-23 09:29:06 57 31_S942_R1_001.fastq.gz.md5
2025-09-23 09:28:54 1051723014 31_S942_R2_001.fastq.gz
2025-09-23 09:29:05 57 31_S942_R2_001.fastq.gz.md5
2025-09-26 15:28:19 345274632 32_S153_R1_001.fastq.gz
2025-09-26 15:28:28 57 32_S153_R1_001.fastq.gz.md5
2025-09-26 15:28:19 341163227 32_S153_R2_001.fastq.gz
2025-09-26 15:28:31 57 32_S153_R2_001.fastq.gz.md5
2025-09-23 09:30:33 1043423919 32_S943_R1_001.fastq.gz
2025-09-23 09:30:42 57 32_S943_R1_001.fastq.gz.md5
2025-09-23 09:30:35 1026463015 32_S943_R2_001.fastq.gz
2025-09-23 09:30:48 57 32_S943_R2_001.fastq.gz.md5
2025-09-26 15:26:44 290139935 33_S154_R1_001.fastq.gz
2025-09-26 15:26:47 57 33_S154_R1_001.fastq.gz.md5
2025-09-26 15:26:47 288989963 33_S154_R2_001.fastq.gz
2025-09-26 15:26:58 57 33_S154_R2_001.fastq.gz.md5
2025-09-23 09:28:02 914358842 33_S944_R1_001.fastq.gz
2025-09-23 09:28:16 57 33_S944_R1_001.fastq.gz.md5
2025-09-23 09:28:07 909381908 33_S944_R2_001.fastq.gz
2025-09-23 09:28:16 57 33_S944_R2_001.fastq.gz.md5
2025-09-26 15:28:28 276051752 34_S155_R1_001.fastq.gz
2025-09-26 15:28:31 57 34_S155_R1_001.fastq.gz.md5
2025-09-26 15:28:28 274356940 34_S155_R2_001.fastq.gz
2025-09-26 15:28:41 57 34_S155_R2_001.fastq.gz.md5
2025-09-23 09:30:48 822290222 34_S945_R1_001.fastq.gz
2025-09-23 09:30:59 57 34_S945_R1_001.fastq.gz.md5
2025-09-23 09:30:57 815663538 34_S945_R2_001.fastq.gz
2025-09-23 09:31:08 57 34_S945_R2_001.fastq.gz.md5
2025-09-26 15:27:40 286374528 35_S156_R1_001.fastq.gz
2025-09-26 15:27:49 57 35_S156_R1_001.fastq.gz.md5
2025-09-26 15:27:40 284795688 35_S156_R2_001.fastq.gz
2025-09-26 15:27:49 57 35_S156_R2_001.fastq.gz.md5
2025-09-23 09:29:44 879499541 35_S946_R1_001.fastq.gz
2025-09-23 09:29:56 57 35_S946_R1_001.fastq.gz.md5
2025-09-23 09:29:46 873601324 35_S946_R2_001.fastq.gz
2025-09-23 09:29:57 57 35_S946_R2_001.fastq.gz.md5
2025-09-26 15:26:38 372766750 36_S157_R1_001.fastq.gz
2025-09-26 15:26:41 57 36_S157_R1_001.fastq.gz.md5
2025-09-26 15:26:41 370882226 36_S157_R2_001.fastq.gz
2025-09-26 15:26:50 57 36_S157_R2_001.fastq.gz.md5
2025-09-23 09:27:53 1087001803 36_S947_R1_001.fastq.gz
2025-09-23 09:28:03 57 36_S947_R1_001.fastq.gz.md5
2025-09-23 09:27:53 1077963884 36_S947_R2_001.fastq.gz
2025-09-23 09:28:06 57 36_S947_R2_001.fastq.gz.md5
2025-09-26 15:28:04 285625695 37_S158_R1_001.fastq.gz
2025-09-26 15:28:07 57 37_S158_R1_001.fastq.gz.md5
2025-09-26 15:28:07 282292944 37_S158_R2_001.fastq.gz
2025-09-26 15:28:10 57 37_S158_R2_001.fastq.gz.md5
2025-09-23 09:30:11 831669283 37_S948_R1_001.fastq.gz
2025-09-23 09:30:24 57 37_S948_R1_001.fastq.gz.md5
2025-09-23 09:30:14 821392226 37_S948_R2_001.fastq.gz
2025-09-23 09:30:24 57 37_S948_R2_001.fastq.gz.md5
2025-09-26 15:27:40 1887207722 38_S159_R1_001.fastq.gz
2025-09-26 15:27:49 57 38_S159_R1_001.fastq.gz.md5
2025-09-26 15:27:39 1885030462 38_S159_R2_001.fastq.gz
2025-09-26 15:27:52 57 38_S159_R2_001.fastq.gz.md5
2025-09-23 09:29:47 746 38_S949_R1_001.fastq.gz
2025-09-23 09:29:50 57 38_S949_R1_001.fastq.gz.md5
2025-09-23 09:29:47 724 38_S949_R2_001.fastq.gz
2025-09-23 09:29:50 57 38_S949_R2_001.fastq.gz.md5
2025-09-26 15:27:29 203004182 39_S160_R1_001.fastq.gz
2025-09-26 15:27:30 57 39_S160_R1_001.fastq.gz.md5
2025-09-26 15:27:28 202741281 39_S160_R2_001.fastq.gz
2025-09-26 15:27:39 57 39_S160_R2_001.fastq.gz.md5
2025-09-23 09:29:30 732953151 39_S950_R1_001.fastq.gz
2025-09-23 09:29:44 57 39_S950_R1_001.fastq.gz.md5
2025-09-23 09:29:33 730396508 39_S950_R2_001.fastq.gz
2025-09-23 09:29:46 57 39_S950_R2_001.fastq.gz.md5
2025-09-26 15:26:09 300231191 3_S124_R1_001.fastq.gz
2025-09-26 15:26:09 56 3_S124_R1_001.fastq.gz.md5
2025-09-26 15:26:09 297343002 3_S124_R2_001.fastq.gz
2025-09-26 15:26:09 56 3_S124_R2_001.fastq.gz.md5
2025-09-23 09:27:12 871154353 3_S914_R1_001.fastq.gz
2025-09-23 09:27:21 56 3_S914_R1_001.fastq.gz.md5
2025-09-23 09:27:12 861099928 3_S914_R2_001.fastq.gz
2025-09-23 09:27:22 56 3_S914_R2_001.fastq.gz.md5
2025-09-26 15:28:01 323927035 40_S161_R1_001.fastq.gz
2025-09-26 15:28:10 57 40_S161_R1_001.fastq.gz.md5
2025-09-26 15:28:01 321904613 40_S161_R2_001.fastq.gz
2025-09-26 15:28:13 57 40_S161_R2_001.fastq.gz.md5
2025-09-23 09:30:05 959725567 40_S951_R1_001.fastq.gz
2025-09-23 09:30:18 57 40_S951_R1_001.fastq.gz.md5
2025-09-23 09:30:06 952033449 40_S951_R2_001.fastq.gz
2025-09-23 09:30:21 57 40_S951_R2_001.fastq.gz.md5
2025-09-26 15:26:18 253011608 41_S162_R1_001.fastq.gz
2025-09-26 15:26:20 57 41_S162_R1_001.fastq.gz.md5
2025-09-26 15:26:18 252117752 41_S162_R2_001.fastq.gz
2025-09-26 15:26:20 57 41_S162_R2_001.fastq.gz.md5
2025-09-23 09:27:31 779194290 41_S952_R1_001.fastq.gz
2025-09-23 09:27:42 57 41_S952_R1_001.fastq.gz.md5
2025-09-23 09:27:31 774392844 41_S952_R2_001.fastq.gz
2025-09-23 09:27:42 57 41_S952_R2_001.fastq.gz.md5
2025-09-26 15:29:01 362284115 42_S163_R1_001.fastq.gz
2025-09-26 15:29:13 57 42_S163_R1_001.fastq.gz.md5
2025-09-26 15:29:04 360602460 42_S163_R2_001.fastq.gz
2025-09-26 15:29:06 57 42_S163_R2_001.fastq.gz.md5
2025-09-23 09:31:48 1138848440 42_S953_R1_001.fastq.gz
2025-09-23 09:32:00 57 42_S953_R1_001.fastq.gz.md5
2025-09-23 09:31:50 1129654684 42_S953_R2_001.fastq.gz
2025-09-23 09:32:07 57 42_S953_R2_001.fastq.gz.md5
2025-09-26 15:27:02 312427427 43_S164_R1_001.fastq.gz
2025-09-26 15:27:14 57 43_S164_R1_001.fastq.gz.md5
2025-09-26 15:27:05 311959430 43_S164_R2_001.fastq.gz
2025-09-26 15:27:16 57 43_S164_R2_001.fastq.gz.md5
2025-09-23 09:28:39 922783741 43_S954_R1_001.fastq.gz
2025-09-23 09:28:53 57 43_S954_R1_001.fastq.gz.md5
2025-09-23 09:28:45 920414635 43_S954_R2_001.fastq.gz
2025-09-23 09:28:59 57 43_S954_R2_001.fastq.gz.md5
2025-09-26 15:27:29 288444293 44_S165_R1_001.fastq.gz
2025-09-26 15:27:39 57 44_S165_R1_001.fastq.gz.md5
2025-09-26 15:27:28 286181750 44_S165_R2_001.fastq.gz
2025-09-26 15:27:39 57 44_S165_R2_001.fastq.gz.md5
2025-09-23 09:29:24 876365211 44_S955_R1_001.fastq.gz
2025-09-23 09:29:36 57 44_S955_R1_001.fastq.gz.md5
2025-09-23 09:29:31 867795464 44_S955_R2_001.fastq.gz
2025-09-23 09:29:39 57 44_S955_R2_001.fastq.gz.md5
2025-09-26 15:28:21 292987120 45_S166_R1_001.fastq.gz
2025-09-26 15:28:24 57 45_S166_R1_001.fastq.gz.md5
2025-09-26 15:28:22 292087105 45_S166_R2_001.fastq.gz
2025-09-26 15:28:24 57 45_S166_R2_001.fastq.gz.md5
2025-09-23 09:30:41 899719347 45_S956_R1_001.fastq.gz
2025-09-23 09:30:54 57 45_S956_R1_001.fastq.gz.md5
2025-09-23 09:30:44 894983870 45_S956_R2_001.fastq.gz
2025-09-23 09:30:53 57 45_S956_R2_001.fastq.gz.md5
2025-09-26 15:26:09 270808221 46_S167_R1_001.fastq.gz
2025-09-26 15:26:09 57 46_S167_R1_001.fastq.gz.md5
2025-09-26 15:26:09 270144351 46_S167_R2_001.fastq.gz
2025-09-26 15:26:09 57 46_S167_R2_001.fastq.gz.md5
2025-09-23 09:27:12 854634298 46_S957_R1_001.fastq.gz
2025-09-23 09:27:25 57 46_S957_R1_001.fastq.gz.md5
2025-09-23 09:27:12 850548721 46_S957_R2_001.fastq.gz
2025-09-23 09:27:21 57 46_S957_R2_001.fastq.gz.md5
2025-09-26 15:27:57 329397321 47_S168_R1_001.fastq.gz
2025-09-26 15:28:01 57 47_S168_R1_001.fastq.gz.md5
2025-09-26 15:27:57 328855813 47_S168_R2_001.fastq.gz
2025-09-26 15:28:10 57 47_S168_R2_001.fastq.gz.md5
2025-09-23 09:30:03 1010118838 47_S958_R1_001.fastq.gz
2025-09-23 09:30:12 57 47_S958_R1_001.fastq.gz.md5
2025-09-23 09:30:03 1005423262 47_S958_R2_001.fastq.gz
2025-09-23 09:30:15 57 47_S958_R2_001.fastq.gz.md5
2025-09-26 15:28:21 369327078 48_S169_R1_001.fastq.gz
2025-09-26 15:28:30 57 48_S169_R1_001.fastq.gz.md5
2025-09-26 15:28:21 366096713 48_S169_R2_001.fastq.gz
2025-09-26 15:28:31 57 48_S169_R2_001.fastq.gz.md5
2025-09-23 09:30:42 1161330687 48_S959_R1_001.fastq.gz
2025-09-23 09:30:50 57 48_S959_R1_001.fastq.gz.md5
2025-09-23 09:30:41 1147526684 48_S959_R2_001.fastq.gz
2025-09-23 09:30:53 57 48_S959_R2_001.fastq.gz.md5
2025-09-26 15:27:19 271963220 49_S170_R1_001.fastq.gz
2025-09-26 15:27:28 57 49_S170_R1_001.fastq.gz.md5
2025-09-26 15:27:19 270444508 49_S170_R2_001.fastq.gz
2025-09-26 15:27:29 57 49_S170_R2_001.fastq.gz.md5
2025-09-23 09:29:15 817728308 49_S960_R1_001.fastq.gz
2025-09-23 09:29:24 57 49_S960_R1_001.fastq.gz.md5
2025-09-23 09:29:18 811658705 49_S960_R2_001.fastq.gz
2025-09-23 09:29:27 57 49_S960_R2_001.fastq.gz.md5
2025-09-26 15:28:13 368146858 4_S125_R1_001.fastq.gz
2025-09-26 15:28:22 56 4_S125_R1_001.fastq.gz.md5
2025-09-26 15:28:16 365405091 4_S125_R2_001.fastq.gz
2025-09-26 15:28:19 56 4_S125_R2_001.fastq.gz.md5
2025-09-23 09:30:26 1102879277 4_S915_R1_001.fastq.gz
2025-09-23 09:30:39 56 4_S915_R1_001.fastq.gz.md5
2025-09-23 09:30:29 1090096121 4_S915_R2_001.fastq.gz
2025-09-23 09:30:41 56 4_S915_R2_001.fastq.gz.md5
2025-09-26 15:27:09 294580496 50_S171_R1_001.fastq.gz
2025-09-26 15:27:21 57 50_S171_R1_001.fastq.gz.md5
2025-09-26 15:27:11 293493398 50_S171_R2_001.fastq.gz
2025-09-26 15:27:23 57 50_S171_R2_001.fastq.gz.md5
2025-09-23 09:28:56 942227203 50_S961_R1_001.fastq.gz
2025-09-23 09:29:05 57 50_S961_R1_001.fastq.gz.md5
2025-09-23 09:28:56 936688078 50_S961_R2_001.fastq.gz
2025-09-23 09:29:14 57 50_S961_R2_001.fastq.gz.md5
2025-09-26 15:26:59 294530870 51_S172_R1_001.fastq.gz
2025-09-26 15:27:11 57 51_S172_R1_001.fastq.gz.md5
2025-09-26 15:27:02 292906744 51_S172_R2_001.fastq.gz
2025-09-26 15:27:05 57 51_S172_R2_001.fastq.gz.md5
2025-09-23 09:28:36 866464210 51_S962_R1_001.fastq.gz
2025-09-23 09:28:46 57 51_S962_R1_001.fastq.gz.md5
2025-09-23 09:28:35 859915939 51_S962_R2_001.fastq.gz
2025-09-23 09:28:45 57 51_S962_R2_001.fastq.gz.md5
2025-09-26 15:26:18 287786913 52_S173_R1_001.fastq.gz
2025-09-26 15:26:20 57 52_S173_R1_001.fastq.gz.md5
2025-09-26 15:26:18 287061007 52_S173_R2_001.fastq.gz
2025-09-26 15:26:20 57 52_S173_R2_001.fastq.gz.md5
2025-09-23 09:27:32 819250901 52_S963_R1_001.fastq.gz
2025-09-23 09:27:42 57 52_S963_R1_001.fastq.gz.md5
2025-09-23 09:27:31 815599341 52_S963_R2_001.fastq.gz
2025-09-23 09:27:42 57 52_S963_R2_001.fastq.gz.md5
2025-09-26 15:26:23 332130478 53_S174_R1_001.fastq.gz
2025-09-26 15:26:32 57 53_S174_R1_001.fastq.gz.md5
2025-09-26 15:26:29 327414234 53_S174_R2_001.fastq.gz
2025-09-26 15:26:38 57 53_S174_R2_001.fastq.gz.md5
2025-09-23 09:27:54 984470057 53_S964_R1_001.fastq.gz
2025-09-23 09:28:02 57 53_S964_R1_001.fastq.gz.md5
2025-09-23 09:27:54 968782918 53_S964_R2_001.fastq.gz
2025-09-23 09:28:02 57 53_S964_R2_001.fastq.gz.md5
2025-09-26 15:27:44 1765868056 54_S175_R1_001.fastq.gz
2025-09-26 15:27:52 57 54_S175_R1_001.fastq.gz.md5
2025-09-26 15:27:44 1759719215 54_S175_R2_001.fastq.gz
2025-09-26 15:27:57 57 54_S175_R2_001.fastq.gz.md5
2025-09-26 15:27:14 319809749 55_S176_R1_001.fastq.gz
2025-09-26 15:27:16 57 55_S176_R1_001.fastq.gz.md5
2025-09-26 15:27:16 318254839 55_S176_R2_001.fastq.gz
2025-09-26 15:27:29 57 55_S176_R2_001.fastq.gz.md5
2025-09-23 09:29:06 982958681 55_S966_R1_001.fastq.gz
2025-09-23 09:29:17 57 55_S966_R1_001.fastq.gz.md5
2025-09-23 09:29:08 975883160 55_S966_R2_001.fastq.gz
2025-09-23 09:29:24 57 55_S966_R2_001.fastq.gz.md5
2025-09-26 15:26:29 290253182 56_S177_R1_001.fastq.gz
2025-09-26 15:26:38 57 56_S177_R1_001.fastq.gz.md5
2025-09-26 15:26:29 290296568 56_S177_R2_001.fastq.gz
2025-09-26 15:26:41 57 56_S177_R2_001.fastq.gz.md5
2025-09-23 09:27:53 917489787 56_S967_R1_001.fastq.gz
2025-09-23 09:28:02 57 56_S967_R1_001.fastq.gz.md5
2025-09-23 09:27:53 916184588 56_S967_R2_001.fastq.gz
2025-09-23 09:28:02 57 56_S967_R2_001.fastq.gz.md5
2025-09-26 15:26:50 318364886 57_S178_R1_001.fastq.gz
2025-09-26 15:27:02 57 57_S178_R1_001.fastq.gz.md5
2025-09-26 15:26:53 317691683 57_S178_R2_001.fastq.gz
2025-09-26 15:27:02 57 57_S178_R2_001.fastq.gz.md5
2025-09-23 09:28:33 1019196290 57_S968_R1_001.fastq.gz
2025-09-23 09:28:44 57 57_S968_R1_001.fastq.gz.md5
2025-09-23 09:28:35 1013994907 57_S968_R2_001.fastq.gz
2025-09-23 09:28:44 57 57_S968_R2_001.fastq.gz.md5
2025-09-26 15:28:37 502643900 58_S179_R1_001.fastq.gz
2025-09-26 15:28:49 57 58_S179_R1_001.fastq.gz.md5
2025-09-26 15:28:41 504007280 58_S179_R2_001.fastq.gz
2025-09-26 15:28:52 57 58_S179_R2_001.fastq.gz.md5
2025-09-23 09:31:14 1544192904 58_S969_R1_001.fastq.gz
2025-09-23 09:31:23 57 58_S969_R1_001.fastq.gz.md5
2025-09-23 09:31:18 1533897578 58_S969_R2_001.fastq.gz
2025-09-23 09:31:27 57 58_S969_R2_001.fastq.gz.md5
2025-09-26 15:29:07 290857999 59_S180_R1_001.fastq.gz
2025-09-26 15:29:10 57 59_S180_R1_001.fastq.gz.md5
2025-09-26 15:29:06 287868541 59_S180_R2_001.fastq.gz
2025-09-26 15:29:19 57 59_S180_R2_001.fastq.gz.md5
2025-09-23 09:31:57 873067476 59_S970_R1_001.fastq.gz
2025-09-23 09:32:07 57 59_S970_R1_001.fastq.gz.md5
2025-09-23 09:31:57 862418652 59_S970_R2_001.fastq.gz
2025-09-23 09:32:09 57 59_S970_R2_001.fastq.gz.md5
2025-09-26 15:26:56 341306791 5_S126_R1_001.fastq.gz
2025-09-26 15:26:58 56 5_S126_R1_001.fastq.gz.md5
2025-09-26 15:26:58 339970740 5_S126_R2_001.fastq.gz
2025-09-26 15:27:09 56 5_S126_R2_001.fastq.gz.md5
2025-09-23 09:28:36 960779525 5_S916_R1_001.fastq.gz
2025-09-23 09:28:44 56 5_S916_R1_001.fastq.gz.md5
2025-09-23 09:28:35 955164543 5_S916_R2_001.fastq.gz
2025-09-23 09:28:45 56 5_S916_R2_001.fastq.gz.md5
2025-09-26 15:28:10 275424908 60_S181_R1_001.fastq.gz
2025-09-26 15:28:19 57 60_S181_R1_001.fastq.gz.md5
2025-09-26 15:28:10 274786130 60_S181_R2_001.fastq.gz
2025-09-26 15:28:13 57 60_S181_R2_001.fastq.gz.md5
2025-09-23 09:30:17 878885544 60_S971_R1_001.fastq.gz
2025-09-23 09:30:30 57 60_S971_R1_001.fastq.gz.md5
2025-09-23 09:30:21 874297882 60_S971_R2_001.fastq.gz
2025-09-23 09:30:30 57 60_S971_R2_001.fastq.gz.md5
2025-09-26 15:27:09 129017626 61_S182_R1_001.fastq.gz
2025-09-26 15:27:11 57 61_S182_R1_001.fastq.gz.md5
2025-09-26 15:27:09 128728252 61_S182_R2_001.fastq.gz
2025-09-26 15:27:11 57 61_S182_R2_001.fastq.gz.md5
2025-09-23 09:28:53 398343545 61_S972_R1_001.fastq.gz
2025-09-23 09:29:05 57 61_S972_R1_001.fastq.gz.md5
2025-09-23 09:28:56 395952830 61_S972_R2_001.fastq.gz
2025-09-23 09:29:05 57 61_S972_R2_001.fastq.gz.md5
2025-09-26 15:29:01 301018226 62_S183_R1_001.fastq.gz
2025-09-26 15:29:09 57 62_S183_R1_001.fastq.gz.md5
2025-09-26 15:29:01 299125684 62_S183_R2_001.fastq.gz
2025-09-26 15:29:10 57 62_S183_R2_001.fastq.gz.md5
2025-09-23 09:31:44 910832277 62_S973_R1_001.fastq.gz
2025-09-23 09:31:53 57 62_S973_R1_001.fastq.gz.md5
2025-09-23 09:31:48 902094943 62_S973_R2_001.fastq.gz
2025-09-23 09:32:00 57 62_S973_R2_001.fastq.gz.md5
2025-09-26 15:28:41 250580530 63_S184_R1_001.fastq.gz
2025-09-26 15:28:43 57 63_S184_R1_001.fastq.gz.md5
2025-09-26 15:28:41 250170713 63_S184_R2_001.fastq.gz
2025-09-26 15:28:43 57 63_S184_R2_001.fastq.gz.md5
2025-09-23 09:31:17 799150275 63_S974_R1_001.fastq.gz
2025-09-23 09:31:28 57 63_S974_R1_001.fastq.gz.md5
2025-09-23 09:31:17 795282825 63_S974_R2_001.fastq.gz
2025-09-23 09:31:29 57 63_S974_R2_001.fastq.gz.md5
2025-09-26 15:29:10 293025625 64_S185_R1_001.fastq.gz
2025-09-26 15:29:13 57 64_S185_R1_001.fastq.gz.md5
2025-09-26 15:29:09 291386174 64_S185_R2_001.fastq.gz
2025-09-26 15:29:21 57 64_S185_R2_001.fastq.gz.md5
2025-09-23 09:32:00 916698527 64_S975_R1_001.fastq.gz
2025-09-23 09:32:11 57 64_S975_R1_001.fastq.gz.md5
2025-09-23 09:32:03 909202203 64_S975_R2_001.fastq.gz
2025-09-23 09:32:11 57 64_S975_R2_001.fastq.gz.md5
2025-09-26 15:26:10 320509250 6_S127_R1_001.fastq.gz
2025-09-26 15:26:09 56 6_S127_R1_001.fastq.gz.md5
2025-09-26 15:26:09 320058676 6_S127_R2_001.fastq.gz
2025-09-26 15:26:09 56 6_S127_R2_001.fastq.gz.md5
2025-09-23 09:27:12 968902635 6_S917_R1_001.fastq.gz
2025-09-23 09:27:23 56 6_S917_R1_001.fastq.gz.md5
2025-09-23 09:27:12 965482495 6_S917_R2_001.fastq.gz
2025-09-23 09:27:21 56 6_S917_R2_001.fastq.gz.md5
2025-09-26 15:28:30 325918787 7_S128_R1_001.fastq.gz
2025-09-26 15:28:43 56 7_S128_R1_001.fastq.gz.md5
2025-09-26 15:28:34 325573961 7_S128_R2_001.fastq.gz
2025-09-26 15:28:37 56 7_S128_R2_001.fastq.gz.md5
2025-09-23 09:31:04 997026820 7_S918_R1_001.fastq.gz
2025-09-23 09:31:15 56 7_S918_R1_001.fastq.gz.md5
2025-09-23 09:31:03 994968921 7_S918_R2_001.fastq.gz
2025-09-23 09:31:17 56 7_S918_R2_001.fastq.gz.md5
2025-09-26 15:26:09 332323651 8_S129_R1_001.fastq.gz
2025-09-26 15:26:09 56 8_S129_R1_001.fastq.gz.md5
2025-09-26 15:26:09 333288034 8_S129_R2_001.fastq.gz
2025-09-26 15:26:09 56 8_S129_R2_001.fastq.gz.md5
2025-09-23 09:27:12 1070272911 8_S919_R1_001.fastq.gz
2025-09-23 09:27:21 56 8_S919_R1_001.fastq.gz.md5
2025-09-23 09:27:12 1068752075 8_S919_R2_001.fastq.gz
2025-09-23 09:27:21 56 8_S919_R2_001.fastq.gz.md5
2025-09-26 15:27:52 410127602 9_S130_R1_001.fastq.gz
2025-09-26 15:28:01 56 9_S130_R1_001.fastq.gz.md5
2025-09-26 15:27:52 410076073 9_S130_R2_001.fastq.gz
2025-09-26 15:27:55 56 9_S130_R2_001.fastq.gz.md5
2025-09-23 09:29:59 1174407278 9_S920_R1_001.fastq.gz
2025-09-23 09:30:10 56 9_S920_R1_001.fastq.gz.md5
2025-09-23 09:29:59 1171948638 9_S920_R2_001.fastq.gz
2025-09-23 09:30:09 56 9_S920_R2_001.fastq.gz.md5
We will concatenate all files for each sample, removing those that are flagged for issues in QC.
The files are:
find . -type f | wc -l
508 files (64 samples * 2 read directions * 2 file types = 256 * 2 sequencing runs = 512 files (one sample - Sample 54 - didn’t need resequencing, so there are 4 fewer files)
- I then concatenated all .md5 files made by the sequencer and made my own .md5 file
cat *.gz.md5 > genohub_original_checksums.md5
md5sum *.fastq.gz > checkmd5_20250929.md5
cmp genohub_original_checksums.md5 checkmd5_20250929.md5
All files match check sums, transfer completed successfully.
QC on raw sequences
cd /gscratch/srlab/ashuff/moorea-2023-rnaseq
mkdir scripts
cd scripts
nano qc_raw.sh
#!/bin/bash
#SBATCH --job-name=rawQC
#SBATCH --mail-type=ALL
#SBATCH --mail-user=ashuff@uw.edu
#SBATCH --account=srlab
#SBATCH --partition=cpu-g2-mem2x
#SBATCH --nodes=1
#SBATCH --mem=250G
#SBATCH --time=01-12:00:00 # Max runtime in DD-HH:MM:SS format.
#SBATCH --chdir=/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences
#SBATCH --export=all
#SBATCH -o raw-qc-%j.out
#SBATCH -e raw-qc-%j.error
# load modules needed
module load kawaldorflab/fastqc/0.12.1
module load kawaldorflab/multiqc/1.15
# fastqc of raw reads
fastqc *.fastq.gz
#generate multiqc report
multiqc ./ --filename multiqc_report_raw.html
echo "Raw MultiQC report generated." $(date)
Run the script.
sbatch qc_raw.sh
Submitted as job 29898292 at 20:30 on 29 September, took about 12 hours.
Copy the MultiQC HTML to my computer.
scp ashuff@klone.hyak.uw.edu:/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences/multiqc_report_raw.html ~/MyProjects/moorea_symbiotic_exchange_2023/data/rna/QC
#copy single fastQC for troubleshooting
scp ashuff@klone.hyak.uw.edu:/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences/raw_fastqc/11_S922_R1_001_fastqc.html ~/MyProjects/moorea_symbiotic_exchange_2023/data/rna/QC
scp ashuff@klone.hyak.uw.edu:/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences/raw_fastqc/38_S949_R1_001_fastqc.html ~/MyProjects/moorea_symbiotic_exchange_2023/data/rna/QC
Move .fastqc files to a QC folder to keep folder organized.
mkdir raw_fastqc
mv *fastqc.html raw_fastqc
mv *fastqc.zip raw_fastqc
mv multiqc* raw_fastqc
The raw sequence MultiQC report can be found on GitHub here.
Here are some things I noticed from the raw QC:
- Samples did need a second round of sequencing to reach ~15M total reads or more. This is not surprising given lower than ideal RIN and concentrations of RNA from QC reports. There does not appear to be any pattern by species or lifestage.
Sample Name | % Dups | % GC | M Seqs | Species | Lifestage |
---|---|---|---|---|---|
10_S131_R1_001 | 41.00% | 43% | 3 | Pocillopora | Larvae |
10_S131_R2_001 | 37.40% | 43% | 3 | Pocillopora | Larvae |
10_S921_R1_001 | 50.00% | 43% | 9.1 | Pocillopora | Larvae |
10_S921_R2_001 | 46.00% | 44% | 9.1 | Pocillopora | Larvae |
11_S132_R1_001 | 58.30% | 43% | 27.9 | Pocillopora | Recruit |
11_S132_R2_001 | 53.90% | 43% | 27.9 | Pocillopora | Recruit |
11_S922_R1_001 | 5.20% | 43% | 0 | Pocillopora | Recruit |
11_S922_R2_001 | 4.50% | 43% | 0 | Pocillopora | Recruit |
12_S133_R1_001 | 62.20% | 44% | 3.9 | Acropora | Larvae |
12_S133_R2_001 | 58.30% | 45% | 3.9 | Acropora | Larvae |
12_S923_R1_001 | 76.00% | 44% | 12.3 | Acropora | Larvae |
12_S923_R2_001 | 73.20% | 45% | 12.3 | Acropora | Larvae |
13_S134_R1_001 | 32.30% | 42% | 3.9 | Pocillopora | Larvae |
13_S134_R2_001 | 29.40% | 43% | 3.9 | Pocillopora | Larvae |
13_S924_R1_001 | 37.80% | 42% | 11.9 | Pocillopora | Larvae |
13_S924_R2_001 | 34.40% | 43% | 11.9 | Pocillopora | Larvae |
14_S135_R1_001 | 40.90% | 43% | 5.2 | Pocillopora | Recruit |
14_S135_R2_001 | 36.00% | 43% | 5.2 | Pocillopora | Recruit |
14_S925_R1_001 | 48.90% | 43% | 15.5 | Pocillopora | Recruit |
14_S925_R2_001 | 43.90% | 44% | 15.5 | Pocillopora | Recruit |
15_S136_R1_001 | 40.20% | 42% | 4.2 | Pocillopora | Larvae |
15_S136_R2_001 | 36.50% | 43% | 4.2 | Pocillopora | Larvae |
15_S926_R1_001 | 48.30% | 43% | 12.2 | Pocillopora | Larvae |
15_S926_R2_001 | 44.00% | 43% | 12.2 | Pocillopora | Larvae |
16_S137_R1_001 | 44.80% | 42% | 4.1 | Pocillopora | Larvae |
16_S137_R2_001 | 41.10% | 42% | 4.1 | Pocillopora | Larvae |
16_S927_R1_001 | 54.60% | 42% | 12.2 | Pocillopora | Larvae |
16_S927_R2_001 | 50.90% | 43% | 12.2 | Pocillopora | Larvae |
17_S138_R1_001 | 39.30% | 41% | 4.7 | Pocillopora | Recruit |
17_S138_R2_001 | 34.30% | 42% | 4.7 | Pocillopora | Recruit |
17_S928_R1_001 | 47.40% | 42% | 13.6 | Pocillopora | Recruit |
17_S928_R2_001 | 42.40% | 42% | 13.6 | Pocillopora | Recruit |
18_S139_R1_001 | 46.20% | 42% | 3.4 | Acropora | Larvae |
18_S139_R2_001 | 42.80% | 42% | 3.4 | Acropora | Larvae |
18_S929_R1_001 | 56.50% | 42% | 9.7 | Acropora | Larvae |
18_S929_R2_001 | 53.40% | 43% | 9.7 | Acropora | Larvae |
19_S140_R1_001 | 36.80% | 43% | 4.2 | Pocillopora | Larvae |
19_S140_R2_001 | 33.80% | 43% | 4.2 | Pocillopora | Larvae |
19_S930_R1_001 | 43.50% | 43% | 11 | Pocillopora | Larvae |
19_S930_R2_001 | 39.80% | 44% | 11 | Pocillopora | Larvae |
1_S122_R1_001 | 42.40% | 41% | 4 | Acropora | Larvae |
1_S122_R2_001 | 38.40% | 42% | 4 | Acropora | Larvae |
1_S912_R1_001 | 50.50% | 42% | 11 | Acropora | Larvae |
1_S912_R2_001 | 46.60% | 43% | 11 | Acropora | Larvae |
20_S141_R1_001 | 42.00% | 42% | 5 | Acropora | Larvae |
20_S141_R2_001 | 37.80% | 43% | 5 | Acropora | Larvae |
20_S931_R1_001 | 50.00% | 42% | 14.1 | Acropora | Larvae |
20_S931_R2_001 | 46.00% | 43% | 14.1 | Acropora | Larvae |
21_S142_R1_001 | 40.60% | 42% | 3.8 | Acropora | Larvae |
21_S142_R2_001 | 36.60% | 42% | 3.8 | Acropora | Larvae |
21_S932_R1_001 | 48.20% | 42% | 11 | Acropora | Larvae |
21_S932_R2_001 | 44.20% | 43% | 11 | Acropora | Larvae |
22_S143_R1_001 | 41.60% | 42% | 3.5 | Pocillopora | Larvae |
22_S143_R2_001 | 38.80% | 43% | 3.5 | Pocillopora | Larvae |
22_S933_R1_001 | 49.90% | 43% | 10.3 | Pocillopora | Larvae |
22_S933_R2_001 | 46.80% | 43% | 10.3 | Pocillopora | Larvae |
23_S144_R1_001 | 34.80% | 42% | 3.2 | Acropora | Recruit |
23_S144_R2_001 | 29.90% | 43% | 3.2 | Acropora | Recruit |
23_S934_R1_001 | 44.90% | 43% | 10.9 | Acropora | Recruit |
23_S934_R2_001 | 39.80% | 44% | 10.9 | Acropora | Recruit |
24_S145_R1_001 | 40.90% | 41% | 4.5 | Acropora | Larvae |
24_S145_R2_001 | 37.90% | 42% | 4.5 | Acropora | Larvae |
24_S935_R1_001 | 49.60% | 42% | 12.8 | Acropora | Larvae |
24_S935_R2_001 | 46.40% | 42% | 12.8 | Acropora | Larvae |
25_S146_R1_001 | 46.40% | 42% | 3.8 | Pocillopora | Recruit |
25_S146_R2_001 | 42.10% | 43% | 3.8 | Pocillopora | Recruit |
25_S936_R1_001 | 59.50% | 43% | 11.9 | Pocillopora | Recruit |
25_S936_R2_001 | 55.90% | 44% | 11.9 | Pocillopora | Recruit |
26_S147_R1_001 | 38.00% | 42% | 4.4 | Pocillopora | Larvae |
26_S147_R2_001 | 34.70% | 43% | 4.4 | Pocillopora | Larvae |
26_S937_R1_001 | 46.20% | 43% | 12.5 | Pocillopora | Larvae |
26_S937_R2_001 | 42.40% | 43% | 12.5 | Pocillopora | Larvae |
27_S148_R1_001 | 38.20% | 42% | 4.6 | Acropora | Recruit |
27_S148_R2_001 | 33.80% | 42% | 4.6 | Acropora | Recruit |
27_S938_R1_001 | 46.50% | 42% | 13.1 | Acropora | Recruit |
27_S938_R2_001 | 42.30% | 43% | 13.1 | Acropora | Recruit |
28_S149_R1_001 | 34.90% | 43% | 4.2 | Acropora | Recruit |
28_S149_R2_001 | 30.30% | 43% | 4.2 | Acropora | Recruit |
28_S939_R1_001 | 44.00% | 43% | 12.7 | Acropora | Recruit |
28_S939_R2_001 | 39.40% | 44% | 12.7 | Acropora | Recruit |
29_S150_R1_001 | 39.40% | 43% | 4.9 | Acropora | Recruit |
29_S150_R2_001 | 35.20% | 43% | 4.9 | Acropora | Recruit |
29_S940_R1_001 | 48.60% | 43% | 14.8 | Acropora | Recruit |
29_S940_R2_001 | 44.60% | 44% | 14.8 | Acropora | Recruit |
2_S123_R1_001 | 39.10% | 44% | 4.6 | Acropora | Recruit |
2_S123_R2_001 | 35.20% | 44% | 4.6 | Acropora | Recruit |
2_S913_R1_001 | 48.30% | 44% | 13.1 | Acropora | Recruit |
2_S913_R2_001 | 44.50% | 45% | 13.1 | Acropora | Recruit |
30_S151_R1_001 | 49.60% | 42% | 9.4 | Pocillopora | Larvae |
30_S151_R2_001 | 46.80% | 43% | 9.4 | Pocillopora | Larvae |
30_S941_R1_001 | 59.60% | 42% | 27.4 | Pocillopora | Larvae |
30_S941_R2_001 | 56.70% | 43% | 27.4 | Pocillopora | Larvae |
31_S152_R1_001 | 39.70% | 41% | 4.9 | Pocillopora | Recruit |
31_S152_R2_001 | 34.80% | 42% | 4.9 | Pocillopora | Recruit |
31_S942_R1_001 | 48.80% | 42% | 14.7 | Pocillopora | Recruit |
31_S942_R2_001 | 43.80% | 42% | 14.7 | Pocillopora | Recruit |
32_S153_R1_001 | 41.80% | 43% | 5 | Acropora | Recruit |
32_S153_R2_001 | 38.30% | 43% | 5 | Acropora | Recruit |
32_S943_R1_001 | 50.60% | 43% | 14.1 | Acropora | Recruit |
32_S943_R2_001 | 47.40% | 44% | 14.1 | Acropora | Recruit |
33_S154_R1_001 | 38.60% | 42% | 4.3 | Pocillopora | Recruit |
33_S154_R2_001 | 34.10% | 43% | 4.3 | Pocillopora | Recruit |
33_S944_R1_001 | 47.00% | 42% | 12.5 | Pocillopora | Recruit |
33_S944_R2_001 | 42.50% | 43% | 12.5 | Pocillopora | Recruit |
34_S155_R1_001 | 41.40% | 42% | 4.1 | Pocillopora | Larvae |
34_S155_R2_001 | 39.00% | 43% | 4.1 | Pocillopora | Larvae |
34_S945_R1_001 | 48.70% | 42% | 11.3 | Pocillopora | Larvae |
34_S945_R2_001 | 45.60% | 43% | 11.3 | Pocillopora | Larvae |
35_S156_R1_001 | 38.00% | 43% | 4.2 | Acropora | Recruit |
35_S156_R2_001 | 33.90% | 43% | 4.2 | Acropora | Recruit |
35_S946_R1_001 | 46.20% | 43% | 12 | Acropora | Recruit |
35_S946_R2_001 | 41.80% | 43% | 12 | Acropora | Recruit |
36_S157_R1_001 | 41.40% | 43% | 5.5 | Acropora | Recruit |
36_S157_R2_001 | 36.00% | 43% | 5.5 | Acropora | Recruit |
36_S947_R1_001 | 49.50% | 43% | 14.9 | Acropora | Recruit |
36_S947_R2_001 | 44.30% | 44% | 14.9 | Acropora | Recruit |
37_S158_R1_001 | 39.10% | 42% | 4.2 | Acropora | Larvae |
37_S158_R2_001 | 37.00% | 42% | 4.2 | Acropora | Larvae |
37_S948_R1_001 | 46.10% | 42% | 11.3 | Acropora | Larvae |
37_S948_R2_001 | 43.90% | 43% | 11.3 | Acropora | Larvae |
38_S159_R1_001 | 59.90% | 43% | 28.2 | Pocillopora | Larvae |
38_S159_R2_001 | 57.80% | 43% | 28.2 | Pocillopora | Larvae |
38_S949_R1_001 | 0.00% | 41% | 0 | Pocillopora | Larvae |
38_S949_R2_001 | 0.00% | 42% | 0 | Pocillopora | Larvae |
39_S160_R1_001 | 39.70% | 41% | 3 | Acropora | Larvae |
39_S160_R2_001 | 36.70% | 42% | 3 | Acropora | Larvae |
39_S950_R1_001 | 50.10% | 41% | 10.1 | Acropora | Larvae |
39_S950_R2_001 | 47.10% | 42% | 10.1 | Acropora | Larvae |
3_S124_R1_001 | 41.60% | 41% | 4.5 | Acropora | Larvae |
3_S124_R2_001 | 38.90% | 42% | 4.5 | Acropora | Larvae |
3_S914_R1_001 | 49.20% | 42% | 11.9 | Acropora | Larvae |
3_S914_R2_001 | 46.50% | 42% | 11.9 | Acropora | Larvae |
40_S161_R1_001 | 42.00% | 42% | 4.8 | Acropora | Larvae |
40_S161_R2_001 | 39.50% | 42% | 4.8 | Acropora | Larvae |
40_S951_R1_001 | 50.70% | 42% | 13.1 | Acropora | Larvae |
40_S951_R2_001 | 48.20% | 42% | 13.1 | Acropora | Larvae |
41_S162_R1_001 | 39.10% | 41% | 3.8 | Acropora | Larvae |
41_S162_R2_001 | 36.00% | 42% | 3.8 | Acropora | Larvae |
41_S952_R1_001 | 47.00% | 42% | 10.7 | Acropora | Larvae |
41_S952_R2_001 | 43.80% | 43% | 10.7 | Acropora | Larvae |
42_S163_R1_001 | 44.80% | 43% | 5.3 | Acropora | Recruit |
42_S163_R2_001 | 39.80% | 44% | 5.3 | Acropora | Recruit |
42_S953_R1_001 | 53.60% | 43% | 15.7 | Acropora | Recruit |
42_S953_R2_001 | 49.00% | 44% | 15.7 | Acropora | Recruit |
43_S164_R1_001 | 36.50% | 42% | 4.6 | Acropora | Recruit |
43_S164_R2_001 | 32.50% | 43% | 4.6 | Acropora | Recruit |
43_S954_R1_001 | 43.40% | 43% | 12.5 | Acropora | Recruit |
43_S954_R2_001 | 39.00% | 43% | 12.5 | Acropora | Recruit |
44_S165_R1_001 | 37.30% | 43% | 4.2 | Acropora | Recruit |
44_S165_R2_001 | 33.70% | 44% | 4.2 | Acropora | Recruit |
44_S955_R1_001 | 45.80% | 43% | 11.9 | Acropora | Recruit |
44_S955_R2_001 | 42.20% | 44% | 11.9 | Acropora | Recruit |
45_S166_R1_001 | 45.80% | 41% | 4.4 | Pocillopora | Larvae |
45_S166_R2_001 | 42.80% | 42% | 4.4 | Pocillopora | Larvae |
45_S956_R1_001 | 54.80% | 42% | 12.4 | Pocillopora | Larvae |
45_S956_R2_001 | 52.20% | 42% | 12.4 | Pocillopora | Larvae |
46_S167_R1_001 | 32.50% | 45% | 3.9 | Acropora | Recruit |
46_S167_R2_001 | 29.30% | 46% | 3.9 | Acropora | Recruit |
46_S957_R1_001 | 41.00% | 45% | 11.7 | Acropora | Recruit |
46_S957_R2_001 | 37.30% | 46% | 11.7 | Acropora | Recruit |
47_S168_R1_001 | 37.60% | 42% | 4.9 | Pocillopora | Recruit |
47_S168_R2_001 | 33.30% | 43% | 4.9 | Pocillopora | Recruit |
47_S958_R1_001 | 45.90% | 43% | 13.9 | Pocillopora | Recruit |
47_S958_R2_001 | 41.30% | 43% | 13.9 | Pocillopora | Recruit |
48_S169_R1_001 | 40.40% | 42% | 5.3 | Acropora | Larvae |
48_S169_R2_001 | 35.90% | 43% | 5.3 | Acropora | Larvae |
48_S959_R1_001 | 49.30% | 42% | 15.9 | Acropora | Larvae |
48_S959_R2_001 | 45.20% | 43% | 15.9 | Acropora | Larvae |
49_S170_R1_001 | 36.70% | 43% | 4 | Pocillopora | Larvae |
49_S170_R2_001 | 34.30% | 43% | 4 | Pocillopora | Larvae |
49_S960_R1_001 | 43.90% | 43% | 11.2 | Pocillopora | Larvae |
49_S960_R2_001 | 41.00% | 44% | 11.2 | Pocillopora | Larvae |
4_S125_R1_001 | 52.20% | 41% | 5.4 | Pocillopora | Larvae |
4_S125_R2_001 | 47.00% | 42% | 5.4 | Pocillopora | Larvae |
4_S915_R1_001 | 61.30% | 42% | 15.2 | Pocillopora | Larvae |
4_S915_R2_001 | 56.40% | 42% | 15.2 | Pocillopora | Larvae |
50_S171_R1_001 | 39.90% | 43% | 4.3 | Acropora | Recruit |
50_S171_R2_001 | 35.90% | 43% | 4.3 | Acropora | Recruit |
50_S961_R1_001 | 48.50% | 43% | 13 | Acropora | Recruit |
50_S961_R2_001 | 44.30% | 44% | 13 | Acropora | Recruit |
51_S172_R1_001 | 38.40% | 43% | 4.3 | Pocillopora | Larvae |
51_S172_R2_001 | 36.00% | 43% | 4.3 | Pocillopora | Larvae |
51_S962_R1_001 | 45.30% | 43% | 11.8 | Pocillopora | Larvae |
51_S962_R2_001 | 42.40% | 43% | 11.8 | Pocillopora | Larvae |
52_S173_R1_001 | 37.60% | 42% | 4.2 | Pocillopora | Larvae |
52_S173_R2_001 | 35.10% | 43% | 4.2 | Pocillopora | Larvae |
52_S963_R1_001 | 44.50% | 43% | 11.1 | Pocillopora | Larvae |
52_S963_R2_001 | 41.50% | 43% | 11.1 | Pocillopora | Larvae |
53_S174_R1_001 | 38.90% | 43% | 4.8 | Acropora | Recruit |
53_S174_R2_001 | 35.60% | 43% | 4.8 | Acropora | Recruit |
53_S964_R1_001 | 47.30% | 43% | 13.3 | Acropora | Recruit |
53_S964_R2_001 | 43.50% | 43% | 13.3 | Acropora | Recruit |
54_S175_R1_001 | 58.20% | 43% | 26 | Acropora | Recruit |
54_S175_R2_001 | 54.80% | 44% | 26 | Acropora | Recruit |
55_S176_R1_001 | 38.90% | 41% | 4.7 | Pocillopora | Recruit |
55_S176_R2_001 | 34.60% | 42% | 4.7 | Pocillopora | Recruit |
55_S966_R1_001 | 47.40% | 42% | 13.5 | Pocillopora | Recruit |
55_S966_R2_001 | 42.80% | 42% | 13.5 | Pocillopora | Recruit |
56_S177_R1_001 | 41.40% | 41% | 4.3 | Acropora | Larvae |
56_S177_R2_001 | 38.70% | 42% | 4.3 | Acropora | Larvae |
56_S967_R1_001 | 50.50% | 42% | 12.7 | Acropora | Larvae |
56_S967_R2_001 | 47.60% | 42% | 12.7 | Acropora | Larvae |
57_S178_R1_001 | 38.10% | 42% | 4.7 | Acropora | Larvae |
57_S178_R2_001 | 34.30% | 42% | 4.7 | Acropora | Larvae |
57_S968_R1_001 | 45.90% | 42% | 14 | Acropora | Larvae |
57_S968_R2_001 | 41.50% | 43% | 14 | Acropora | Larvae |
58_S179_R1_001 | 22.60% | 42% | 7.4 | Pocillopora | Larvae |
58_S179_R2_001 | 21.00% | 43% | 7.4 | Pocillopora | Larvae |
58_S969_R1_001 | 28.30% | 43% | 21.4 | Pocillopora | Larvae |
58_S969_R2_001 | 26.50% | 44% | 21.4 | Pocillopora | Larvae |
59_S180_R1_001 | 38.00% | 42% | 4.2 | Acropora | Recruit |
59_S180_R2_001 | 33.70% | 43% | 4.2 | Acropora | Recruit |
59_S970_R1_001 | 46.00% | 42% | 11.9 | Acropora | Recruit |
59_S970_R2_001 | 41.40% | 43% | 11.9 | Acropora | Recruit |
5_S126_R1_001 | 44.70% | 42% | 5 | Pocillopora | Larvae |
5_S126_R2_001 | 41.90% | 43% | 5 | Pocillopora | Larvae |
5_S916_R1_001 | 51.60% | 42% | 13.1 | Pocillopora | Larvae |
5_S916_R2_001 | 48.60% | 43% | 13.1 | Pocillopora | Larvae |
60_S181_R1_001 | 40.50% | 41% | 4.1 | Acropora | Larvae |
60_S181_R2_001 | 37.50% | 42% | 4.1 | Acropora | Larvae |
60_S971_R1_001 | 48.90% | 42% | 12.2 | Acropora | Larvae |
60_S971_R2_001 | 46.00% | 42% | 12.2 | Acropora | Larvae |
61_S182_R1_001 | 28.80% | 43% | 1.9 | Pocillopora | Recruit |
61_S182_R2_001 | 24.70% | 43% | 1.9 | Pocillopora | Recruit |
61_S972_R1_001 | 35.30% | 43% | 5.4 | Pocillopora | Recruit |
61_S972_R2_001 | 30.70% | 44% | 5.4 | Pocillopora | Recruit |
62_S183_R1_001 | 35.90% | 42% | 4.4 | Pocillopora | Recruit |
62_S183_R2_001 | 31.40% | 42% | 4.4 | Pocillopora | Recruit |
62_S973_R1_001 | 44.10% | 42% | 12.4 | Pocillopora | Recruit |
62_S973_R2_001 | 39.70% | 43% | 12.4 | Pocillopora | Recruit |
63_S184_R1_001 | 38.70% | 42% | 3.7 | Acropora | Larvae |
63_S184_R2_001 | 35.70% | 42% | 3.7 | Acropora | Larvae |
63_S974_R1_001 | 47.20% | 42% | 11 | Acropora | Larvae |
63_S974_R2_001 | 44.00% | 42% | 11 | Acropora | Larvae |
64_S185_R1_001 | 46.90% | 41% | 4.3 | Acropora | Recruit |
64_S185_R2_001 | 42.50% | 42% | 4.3 | Acropora | Recruit |
64_S975_R1_001 | 56.30% | 42% | 12.6 | Acropora | Recruit |
64_S975_R2_001 | 52.60% | 42% | 12.6 | Acropora | Recruit |
6_S127_R1_001 | 44.50% | 43% | 4.7 | Acropora | Recruit |
6_S127_R2_001 | 39.10% | 43% | 4.7 | Acropora | Recruit |
6_S917_R1_001 | 54.10% | 43% | 13.3 | Acropora | Recruit |
6_S917_R2_001 | 48.80% | 44% | 13.3 | Acropora | Recruit |
7_S128_R1_001 | 35.50% | 45% | 4.8 | Pocillopora | Larvae |
7_S128_R2_001 | 31.90% | 45% | 4.8 | Pocillopora | Larvae |
7_S918_R1_001 | 42.80% | 45% | 13.7 | Pocillopora | Larvae |
7_S918_R2_001 | 38.50% | 45% | 13.7 | Pocillopora | Larvae |
8_S129_R1_001 | 40.80% | 42% | 4.9 | Acropora | Larvae |
8_S129_R2_001 | 37.50% | 43% | 4.9 | Acropora | Larvae |
8_S919_R1_001 | 49.70% | 42% | 14.9 | Acropora | Larvae |
8_S919_R2_001 | 46.30% | 43% | 14.9 | Acropora | Larvae |
9_S130_R1_001 | 41.70% | 41% | 6 | Acropora | Larvae |
9_S130_R2_001 | 37.60% | 42% | 6 | Acropora | Larvae |
9_S920_R1_001 | 48.60% | 42% | 16 | Acropora | Larvae |
9_S920_R2_001 | 44.10% | 42% | 16 | Acropora | Larvae |
- There are four files per samples and the second round of sequencing was needed to obtain at least 15M sequences. There is high duplication, which is not uncommon in RNA datasets.
- Quality scores were high on average.
- GC sequence content looks odd with one particular sample. We will look at this again following trimming and we will see how the sequences perform in alignment.
- N content looks good.
- Duplication varies with some samples showing higher duplication. This is not uncommon with RNA seqeuences especially if there was low RNA concentration and it was sequenced deeply.
- Sample 38 had high overrepresented sequences.
- Adapters are present and will be removed in trimming.
Samples 11_S922_R1_001
(no reads) and 38_S949_R1_001
(bad GC and quality) fail QC checks, so instead we will use the other sequence run files for these samples (11_S132 and 38_S159).
Trimming raw sequences
I first ran a step to trim adapters from sequences. I will then generate another QC report to look at the results before making other trimming decisions.
I first moved the .md5 files to an md5 folder to keep only .fastq files in raw-sequences
.
cd raw-sequences
mkdir md5_files
mv *.md5 md5_files
Make a new folder for trimmed sequences.
cd ../
mkdir trimmed-sequences
Find the program in the Roberts Lab bioinformatics container.
cd /gscratch/srlab/containers/
ls -la
Most recent container is srlab-R4.4-bioinformatics-container-f050784.sif
.
cd scripts
nano trim_adapters_slurm.sh
I will use the following settings for trimming in fastp
. Fastp documentation can be found here.
detect_adapter_for_pe \
- This enables auto detection of adapters for paired end data
qualified_quality_phred 30 \
- Filters reads based on phred score >=30
unqualified_percent_limit 10 \
- percents of bases are allowed to be unqualified, set here as 10%
length_required 100 \
- Removes reads shorter than 100 bp. We have read lengths of 150 bp.
cut_right cut_right_window_size 5 cut_right_mean_quality 20
- Jill used this sliding cut window in her script. I am going to leave it out for now and evaluate the QC to see if we need to implement cutting.
Make the job script.
nano trim_adapters_slurm.sh
#!/bin/bash
#SBATCH --job-name=adapter-trimming
#SBATCH --mail-type=ALL
#SBATCH --mail-user=ashuff@uw.edu
#SBATCH --account=srlab
#SBATCH --partition=cpu-g2-mem2x
#SBATCH --nodes=1
#SBATCH --mem=250G
#SBATCH --time=02-12:00:00 # Max runtime in DD-HH:MM:SS format.
#SBATCH --chdir=/gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences
#SBATCH --export=all
#SBATCH -o adapter-trim-%j.out
#SBATCH -e adapter-trim-%j.error
# load modules
module load apptainer
apptainer exec \
--home $PWD \
--bind /mmfs1/home/ \
--bind /mmfs1/gscratch/ \
--bind /gscratch/ \
/gscratch/srlab/containers/srlab-R4.4-bioinformatics-container-f050784.sif \
/gscratch/srlab/ashuff/moorea-2023-rnaseq/scripts/trim_adapters.sh
Make the script for the specific job.
nano trim_adapters.sh
#save file
chmod +x trim_adapters.sh
#!/bin/bash
# This script is designed to be called by a SLURM script which
# runs this script across an array of HPC nodes.
# Make an array of sequences to trim in raw data directory
cd /gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences/
array1=($(ls *R1_001.fastq.gz))
echo "Read trimming of adapters started." $(date)
# fastp and fastqc loop
for i in ${array1[@]}; do
fastp --in1 ${i} \
--in2 $(echo ${i}|sed s/_R1/_R2/)\
--out1 /gscratch/srlab/ashuff/moorea-2023-rnaseq/trimmed-sequences/trim.${i} \
--out2 /gscratch/srlab/ashuff/moorea-2023-rnaseq/trimmed-sequences/trim.$(echo ${i}|sed s/_R1/_R2/) \
--detect_adapter_for_pe \
--qualified_quality_phred 30 \
--unqualified_percent_limit 10 \
--length_required 100
done
echo "Read trimming of adapters completed." $(date)
sbatch trim_adapters_slurm.sh
Submitted job 29916491 at 17:00 on Sep 30. Job finished Oct 1.
QC of trimmed sequences
cd scripts
nano qc_trimmed.sh
#!/bin/bash
#SBATCH --job-name=trimmedQC
#SBATCH --mail-type=ALL
#SBATCH --mail-user=ashuff@uw.edu
#SBATCH --account=srlab
#SBATCH --partition=cpu-g2-mem2x
#SBATCH --nodes=1
#SBATCH --mem=250G
#SBATCH --time=01-12:00:00 # Max runtime in DD-HH:MM:SS format.
#SBATCH --chdir=/gscratch/srlab/ashuff/moorea-2023-rnaseq/trimmed-sequences
#SBATCH --export=all
#SBATCH -o trimmed-qc-%j.out
#SBATCH -e trimmed-qc-%j.error
# load modules needed
module load kawaldorflab/fastqc/0.12.1
module load kawaldorflab/multiqc/1.15
# fastqc of raw reads
fastqc trim*.fastq.gz
#generate multiqc report
multiqc ./ --filename multiqc_report_trimmed.html
echo "Trimmed MultiQC report generated." $(date)
Run the script.
sbatch qc_trimmed.sh
Job 29937034 on Oct 1 at 09:28, completed at 12:00.
Move to my computer.
scp ashuff@klone.hyak.uw.edu:/gscratch/srlab/ashuff/moorea-2023-rnaseq/trimmed-sequences/multiqc_report_trimmed.html ~/MyProjects/moorea_symbiotic_exchange_2023/data/rna/QC
I then examined the QC information for the trimmed sequences. The MultiQC for the trimmed sequences can be found here.
Here are the stats for all files following trimming.
Sample Name | % Dups | % GC | M Seqs |
---|---|---|---|
trim.10_S131_R1_001 | 39.8% | 42% | 2.6 |
trim.10_S131_R2_001 | 36.5% | 42% | 2.6 |
trim.10_S921_R1_001 | 48.6% | 42% | 7.5 |
trim.10_S921_R2_001 | 44.9% | 42% | 7.5 |
trim.11_S132_R1_001 | 57.5% | 42% | 24.4 |
trim.11_S132_R2_001 | 53.3% | 42% | 24.4 |
trim.11_S922_R1_001 | 3.3% | 42% | 0.0 |
trim.11_S922_R2_001 | 3.0% | 42% | 0.0 |
trim.12_S133_R1_001 | 60.6% | 42% | 3.3 |
trim.12_S133_R2_001 | 58.1% | 43% | 3.3 |
trim.12_S923_R1_001 | 75.3% | 42% | 9.7 |
trim.12_S923_R2_001 | 73.9% | 42% | 9.7 |
trim.13_S134_R1_001 | 32.0% | 42% | 3.5 |
trim.13_S134_R2_001 | 29.2% | 42% | 3.5 |
trim.13_S924_R1_001 | 37.7% | 42% | 9.8 |
trim.13_S924_R2_001 | 34.4% | 42% | 9.8 |
trim.14_S135_R1_001 | 40.3% | 42% | 4.7 |
trim.14_S135_R2_001 | 35.4% | 43% | 4.7 |
trim.14_S925_R1_001 | 48.4% | 43% | 12.9 |
trim.14_S925_R2_001 | 43.4% | 43% | 12.9 |
trim.15_S136_R1_001 | 39.3% | 42% | 3.8 |
trim.15_S136_R2_001 | 35.8% | 42% | 3.8 |
trim.15_S926_R1_001 | 47.5% | 42% | 10.2 |
trim.15_S926_R2_001 | 43.3% | 42% | 10.2 |
trim.16_S137_R1_001 | 43.8% | 41% | 3.5 |
trim.16_S137_R2_001 | 40.2% | 41% | 3.5 |
trim.16_S927_R1_001 | 53.5% | 41% | 9.9 |
trim.16_S927_R2_001 | 50.3% | 41% | 9.9 |
trim.17_S138_R1_001 | 38.5% | 41% | 4.2 |
trim.17_S138_R2_001 | 33.6% | 41% | 4.2 |
trim.17_S928_R1_001 | 46.5% | 41% | 11.3 |
trim.17_S928_R2_001 | 41.8% | 41% | 11.3 |
trim.18_S139_R1_001 | 45.1% | 41% | 3.0 |
trim.18_S139_R2_001 | 41.9% | 41% | 3.0 |
trim.18_S929_R1_001 | 55.4% | 41% | 7.9 |
trim.18_S929_R2_001 | 52.6% | 41% | 7.9 |
trim.19_S140_R1_001 | 36.0% | 43% | 3.8 |
trim.19_S140_R2_001 | 33.1% | 43% | 3.8 |
trim.19_S930_R1_001 | 42.6% | 43% | 9.2 |
trim.19_S930_R2_001 | 39.1% | 43% | 9.2 |
trim.1_S122_R1_001 | 41.4% | 41% | 3.5 |
trim.1_S122_R2_001 | 37.6% | 41% | 3.5 |
trim.1_S912_R1_001 | 49.4% | 41% | 9.2 |
trim.1_S912_R2_001 | 45.9% | 41% | 9.2 |
trim.20_S141_R1_001 | 41.4% | 42% | 4.5 |
trim.20_S141_R2_001 | 37.2% | 42% | 4.5 |
trim.20_S931_R1_001 | 49.4% | 42% | 11.6 |
trim.20_S931_R2_001 | 45.5% | 42% | 11.6 |
trim.21_S142_R1_001 | 39.9% | 41% | 3.4 |
trim.21_S142_R2_001 | 36.1% | 42% | 3.4 |
trim.21_S932_R1_001 | 47.5% | 42% | 9.2 |
trim.21_S932_R2_001 | 43.7% | 42% | 9.2 |
trim.22_S143_R1_001 | 40.5% | 42% | 3.1 |
trim.22_S143_R2_001 | 37.9% | 42% | 3.1 |
trim.22_S933_R1_001 | 48.6% | 42% | 8.5 |
trim.22_S933_R2_001 | 45.9% | 42% | 8.5 |
trim.23_S144_R1_001 | 33.6% | 42% | 2.9 |
trim.23_S144_R2_001 | 28.9% | 42% | 2.9 |
trim.23_S934_R1_001 | 43.6% | 42% | 9.0 |
trim.23_S934_R2_001 | 38.6% | 42% | 9.0 |
trim.24_S145_R1_001 | 39.9% | 41% | 4.0 |
trim.24_S145_R2_001 | 37.1% | 41% | 4.0 |
trim.24_S935_R1_001 | 48.7% | 41% | 10.6 |
trim.24_S935_R2_001 | 45.5% | 41% | 10.6 |
trim.25_S146_R1_001 | 44.9% | 41% | 3.3 |
trim.25_S146_R2_001 | 40.8% | 42% | 3.3 |
trim.25_S936_R1_001 | 58.2% | 41% | 9.6 |
trim.25_S936_R2_001 | 54.8% | 42% | 9.6 |
trim.26_S147_R1_001 | 37.3% | 42% | 3.9 |
trim.26_S147_R2_001 | 34.2% | 42% | 3.9 |
trim.26_S937_R1_001 | 45.5% | 42% | 10.5 |
trim.26_S937_R2_001 | 42.0% | 42% | 10.5 |
trim.27_S148_R1_001 | 37.4% | 42% | 4.2 |
trim.27_S148_R2_001 | 33.1% | 42% | 4.2 |
trim.27_S938_R1_001 | 45.8% | 42% | 11.1 |
trim.27_S938_R2_001 | 41.6% | 42% | 11.1 |
trim.28_S149_R1_001 | 34.1% | 43% | 3.8 |
trim.28_S149_R2_001 | 29.6% | 43% | 3.8 |
trim.28_S939_R1_001 | 43.1% | 43% | 10.7 |
trim.28_S939_R2_001 | 38.6% | 43% | 10.7 |
trim.29_S150_R1_001 | 38.5% | 42% | 4.4 |
trim.29_S150_R2_001 | 34.5% | 43% | 4.4 |
trim.29_S940_R1_001 | 47.7% | 42% | 12.3 |
trim.29_S940_R2_001 | 44.1% | 43% | 12.3 |
trim.2_S123_R1_001 | 38.2% | 43% | 4.0 |
trim.2_S123_R2_001 | 34.5% | 44% | 4.0 |
trim.2_S913_R1_001 | 47.3% | 43% | 10.7 |
trim.2_S913_R2_001 | 43.7% | 44% | 10.7 |
trim.30_S151_R1_001 | 49.0% | 42% | 8.4 |
trim.30_S151_R2_001 | 46.4% | 42% | 8.4 |
trim.30_S941_R1_001 | 59.0% | 42% | 23.0 |
trim.30_S941_R2_001 | 56.6% | 42% | 23.0 |
trim.31_S152_R1_001 | 38.9% | 41% | 4.4 |
trim.31_S152_R2_001 | 34.1% | 41% | 4.4 |
trim.31_S942_R1_001 | 47.9% | 41% | 12.4 |
trim.31_S942_R2_001 | 43.1% | 41% | 12.4 |
trim.32_S153_R1_001 | 40.9% | 42% | 4.3 |
trim.32_S153_R2_001 | 37.3% | 43% | 4.3 |
trim.32_S943_R1_001 | 49.8% | 42% | 11.4 |
trim.32_S943_R2_001 | 46.6% | 43% | 11.4 |
trim.33_S154_R1_001 | 37.7% | 42% | 3.8 |
trim.33_S154_R2_001 | 33.4% | 42% | 3.8 |
trim.33_S944_R1_001 | 46.0% | 42% | 10.5 |
trim.33_S944_R2_001 | 41.6% | 42% | 10.5 |
trim.34_S155_R1_001 | 40.7% | 42% | 3.8 |
trim.34_S155_R2_001 | 38.5% | 42% | 3.8 |
trim.34_S945_R1_001 | 47.9% | 42% | 9.6 |
trim.34_S945_R2_001 | 45.3% | 42% | 9.6 |
trim.35_S156_R1_001 | 37.3% | 42% | 3.8 |
trim.35_S156_R2_001 | 33.2% | 43% | 3.8 |
trim.35_S946_R1_001 | 45.5% | 43% | 10.2 |
trim.35_S946_R2_001 | 41.1% | 43% | 10.2 |
trim.36_S157_R1_001 | 40.3% | 42% | 4.8 |
trim.36_S157_R2_001 | 35.0% | 42% | 4.8 |
trim.36_S947_R1_001 | 48.5% | 42% | 12.2 |
trim.36_S947_R2_001 | 43.4% | 42% | 12.2 |
trim.37_S158_R1_001 | 38.5% | 41% | 3.8 |
trim.37_S158_R2_001 | 36.6% | 42% | 3.8 |
trim.37_S948_R1_001 | 45.5% | 42% | 9.5 |
trim.37_S948_R2_001 | 43.7% | 42% | 9.5 |
trim.38_S159_R1_001 | 59.6% | 42% | 25.2 |
trim.38_S159_R2_001 | 57.6% | 42% | 25.2 |
trim.38_S949_R1_001 | 0.0% | 44% | 0.0 |
trim.38_S949_R2_001 | 0.0% | 41% | 0.0 |
trim.39_S160_R1_001 | 38.7% | 41% | 2.7 |
trim.39_S160_R2_001 | 36.1% | 41% | 2.7 |
trim.39_S950_R1_001 | 49.2% | 41% | 8.6 |
trim.39_S950_R2_001 | 46.7% | 41% | 8.6 |
trim.3_S124_R1_001 | 40.8% | 41% | 4.0 |
trim.3_S124_R2_001 | 38.2% | 41% | 4.0 |
trim.3_S914_R1_001 | 48.4% | 41% | 9.9 |
trim.3_S914_R2_001 | 45.9% | 42% | 9.9 |
trim.40_S161_R1_001 | 41.4% | 41% | 4.3 |
trim.40_S161_R2_001 | 39.0% | 42% | 4.3 |
trim.40_S951_R1_001 | 50.0% | 41% | 11.0 |
trim.40_S951_R2_001 | 48.0% | 42% | 11.0 |
trim.41_S162_R1_001 | 38.2% | 41% | 3.4 |
trim.41_S162_R2_001 | 35.2% | 41% | 3.4 |
trim.41_S952_R1_001 | 46.0% | 41% | 8.9 |
trim.41_S952_R2_001 | 43.0% | 41% | 8.9 |
trim.42_S163_R1_001 | 43.8% | 43% | 4.7 |
trim.42_S163_R2_001 | 38.9% | 43% | 4.7 |
trim.42_S953_R1_001 | 52.9% | 43% | 13.0 |
trim.42_S953_R2_001 | 48.3% | 43% | 13.0 |
trim.43_S164_R1_001 | 35.7% | 42% | 4.1 |
trim.43_S164_R2_001 | 31.9% | 42% | 4.1 |
trim.43_S954_R1_001 | 42.5% | 42% | 10.3 |
trim.43_S954_R2_001 | 38.6% | 42% | 10.3 |
trim.44_S165_R1_001 | 36.6% | 43% | 3.7 |
trim.44_S165_R2_001 | 33.1% | 43% | 3.7 |
trim.44_S955_R1_001 | 44.9% | 43% | 9.9 |
trim.44_S955_R2_001 | 41.5% | 43% | 9.9 |
trim.45_S166_R1_001 | 45.2% | 41% | 3.9 |
trim.45_S166_R2_001 | 42.5% | 42% | 3.9 |
trim.45_S956_R1_001 | 54.2% | 41% | 10.5 |
trim.45_S956_R2_001 | 51.9% | 42% | 10.5 |
trim.46_S167_R1_001 | 31.6% | 45% | 3.5 |
trim.46_S167_R2_001 | 28.7% | 45% | 3.5 |
trim.46_S957_R1_001 | 40.1% | 45% | 9.7 |
trim.46_S957_R2_001 | 36.6% | 45% | 9.7 |
trim.47_S168_R1_001 | 36.6% | 42% | 4.3 |
trim.47_S168_R2_001 | 32.4% | 42% | 4.3 |
trim.47_S958_R1_001 | 45.0% | 42% | 11.6 |
trim.47_S958_R2_001 | 40.6% | 42% | 11.6 |
trim.48_S169_R1_001 | 39.5% | 41% | 4.7 |
trim.48_S169_R2_001 | 35.0% | 42% | 4.7 |
trim.48_S959_R1_001 | 48.5% | 41% | 12.9 |
trim.48_S959_R2_001 | 44.3% | 42% | 12.9 |
trim.49_S170_R1_001 | 36.1% | 43% | 3.6 |
trim.49_S170_R2_001 | 33.8% | 43% | 3.6 |
trim.49_S960_R1_001 | 43.2% | 43% | 9.5 |
trim.49_S960_R2_001 | 40.5% | 43% | 9.5 |
trim.4_S125_R1_001 | 51.6% | 41% | 4.8 |
trim.4_S125_R2_001 | 46.6% | 41% | 4.8 |
trim.4_S915_R1_001 | 60.9% | 41% | 12.6 |
trim.4_S915_R2_001 | 56.4% | 41% | 12.6 |
trim.50_S171_R1_001 | 39.3% | 43% | 3.9 |
trim.50_S171_R2_001 | 35.5% | 43% | 3.9 |
trim.50_S961_R1_001 | 47.9% | 43% | 11.0 |
trim.50_S961_R2_001 | 44.0% | 43% | 11.0 |
trim.51_S172_R1_001 | 37.7% | 42% | 3.9 |
trim.51_S172_R2_001 | 35.5% | 43% | 3.9 |
trim.51_S962_R1_001 | 44.5% | 43% | 9.9 |
trim.51_S962_R2_001 | 41.9% | 43% | 9.9 |
trim.52_S173_R1_001 | 37.0% | 42% | 3.8 |
trim.52_S173_R2_001 | 34.6% | 42% | 3.8 |
trim.52_S963_R1_001 | 43.6% | 42% | 9.3 |
trim.52_S963_R2_001 | 41.0% | 43% | 9.3 |
trim.53_S174_R1_001 | 38.1% | 42% | 4.3 |
trim.53_S174_R2_001 | 34.9% | 43% | 4.3 |
trim.53_S964_R1_001 | 46.5% | 42% | 11.1 |
trim.53_S964_R2_001 | 42.9% | 43% | 11.1 |
trim.54_S175_R1_001 | 57.7% | 43% | 23.0 |
trim.54_S175_R2_001 | 54.5% | 43% | 23.0 |
trim.55_S176_R1_001 | 38.1% | 41% | 4.2 |
trim.55_S176_R2_001 | 33.7% | 41% | 4.2 |
trim.55_S966_R1_001 | 46.5% | 41% | 11.3 |
trim.55_S966_R2_001 | 42.1% | 41% | 11.3 |
trim.56_S177_R1_001 | 40.8% | 41% | 3.9 |
trim.56_S177_R2_001 | 38.2% | 41% | 3.9 |
trim.56_S967_R1_001 | 49.8% | 41% | 10.8 |
trim.56_S967_R2_001 | 47.3% | 41% | 10.8 |
trim.57_S178_R1_001 | 37.3% | 42% | 4.2 |
trim.57_S178_R2_001 | 33.6% | 42% | 4.2 |
trim.57_S968_R1_001 | 45.0% | 42% | 11.6 |
trim.57_S968_R2_001 | 41.0% | 42% | 11.6 |
trim.58_S179_R1_001 | 21.4% | 42% | 6.2 |
trim.58_S179_R2_001 | 19.6% | 42% | 6.2 |
trim.58_S969_R1_001 | 26.6% | 42% | 17.0 |
trim.58_S969_R2_001 | 24.7% | 42% | 17.0 |
trim.59_S180_R1_001 | 37.2% | 42% | 3.8 |
trim.59_S180_R2_001 | 32.9% | 42% | 3.8 |
trim.59_S970_R1_001 | 45.2% | 42% | 10.0 |
trim.59_S970_R2_001 | 40.7% | 42% | 10.0 |
trim.5_S126_R1_001 | 44.0% | 42% | 4.5 |
trim.5_S126_R2_001 | 41.5% | 42% | 4.5 |
trim.5_S916_R1_001 | 50.9% | 42% | 10.9 |
trim.5_S916_R2_001 | 48.3% | 42% | 10.9 |
trim.60_S181_R1_001 | 39.7% | 41% | 3.7 |
trim.60_S181_R2_001 | 37.0% | 41% | 3.7 |
trim.60_S971_R1_001 | 48.1% | 41% | 10.2 |
trim.60_S971_R2_001 | 45.4% | 41% | 10.2 |
trim.61_S182_R1_001 | 27.7% | 43% | 1.7 |
trim.61_S182_R2_001 | 23.9% | 43% | 1.7 |
trim.61_S972_R1_001 | 34.1% | 43% | 4.5 |
trim.61_S972_R2_001 | 29.6% | 43% | 4.5 |
trim.62_S183_R1_001 | 34.9% | 41% | 3.9 |
trim.62_S183_R2_001 | 30.4% | 42% | 3.9 |
trim.62_S973_R1_001 | 43.2% | 42% | 10.3 |
trim.62_S973_R2_001 | 38.8% | 42% | 10.3 |
trim.63_S184_R1_001 | 37.9% | 41% | 3.3 |
trim.63_S184_R2_001 | 35.1% | 41% | 3.3 |
trim.63_S974_R1_001 | 46.4% | 41% | 9.2 |
trim.63_S974_R2_001 | 43.5% | 41% | 9.2 |
trim.64_S185_R1_001 | 46.4% | 41% | 3.8 |
trim.64_S185_R2_001 | 42.0% | 41% | 3.8 |
trim.64_S975_R1_001 | 56.0% | 41% | 10.4 |
trim.64_S975_R2_001 | 52.4% | 41% | 10.4 |
trim.6_S127_R1_001 | 43.7% | 42% | 4.1 |
trim.6_S127_R2_001 | 38.5% | 43% | 4.1 |
trim.6_S917_R1_001 | 53.4% | 42% | 11.0 |
trim.6_S917_R2_001 | 48.4% | 43% | 11.0 |
trim.7_S128_R1_001 | 34.7% | 44% | 4.3 |
trim.7_S128_R2_001 | 31.3% | 45% | 4.3 |
trim.7_S918_R1_001 | 41.9% | 44% | 11.4 |
trim.7_S918_R2_001 | 38.0% | 45% | 11.4 |
trim.8_S129_R1_001 | 40.1% | 42% | 4.4 |
trim.8_S129_R2_001 | 37.0% | 42% | 4.4 |
trim.8_S919_R1_001 | 48.9% | 42% | 12.4 |
trim.8_S919_R2_001 | 45.8% | 42% | 12.4 |
trim.9_S130_R1_001 | 40.9% | 41% | 5.4 |
trim.9_S130_R2_001 | 37.0% | 41% | 5.4 |
trim.9_S920_R1_001 | 47.8% | 41% | 13.3 |
trim.9_S920_R2_001 | 43.8% | 41% | 13.3 |
Files to remove from analysis due to few/no reads and failed QC:
#no reads/few reads after trimming
trim.11_S922_R1_001
trim.11_S922_R2_001
trim.38_S949_R1_001
trim.38_S949_R2_001
Other reads will need to be concatenated to include both first and second rounds of sequencing. For some samples, the first round produced more sequences while in others the second round produced more. So we are not able to only select one round of sequencing.
Other QC checks passed. Adapters were removed, quality is high, and GC content looks good when we remove the samples I listed above.
Here are some stats:
- Trimming removed between 0.5-5 M sequences.
- GC content was reduced 1-2% afer trimming.
- Duplication was reduced 0-2 after triming.
- Total sequences when totalling both sequencing runs ranged from ~ 6M at the lowest to >25 M at the highest.
The sample with low (<10 M seqs) are:
- 61 (POC recruit)
All other samples have above 10 M seqs with ~ 15 M seqs as the average.
It is likely we will need to remove sample 61 as an outlier in analysis.
Move files to QC directory.
cd trimmed-sequences
mkdir trimmed_fastqc
mv *fastqc.html trimmed_fastqc
mv *fastqc.zip trimmed_fastqc
mv multiqc_report_trimmed.html trimmed_fastqc
Test blast files
I then blasted some of the sequences (3 per file) by hand to check if they mapped to corals as expected. This will serve as a species QC check until I do alignments to reference genomes.
ACR recruit: Sample 2 2_S913_R1_001
- Hits = Acropora!
GTCGGTTCAC+ACACCTGTAT
GGGACCGGATCCACTTGGACCAGTTGGTGCAGGTTCTGTTGTAGAGCTCGTCATGGAAGAATTAGCAGCGCCCATACCATGTGACATAGACATCATGGACTCTGAGCTTGCCATATCCATCACCGTCGTCATCGTGCT
#no similarity found
GTCGGTTCAC+ACACCTGTAT
TTCTGCTTTTCGCTTGTCGATCTCCTTACACTTTGCCAGCACAGCAGGCTTTCCCTCTGGAAGGTTCTTGAAGAGAACTCGTGCCATTGCATCTGCCCAGCCCTCATTCACACGTCCAATGTTATCTGCAGTCTTCATATTTTCC
#Acropora palmata, digitifera, hyacinthus
GTCGGTTCAC+ACACCTGTAT
TGCGGTGTGTCTGACTACTTTTTACATATCAACAATAATAAAGTCATTAAGGAAAAACTAAAGCCAATGAAATGCTTTTTTTTAAAGCAGAACTGAAAATAGAAAAATTATTTCTAGTTTTTAATAATAAAAAATTCATGAAGTTTTTGGC
#Acropora digitifera, spicifera, muricata, hyacinthus
ACR larvae: Sample 1 1_S912_R1_001
- Hits = Acropora!
TCCTAGTATC+ATAGCTTGTG
GCTAGAATGATTACATGTTCAAAGTTAAAAATTCCATTTCACCCTTGTTATTGATAACAATGAAAAAAAAAAGTCAAGGGGCTTCACATTTGTTCAAATGTAAAGCGAGCTTGACAACCCAAAAAACAAATCCTCCTTCC
# Acropora loripes, hyacinthus, digitifera
TCCTAGTATC+ATAGCTTGTG
AGAGAATATGCTGTCATCAGCTTACGGAGACCTAATACACGATCCATTAATGGCTGATGGAGTGGAGCCTGTTGGATCGAAAGCATGGTTTACCAGAAACGGGATCAGGTGAGTTAACCACCTTCATACGTTCTACAATAGTTTTATTGCA
# Acropora spathulata, loripes, hyacinthus, palmata
TCCTAGTATC+ATAGCTTGTG
CGGGCAAATTGGAAGGCTAACAAAGCAAAAGTGCGTGTTATATTGACTTAAGAAGATAAAACAAAGCGTGCAGTCAAAGCGAGGATGACAGAGGCGGCAGTATACGATCCTCCGCATTTCGCAGCCGAAGCTGTTGTGGCTGGGGGGTCTG
# Acropora hyacinthus, spicifera, loripes
POC recruit: Sample 11 11_S132_R1_001
- Hits = Pocillopora!
AAGACGTATG+CCAGATGCCA
GCTGGCTAGGCAACATTTAAAACCTTACACGAGAAACCGAAACAAAAACTACCTTTGGACTATATAACTTAAATCTATCAAATGCTACACCGTACAGGCCTACTCTCAACGAGATTAAACACAATTAAGTTATCAGAACATTAATAAAATG
#Pocillopora grandis, verrucosa, damicornis
AAGACGTATG+CCAGATGCCA
ATCCAGCTTAATAAAACAAAAGTTGTATTCTATGATAATTTCTGCCGTTCCAGTGTGCAGCTCAGTTACATTTAGACACCAAGCATGCAGTGTTGTTGGATTTTTTAAGAACACATAAGAAACAAGAAAAACACCACCTGCAGTTTTCCTC
#Pocillopora damicornis, verrucosa, grandis
AAGACGTATG+CCAGATGCCA
ACTGTGCTCAAGCATTCCTCAGGAGATTTTTATCCTCGGTAAAACGGAGATGTTCTGAATGAAACTTCAGCTGCTGTCATATAACTGTGAAGCTCCAATATGATGAGTTCATTGTCTCCTGTCCGTAACCATG
#Pocillopora damicornis, grandis, verrucosa
POC larvae: Sample 10 10_S921_R1_001
- Hits =
ACCAGCATCT+CAAGGTATGT
GCACTAATTTCTTTTGTCATTTTTAAGCCATCGGTATACAATACAGGTAACAGCTTCTTCTCTTTCAGTTTCTCGAGTGTTTCCTTGTCATCCCGTAAGTCAAG
#Pocillopora damicornis, grandis, verrucosa
ACCAGCATCT+CAAGGTATGT
GCCAAGAAGCTGGAAATCTTAAATTAAATAAGTTTGTCTGAGAATGTCAGGATTAATATCCACAACTCTTGTTTAACCAAAGCTTGAGGAAACAGCTAAACTGAACGTTAGAGGTGCAAAGTTCTTCAATTCAGGAAGGTGTGGCGGCAAC
#Pocillopora damicornis, grandis
ACCAGCATCT+CAAGGTATGT
CCTGAATTGTATTGGCCTTGCTTTCTTCGCTCCGTGATTGGTCGAGGCATTTTCTCAATAACCTCTCCACCAATCATATGCAGTTTTCCCGCACTTCAGGCAGGGAGTTTCCATTGGCTCCTTGAAACATTTTCTTCGGCTCTGATTGGCT
#Pocillopora verrucosa, damicornis, grandis
Submitting sequences to NCBI
I first created an NCBI SRA BioProject and BioSample information in preparation for download and submission. I used the same steps outlined in my previous notebook post for the Hawaii2023 project and the Putnam Lab SRA protocol.
- Created new BioProject
- Release date Oct 1 2026
- Title: Thermal sensitivity across coral larval and recruit life stages
- Description: Examination of thermal stress responses in two coral species across larval and recruit life stages collected in Moorea, French Polynesia in 2023
- Grants: ID: 2205966; OCE-PRF: Investigating ontogenetic shifts in microbe-derived nutrition in reef building corals; National Science Foundation
- Package: MIMS environmental/meta host-associated
The BioProject information file can be found here.
I then completed the SRA upload metadata file, which can be found here.
Requested FTP upload folder.
Create symlinks to the files I want to upload and follow NCBI FTP instructions.
cd /gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences
mkdir ncbi_upload
cd ncbi_upload
ln -s /gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences/*fastq.gz /gscratch/srlab/ashuff/moorea-2023-rnaseq/raw-sequences/ncbi_upload
salloc --partition=ckpt-all --cpus-per-task=1 --mem=10G --time=2:00:00
sftp subftp@sftp-private.ncbi.nlm.nih.gov
#entered credentials
mkdir rnaseq_20251001
mput *
All information was added to the Putnam Lab sequence inventory here.
254 Files were uploaded under SUB15678460.
Sequencing inventory including all accession numbers is available on GitHub here.