- SNPs are more interesting in the plasmid
- for the chromosome, we only care about which reads match and which do not
December 21, 2018
These are two command lines
bwa index pKEC-a3c.fna
bwa mem -R '@RG\tID:foo\tSM:bar\tLB:library1' \
pKEC-a3c.fna \
/home/bioinfo/1-F5-96_S1_L001_R1_001.fastq.gz \
/home/bioinfo/1-F5-96_S1_L001_R2_001.fastq.gz \
> plasmid.sam
samtools fixmate -O bam plasmid.sam plasmid-fixed.bam
samtools sort -O bam -o plasmid-sorted.bam \
-T /tmp/plasmid plasmid-fixed.bam
samtools faidx pKEC-a3c.fna samtools dict pKEC-a3c.fna > pKEC-a3c.fna.dict samtools index plasmid-sorted.bam
If you are curious (and you should), take a look at the indices
cat pKEC-a3c.fna.fai cat pKEC-a3c.fna.dict
java -Xmx2g -jar /home/anaraven/GenomeAnalysisTK.jar \
-T RealignerTargetCreator \
-R pKEC-a3c.fna \
-I plasmid-sorted.bam -o plasmid.intervals
Here the name of the output file is critical
java -Xmx2g -jar /home/anaraven/GenomeAnalysisTK.jar \
-T IndelRealigner \
-R pKEC-a3c.fna \
-I plasmid-sorted.bam \
-targetIntervals plasmid.intervals \
-o plasmid-realigned.bam
bcftools mpileup -Ou -f pKEC-a3c.fna \
plasmid-realigned.bam | \
bcftools call -vmO z -o study.vcf
This was, more or less, what we did on classes
All the relevant information is on the files
study.vcfplasmid.samecoliK12.sam (or the one corresponding to the chromosome you studied)plasmid-fixed.bam into sam format, to work with a cleaner version