Intersecting compressed gVCF with bed file

Intersecting compressed gVCF with bed file


This may be a ridiculously simple question to ask but, I have a compressed genomic VCF file generated by the Strelka germline variant caller, with lines like the following, where no variation was detected:

chr1    27394730    .   T   .   .   PASS    END=27394756;BLOCKAVG_min30p3a  GT:GQX:DP:DPF:MIN_DP    0/0:3070:1137:14:1122

I need to intersect this with a set of regions I’m interested in. I have tried using bedtools intersect with a suitable BED file, but this only matches the the start of this blocked region at chr1 27394730 and not the remainder of the interval chr1:27394730-27394756.

Is there a way to run this intersection using bedtools?

I would think there’s a way to do this by converting the gVCF into a BED file (preserving the variations that Strelka has found), but if there’s a tool that can do this directly, then please point me in that direction





Read more here: Source link