Bug in estimated depth and base calling

I’m noticing some of my alignments where the number of bases called at some positions plateaus at 65535, e.g. part of my mat.gz file that looks like this:

A       65535   3       55      2       0       6
G       13      0       65535   8       0       7
T       26      34      14      65535   0       3
-       0       0       26      0       0       37
C       10      65535   1       15      0       5
-       0       0       0       2       0       64
G       54      3       65535   6       0       5
G       28      0       65535   6       0       5
A       65535   4       43      4       0       7
A       65535   27      81      47      0       7

65535 is the largest number in a 16-bit unsigned int. That can’t be a conincidence. For my use case this does not matter much, because it’s still easily able to call the right base. But this might cause issues for other datasets, if the correct depth is needed.

Is it possible to change to a 32-bit integer? Depending on the platform, this might actually cause a speed increase, but I don’t know if there will be some memory issues or something.

Edit: BWA reports a depth of around 7400 at these positions. So this is likely a bug in KMA that produces the erroneous value of 65535, it’s not that it maxes out.

Comments (7)