I am trying to understand how sinto ends up filtering and selecting unique fragments per cell. my input bam file has the following reads assigned to my cell of interest
A00261:518:HK73GDSX3:1:1515:27118:35211 147 chr1 9997 51 110S40M = 10010 -27 CCACAGCCGCGGCAAAGCCACATCACTTTCACCTCCACCAACACACAAAATCAAACAATCACTAACGCTAACTGTCTGACTCACTCTGCCTCACTATACCTAAACCTATACCGATAACCCTAACCCTAACCCTAACCCTAACCCTAACCC :,,,,,,,,,,,,,,F,,,F,:,,,,:,,,,,,,,F,:,,,:,,,,,,,,:,F,,,,:,,,,F,,,,,,,,,,,,F,,,,F,,,,,,,,,F:,:,,F,,,,:,:,,,F,:,:,:F:,F:,:F,FFFFFFFFFFFFFFFFFFFFFFFFFFF NM:i:0 MD:Z:40 AS:i:40 XS:i:37 XA:Z:chr6,-147869,113S37M,0;chr7,-10002,114S36M,0;chr1,-180752,114S36M,0;chr15,+101981123,36M114S,0; CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:TTATTGGT QT:Z:FFFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:2125:4083:16673 147 chr1 10002 0 43S107M = 10010 -99 CCTCTTTCTCCTGCAGCGTCATATGTTTAGTATAGCCCTCCCAAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC ,,,,:,,,,,,,,,,,F::,,,,,,,,:,,,,,,,,,,,:,,,,:::,:F,:F,,:,:F:FF,F::FF,F:FFF::,,FFF:F,FFF:F,FFF:FFFFFF:FFFF:F,FFF:::FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF NM:i:0 MD:Z:107 AS:i:107 XS:i:108 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFF::FFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:AACGGTCA QT:Z:FFFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1216:26946:37012 147 chr1 10003 0 98S52M = 10045 -10 CACCCCAACTCTAATGCCTCGGCGTCCACCTAGTCCTACTCATATTCATTGTGGTTACGGGTTTGTCTTCGGTATCGTAAGATGTGTATATTACACTTACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC ,,,,,,,,,,,,,F,,,,,,,,,,,,,,,,,F,:,,::,,,,FF:,,,F:,:,,F,F,:,F:,,F:,:::,F:,:,,,,,,,FFF,F,:,F,,,,,F,,F,FFFFF,FFFFF:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF NM:i:0 MD:Z:52 AS:i:52 XS:i:53 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:CCGAACTC QT:Z:FFFF:FFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1515:27118:35211 99 chr1 10010 60 62M2I35M3D28M23S = 9997 27 CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCAAACCCTAACCCTAACCCTCTAACCCTAACCCTAACCCTAACCCTAACCCTAACACCCTAACCCTAACCCTAACCCTAACCCGGGGCGTTACGCTCCCTCTAACC FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFFFF:FF:FF,FFF,,,FF,,F:F,,:F::,F:,::FFF,,:F,::::,:FF,:FFFF:F,FFF:,FFFF:F:,FFF,FFF:,::FF,,:FF,,,,,,,,,,,,,,,,,,,,,,, NM:i:6 MD:Z:45T51^CCA28 AS:i:103 XS:i:91 XA:Z:chr7,+10001,14S48M2I35M1D28M23S,4;chr7,+10035,45M3D19M4D35M1D28M23S,10;chr1,+180749,53M2D9M2I35M1D20M31S,6; CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:TTATTGGT QT:Z:FFFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:2125:4083:16673 99 chr1 10010 0 101M49S = 10002 99 CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCAAAACCAAAACCCACTCACTTATAAACATCTACGAACCAACCAGACAAAGG FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFF:FF:FFFFF:FFFFF,FF:FF:FF::F,FF,F:,FF,FF,FF,:,,:,,,,,:,F,F,FFF,,,,,,F,:F,F,,F,FFFF,,, NM:i:0 MD:Z:101 AS:i:101 XS:i:100 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFF::FFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:AACGGTCA QT:Z:FFFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1664:20157:30859 147 chr1 10027 0 74S76M = 10033 -70 ACCACCGAGATCTACACATATTCATGGTTGTAACGCGTCTGTTGTAGGCAGCGTCATATGTGTATATTATACTGACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC ,,F,F,FF,FF:FF,F,:FF,FF,,,,F:FFF:F:,,::F::,,F,F,,F,:FF,,,FF,F,FFF,::F,,,,,F:::FFFF,,FFF,::F::,,FF:FFFFF:FFFFF:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF NM:i:0 MD:Z:76 AS:i:76 XS:i:77 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:CCGAACTC QT:Z:FFFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1431:25192:2347 99 chr1 10028 0 83M67S = 10034 81 CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCAAACCCAAAACCAAACACTAACCCACAACCAGACGCTCCAACTAACCCTAAGCCTAAGCCTGCAAGTAAGCCTCG FFFFFFFFFFFF:FFFFFFFFFF:FFFFFF:FFFFFFFFFFFFFF:FFFFFFFFFFF:FFF:FFFFFFF:FFFFF:,,,F,F:,,:,,F:F,FFF::F::,,,F,,,F,,FF,F,,:,:,,:,:,,,:,:,,,,F,,,,,:,:,,F,:,, NM:i:1 MD:Z:75T7 AS:i:78 XS:i:77 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:GGTCCAAG QT:Z:FFFFF:FF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:2414:19244:7044 1123 chr1 10028 0 83M67S = 10028 81 CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCAAGACGAAAAAAAACAACTAACACAACCCCACACAAAACACAATACCCTATCCCGAGCGCTGCGACTAA FFFFFFFFFFFFFFFFFFFFFFFFFFF:FFFFF,FFFFFFFF:FF,FF,FF:FFF::FFF:FFFFF:FF:FF:F::FFFFFF:,,F,,::,,:,F,,:F:,,,,,,,,,F::,:F:,:,,,F,F,,,:,F:,,,:,,,,,::,,,,:,,, NM:i:0 MD:Z:83 AS:i:83 XS:i:81 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:AACGGTCA QT:Z:F:FFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1318:11731:7592 99 chr1 10028 0 83M67S = 10400 414 CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCAAACCATAAACCAAAACATCAACATAACCCTAACACTACCCCAATCCCTACCCCTAACGCTCAGCGTAG F,FFFFFFFFFFFFFFFF,,F,FF:FFF:FFFF:F:F:FF:F,,,FFF::,,:F,::::F,F:FFF,,F,:F:FF:FFF,F,F,:F,,,,,F,,:F,,,,,F,,,,FF,,,:::F,F:F,,,,,,,,,,F,,F,,,,,,,,F,,,,,,,, NM:i:0 MD:Z:83 AS:i:83 XS:i:83 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFF,F:::FFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:TTATTGGT QT:Z::FFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:2414:19244:7044 1171 chr1 10028 0 69S81M = 10028 -81 AGAGAGCAACACTCATACTATGTTGTAACGGATCTGTATTAGTAAGAGTCAGATGTAGCTAAGACACATCCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC ,,::F,,,FFFF,,F,,,,F,::,,,F,,,:,:F,,F,,F,F:FF,,,F,FF:,,,,,,,,F,:,,,F,FFFFFFF,F:FFF:FFFFFFFFFFFF:FF,FF:FFF:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF NM:i:0 MD:Z:81 AS:i:81 XS:i:81 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:AACGGTCA QT:Z:F:FFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1664:20157:30859 99 chr1 10033 0 78M72S = 10027 70 ACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCAAGACAATTAAAAACAACTCACAGCCCACGATACCCGAACTCATCGCGTATGGCGTGGGCTGCGGGTAACCGGG FFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFF:FFFFF:FF,FFF:F:FF,:FFFF:FF:FFFFF:FF,FF,,,,F,,:,F,F,F,:,F:,,,F,,,:F:,:,F,FF,FFF,F,,F,,,F,:,,F,,,,,,,,,,,,,,,,,,,, NM:i:0 MD:Z:78 AS:i:78 XS:i:76 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:CCGAACTC QT:Z:FFFFFFFF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1
A00261:518:HK73GDSX3:1:1431:25192:2347 147 chr1 10034 0 75S75M = 10028 -81 TACCACTTAGATATACACTTATACTACGTTTTAGCGTTTCTGTATTCGTAAGCGTAAGATTATAAATAAACATATCCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC :F,F,,,:F:,:,F:,,,F,::,,FF,:,,F,:,:,,::,F,F:,,:,,,F,,,F:F,:,,,:,F,F,,,,,,F,FF:FFF::FFF:FFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFF NM:i:0 MD:Z:75 AS:i:75 XS:i:75 CR:Z:AGATTCAAGGTTGTAA CY:Z:FFFFFFFFFFFFFFFF CB:Z:CTGAATATCCTGGTCT-1 BC:Z:GGTCCAAG QT:Z:FFFFF:FF RG:Z:Sample_output:MissingLibrary:1:HK73GDSX3:1`
I understand that many of these reads will get removed due to mapping quality. Still, I don't really understand what leads to the positions 10013 and 10031. Is this due to +4/-5 shifting? Even so, I don't see how these numbers are arrived at. Could you please help me understand this?