This tool computes the Hamming distance (edit distance) between all pairs of sequences in the input. Only sequence pairs closer than or equal to the 'Minimum distance' are reported.
The input format should be a list of sequences and their names as two columns in an Excel (.xlsx) document:
ACGTACTCCGT S1
CTCGACTAGCT S2
GGGCGCGGGGT S3
GGGCTCGGTGT S4
If you have multiple barcodes, add these as an extra pair of columns, for example:
AGGCTGAA FWD1 TTCGATAC REV1
TGGCTGAA FWD2 TTCGATAC REV2
AGCGTGAA FWD3 ACTGATAC REV3
TGCGTGAA FWD4 TTCGAATG REV4
Here, the FWD1 through FWD4 sequences would be analyzed as part of barcode set 1, and the REV1 through REV4 sequences would analyzed as part of barcode set 2.
NOTE: sequences of different lengths are allowed, but no alignment is done prior to the comparison. The comparison would start with the first letter of each over the smallest of the two strings, and the difference in lengths would be added to the total distance.
Sequence 1 | Sequence 2 | Distance |