Skip to main content
Fig. 4 | Algorithms for Molecular Biology

Fig. 4

From: Fractional hitting sets for efficient multiset sketching

Fig. 4

How supersampler and sourmash perform their respective sketch comparison. Colored rectangles represent k-mers. Those sharing the same color are sharing a common minimizer. In supersampler sketches, k-mers sharing their minimizers are stored in the same partition. In this example, we discuss the comparison of one document against a collection, although other use cases can be inferred. supersampler is capable of skipping certain partitions that are not relevant to the query. By focusing on smaller sub-parts of the collection one at a time, supersampler effectively improves practical performance and reduces memory usage

Back to article page