
Table 4 Suggested scale factors for various levels of desired confidence and various tolerable rates of error, when \(\min (m,n) = 10000\). With only 10K elements, a tolerable error of 7% or less leaves no choice but to use all elements (scale factor 1) to reach the desired accuracy

From: Estimating similarity and distance using FracMinHash

 

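Rows: tolerable error, \(\delta\). Columns: desired level of confidence, \(\alpha\).

| \(\delta\) \ \(\alpha\) | 0.91 | 0.93 | 0.95 | 0.97 | 0.99 |
|------|--------|--------|--------|--------|--------|
| 0.01 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| 0.03 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| 0.05 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| 0.07 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| 0.09 | 0.6794 | 0.7201 | 0.7745 | 0.8572 | 1.0000 |
| 0.1  | 0.5556 | 0.5889 | 0.6334 | 0.7010 | 0.8463 |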
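
For readers who want to use these values programmatically, the following is a minimal sketch that simply transcribes the table into a lookup. The names `CONFIDENCE_LEVELS`, `SCALE_FACTORS`, and `suggested_scale_factor` are hypothetical and not part of the paper or of any FracMinHash library; the sketch does not reproduce the formula behind the values and is valid only for \(\min (m,n) = 10000\) and the tabulated \(\alpha\) and \(\delta\).

```python
# Values transcribed from Table 4; valid only for min(m, n) = 10000.
# All names here are illustrative assumptions, not an existing API.
CONFIDENCE_LEVELS = (0.91, 0.93, 0.95, 0.97, 0.99)  # alpha (columns)

# Keys are tolerable errors (delta); each tuple follows CONFIDENCE_LEVELS order.
SCALE_FACTORS = {
    0.01: (1.0, 1.0, 1.0, 1.0, 1.0),
    0.03: (1.0, 1.0, 1.0, 1.0, 1.0),
    0.05: (1.0, 1.0, 1.0, 1.0, 1.0),
    0.07: (1.0, 1.0, 1.0, 1.0, 1.0),
    0.09: (0.6794, 0.7201, 0.7745, 0.8572, 1.0),
    0.1: (0.5556, 0.5889, 0.6334, 0.7010, 0.8463),
}


def suggested_scale_factor(delta: float, alpha: float) -> float:
    """Return the tabulated scale factor for a (delta, alpha) pair.

    Only exact table entries are supported; no interpolation is attempted.
    """
    try:
        row = SCALE_FACTORS[delta]
        col = CONFIDENCE_LEVELS.index(alpha)
    except (KeyError, ValueError):
        raise ValueError(
            f"(delta={delta}, alpha={alpha}) is not tabulated in Table 4"
        ) from None
    return row[col]


if __name__ == "__main__":
    # A 10% tolerable error at 95% confidence allows keeping about 63% of hashes.
    print(suggested_scale_factor(0.1, 0.95))
    # At 7% tolerable error, every tabulated confidence level needs all elements.
    print([suggested_scale_factor(0.07, a) for a in CONFIDENCE_LEVELS])
```

The lookup deliberately rejects untabulated pairs rather than interpolating, since the table alone does not say how the scale factor varies between entries.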