Table 1 Index and query statistics of pangenome query tools

From: MEM-based pangenome indexing for k-mer queries

| Method | Index (HPRC): Size (GB) | Index (HPRC): Pivot | Index (HPRC): Query Length | Index (HPRC): Query Type | Query (HLA Locus): Time | Query (HLA Locus): Memory (GB) |
|---|---|---|---|---|---|---|
| PanKmer | 23.29 | any | 31-mer only | 1, –, 3, – | 1:24:33.87 | 6.27 |
| KMC3-M | 1,267.20 | any | re-index | 1, 2, 3, – | 1:31:23.07 | 14.32 |
| KMC3-C | 18.05 | any | re-index | 1, –, 3, 4* | 0:00:35.71 | 18.10 |
| MEMO-M | 2.35 | re-index | any | 1, 2, 3, – | 0:00:51.15 | 2.69 |
| MEMO-C | 2.04 | re-index | any | 1, –, 3, – | 0:00:13.89 | 2.79 |
| MEMO-DC | 0.87 | re-index | any | –, –, –, 4 | 0:00:08.12 | 2.46 |

  1. The pangenome includes 88 human autosomal haplotypes from HPRC and T2T-CHM13, with T2T-CHM13 as the MEMO pivot. Query types: (1) global presence/absence; (2) member presence/absence; (3) conservation; (4) decile conservation. Query type 4* indicates that a KMC3 decile index gives no relative size reduction. The decile conservation index reports counts rounded down to the nearest decile. Time and Memory give the elapsed runtime and peak memory usage of a conservation query on the HLA locus (chr6:29,476,949–33,231,258), anchored to T2T-CHM13. Time is expressed in hours:minutes:seconds.
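
As an illustration of what rounding a count down to the nearest decile could look like, here is a minimal sketch. It assumes that a decile is a tenth of the total number of genomes in the index and that a conservation count is the number of genomes containing a k-mer. The function name `decile_floor` and the total of 88 are hypothetical choices for this example, not taken from MEMO.

```python
def decile_floor(count: int, total: int) -> int:
    """Round a conservation count down to a decile bin in 0..10.

    Assumption: bin d means the k-mer occurs in at least d/10 of the
    `total` genomes, and in fewer than (d + 1)/10 of them.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= count <= total:
        raise ValueError("count must lie in [0, total]")
    # Integer arithmetic avoids floating-point error at decile boundaries.
    return (10 * count) // total


# Illustrative total only; the real denominator depends on the index.
for count in (0, 43, 44, 87, 88):
    print(count, decile_floor(count, 88))
```

Storing one of eleven bins instead of an exact count loses resolution within each decile, which is the trade-off behind the smaller MEMO-DC index in the table.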