Tabix

From Wikipedia, the free encyclopedia

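Tabix is a bioinformatics software utility for indexing large genomic data files so that records overlapping a given genomic region can be retrieved quickly.[1][2][3] It works on position-sorted, TAB-delimited files that have been compressed in the BGZF format.[1] Tabix is free software under the MIT license.[4]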

Benefits

  • Speed: Without an index, extracting the records for a specific region from a large file requires scanning the entire file. Tabix avoids this by using its index to jump directly to the region of interest.
  • Storage efficiency: Tabix works on data compressed in the BGZF format (typically produced with the companion bgzip tool), which reduces storage requirements while still allowing fast random access.
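
The two benefits above can be illustrated with a small Python sketch. It is a toy model, not Tabix's implementation or its index format: sorted records are compressed in independent gzip blocks (BGZF is likewise a series of concatenated gzip blocks), and a simple per-block table of coordinates and byte offsets stands in for the real index. The names `build_blocked_file` and `query`, the record layout, and the half-open coordinate convention are assumptions made for illustration.

```python
import gzip


def build_blocked_file(records, block_size=2):
    """Compress sorted (chrom, start, end, name) records as independent gzip blocks.

    Returns the compressed bytes plus a toy index holding, for each block,
    its chromosome, coordinate span, byte offset, and byte length.
    """
    blob = bytearray()
    index = []
    block = []

    def flush():
        if not block:
            return
        text = "".join("\t".join(map(str, r)) + "\n" for r in block)
        member = gzip.compress(text.encode())
        index.append({
            "chrom": block[0][0],
            "start": min(r[1] for r in block),
            "end": max(r[2] for r in block),
            "offset": len(blob),
            "length": len(member),
        })
        blob.extend(member)
        block.clear()

    for rec in records:
        # Start a new block when the chromosome changes or the block is full.
        if block and (rec[0] != block[0][0] or len(block) >= block_size):
            flush()
        block.append(rec)
    flush()
    return bytes(blob), index


def query(blob, index, chrom, start, end):
    """Return records overlapping the half-open region [start, end) on chrom.

    Only blocks whose span overlaps the region are decompressed; the second
    return value counts how many blocks were actually read.
    """
    hits = []
    blocks_read = 0
    for entry in index:
        if entry["chrom"] != chrom or entry["end"] <= start or entry["start"] >= end:
            continue  # skip this block without touching its compressed bytes
        blocks_read += 1
        member = blob[entry["offset"]:entry["offset"] + entry["length"]]
        for line in gzip.decompress(member).decode().splitlines():
            c, s, e, name = line.split("\t")
            if int(s) < end and int(e) > start:
                hits.append((c, int(s), int(e), name))
    return hits, blocks_read


# Made-up example records, sorted by chromosome and start position.
RECORDS = [
    ("chr1", 100, 200, "a"),
    ("chr1", 300, 400, "b"),
    ("chr1", 5000, 5100, "c"),
    ("chr1", 9000, 9100, "d"),
    ("chr2", 100, 200, "e"),
]
BLOB, INDEX = build_blocked_file(RECORDS, block_size=2)

hits, blocks_read = query(BLOB, INDEX, "chr1", 5050, 6000)
print(hits, blocks_read)  # [('chr1', 5000, 5100, 'c')] 1
```

In this sketch the query decompresses one of the three blocks, while the concatenated blocks still form a valid gzip stream that ordinary gzip tools can read from start to finish.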

References

  1. Li, Heng (March 1, 2011). "Tabix: fast retrieval of sequence features from generic TAB-delimited files". Bioinformatics. 27 (5): 718–719. doi:10.1093/bioinformatics/btq671. ISSN 1367-4803. PMC 3042176. PMID 21208982.
  2. Buffalo, Vince (2015). "Out-of-Memory Approaches: Tabix and SQLite". Bioinformatics Data Skills (1st ed.). California: O'Reilly. p. 427. ISBN 978-1-4493-6737-4. OCLC 916120899.
  3. "VCF+tabix Track Format". UCSC Genome Browser. University of California, Santa Cruz. Retrieved January 26, 2021.
  4. "Samtools/Htslib". GitHub. May 2, 2022.