vcf.5 2.3 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596
  1. '\" t
  2. .TH vcf 5 "August 2013" "htslib" "Bioinformatics formats"
  3. .SH NAME
  4. vcf \- Variant Call Format
  5. .SH DESCRIPTION
  6. The Variant Call Format (VCF) is a TAB-delimited format with each data line
  7. consisting of the following fields:
  8. .TS
  9. nlbl.
  10. 1 CHROM CHROMosome name
  11. 2 POS the left-most POSition of the variant
  12. 3 ID unique variant IDentifier
  13. 4 REF the REFerence allele
  14. 5 ALT the ALTernate allele(s) (comma-separated)
  15. 6 QUAL variant/reference QUALity
  16. 7 FILTER FILTERs applied
  17. 8 INFO INFOrmation related to the variant (semicolon-separated)
  18. 9 FORMAT FORMAT of the genotype fields (optional; colon-separated)
  19. 10+ SAMPLE SAMPLE genotypes and per-sample information (optional)
  20. .TE
  21. .P
  22. The following table gives the \fBINFO\fP tags used by samtools and bcftools.
  23. .TP
  24. .B AF1
  25. Max-likelihood estimate of the site allele frequency (AF) of the first ALT allele
  26. (double)
  27. .TP
  28. .B DP
  29. Raw read depth (without quality filtering)
  30. (int)
  31. .TP
  32. .B DP4
  33. # high-quality reference forward bases, ref reverse, alternate for and alt rev bases
  34. (int[4])
  35. .TP
  36. .B FQ
  37. Consensus quality. Positive: sample genotypes different; negative: otherwise
  38. (int)
  39. .TP
  40. .B MQ
  41. Root-Mean-Square mapping quality of covering reads
  42. (int)
  43. .TP
  44. .B PC2
  45. Phred probability of AF in group1 samples being larger (,smaller) than in group2
  46. (int[2])
  47. .TP
  48. .B PCHI2
  49. Posterior weighted chi^2 P-value between group1 and group2 samples
  50. (double)
  51. .TP
  52. .B PV4
  53. P-value for strand bias, baseQ bias, mapQ bias and tail distance bias
  54. (double[4])
  55. .TP
  56. .B QCHI2
  57. Phred-scaled PCHI2
  58. (int)
  59. .TP
  60. .B RP
  61. # permutations yielding a smaller PCHI2
  62. (int)
  63. .TP
  64. .B CLR
  65. Phred log ratio of genotype likelihoods with and without the trio/pair constraint
  66. (int)
  67. .TP
  68. .B UGT
  69. Most probable genotype configuration without the trio constraint
  70. (string)
  71. .TP
  72. .B CGT
  73. Most probable configuration with the trio constraint
  74. (string)
  75. .TP
  76. .B VDB
  77. Tests variant positions within reads. Intended for filtering RNA-seq artifacts around splice sites
  78. (float)
  79. .TP
  80. .B RPB
  81. Mann-Whitney rank-sum test for tail distance bias
  82. (float)
  83. .TP
  84. .B HWE
  85. Hardy-Weinberg equilibrium test (Wigginton et al)
  86. (float)
  87. .P
  88. .SH SEE ALSO
  89. .TP
  90. https://github.com/samtools/hts-specs
  91. The full VCF/BCF file format specification
  92. .TP
  93. .I A note on exact tests of Hardy-Weinberg equilibrium
  94. Wigginton JE et al
  95. PMID:15789306
  96. .\" (http://www.ncbi.nlm.nih.gov/pubmed/15789306)