sgkit.read_vcfzarr¶
-
sgkit.read_vcfzarr(path)¶ Read a VCF Zarr file created using scikit-allel.
Loads VCF variant, sample, and genotype data as Dask arrays within a Dataset from a Zarr file created using scikit-allel’s
vcf_to_zarrfunction.Since
vcf_to_zarrdoes not preserve phasing information, there is nosgkit.variables.call_genotype_phased_specvariable in the resulting dataset.- Parameters
- Return type
- Returns
A dataset containing the following variables:
sgkit.variables.variant_id_spec(variants)sgkit.variables.variant_contig_spec(variants)sgkit.variables.variant_position_spec(variants)sgkit.variables.variant_allele_spec(variants)sgkit.variables.sample_id_spec(samples)sgkit.variables.call_genotype_spec(variants, samples, ploidy)sgkit.variables.call_genotype_mask_spec(variants, samples, ploidy)