admix.Dataset#
- class admix.Dataset(geno: Array | None = None, lanc: Array | None = None, snp: DataFrame | None = None, indiv: DataFrame | None = None, n_anc: int | None = None, dset_ref=None, snp_idx: slice | int | ndarray | None = None, indiv_idx: slice | int | ndarray | None = None, enforce_order: bool = True)[source]#
Data structure to contain genotype and local ancestry.
- __init__(geno: Array | None = None, lanc: Array | None = None, snp: DataFrame | None = None, indiv: DataFrame | None = None, n_anc: int | None = None, dset_ref=None, snp_idx: slice | int | ndarray | None = None, indiv_idx: slice | int | ndarray | None = None, enforce_order: bool = True)[source]#
Methods
__init__
([geno, lanc, snp, indiv, n_anc, ...])af_per_anc
([force])Return the allele frequency per ancestry (n_snp, n_anc)
allele_per_anc
()Return the allele-per-ancestry raw count matrix
append_indiv_info
(df_info[, force_update])append indiv info to the dataset, individual is matched using the self.indiv.index and df_info.index.
append_snp_info
(df_info)append snp info to the dataset, snp is matched using the self.snp.index and df_info.index.
nhaplo_per_anc
([force])Return the number of haplotype per ancestry (n_snp, n_anc)
persist
()persist the lazy data to memory
Attributes
data
Number of individuals.
geno
Genotype matrix
indiv
One-dimensional annotation of observations (pd.DataFrame).
lanc
Local ancestry matrix
n_anc
Number of ancestries.
n_indiv
Number of individuals.
n_snp
Number of SNPs.
snp
One-dimensional annotation of observations (pd.DataFrame).
uns
Unstructured annotation (ordered dictionary).
xr
Return the xr.Dataset used internally