Publicly available statistical genetics data and analysis. 1000G, HGDP, SGDP, and GIAB. Reproducible global ADMIXTURE with variable K.
-
Updated
Apr 28, 2026 - Python
Publicly available statistical genetics data and analysis. 1000G, HGDP, SGDP, and GIAB. Reproducible global ADMIXTURE with variable K.
Replicating and extending Novembre et al. (2008) — end-to-end Python pipeline for human population structure analysis using 1000 Genomes Phase 1 chr22 data, with a within-Africa extension, Fst-based ancestry-informative marker identification, and an interactive Streamlit explorer.
Benchmarking forensic ancestry inference using AIM SNPs, machine learning and 1000 Genomes validation.
Personal genomics analysis toolkit: ingest consumer DNA raw data, impute against 1000 Genomes, and produce an evidence-graded ledger of pharmacogenomic, carrier-screening, trait, polygenic-score, and haplogroup findings — all locally.
Structural variant analysis of common disease-associated genes in 1000 Genomes long-read sequencing data
A screening pipeline that uses existing Oxford Nanopore long-read sequencing data from the 1000 Genomes Project to identify samples carrying specific copies of three genes of interest as cloning candidates.
Add a description, image, and links to the 1000-genomes topic page so that developers can more easily learn about it.
To associate your repository with the 1000-genomes topic, visit your repo's landing page and select "manage topics."