POLYMER GENOMICS
Data sources·50 layers · versions · provenance
§ DATA SOURCES · 46 layers · versions · licenses · evidence classes

Data sources

Every layer the Polymer API serves, with its origin, version, license, and evidence class. Each response from the API carries a content hash and the same evidence-class tags — so downstream pipelines can audit, cache, and cite.

Evidence classesMMeasuredKCuratedDDerivedSStatisticalHHypothetical
Filter
LayerDescriptionVersionSourceLicenseCountBuildEV
GENCODE v44 genesgencode_v44 · Reference
Comprehensive gene annotation — transcripts, exons, CDS, UTRs.v44 · 2023-10GENCODECC BY 4.063,086 transcriptshg38+hg37K
RefSeq v224refseq_v224 · Reference
NCBI reference annotations · clinical-grade.v224 · 2024-08NCBIPublic domain54,212 transcriptshg38+hg37K
GTEx v8 expressiongtex_v8 · Reference
Cross-tissue gene expression for 54 tissues.v8 · 2019GTExdbGaP DUC54 tissueshg38M
gnomAD v4 constraintgnomad_v4 · Reference
Loss-of-function intolerance metrics — pLI, LOEUF, oe-syn.v4 · 2024gnomADCC BY 4.0730K exomeshg38D
Stacking ΔG₃₇stacking_dg37 · Biophysics
Nearest-neighbor stacking free energy at 37 °C, base-pair resolution.v3.0 · 2026-02SantaLucia 2004 + curatedCC BY 4.03.20 Gb @ 1 bphg38+hg37D
Melting temperaturemelting_tm · Biophysics
Predicted Tm at 50 mM Na⁺, 1 kb sliding window.v3.0 · 2026-02SantaLuciaCC BY 4.03.20 Gb @ 1 kbhg38+hg37D
Intrinsic curvaturecurvature · Biophysics
Calladine-style curvature predictions, 30 bp window.v2.4 · 2026-01Polymer GenomicsPolymer EULA3.20 Gb @ 1 bphg38D
Major/minor groovegroove_width · Biophysics
Major and minor groove width + depth from DNA shape models.v2.4 · 2026-01Polymer GenomicsPolymer EULA3.20 Gb @ 1 bphg38D
A-form propensitya_form_prop · Biophysics
Sequence-encoded propensity to adopt A-form duplex.v2.4 · 2026-01Polymer GenomicsPolymer EULA3.20 Gb @ 1 bphg38D
Z-form propensityz_form_prop · Biophysics
Z-DNA propensity tied to alternating purine-pyrimidine.v2.4 · 2026-01Polymer GenomicsPolymer EULA3.20 Gb @ 1 bphg38D
Molar extinction (E₂₆₀)extinction_e260 · Biophysics
Per-base ε₂₆₀ for quantification calculations.v1.0 · 2026-02Cantor & WarshawCC BY 4.03.20 Gb @ 1 bphg38+hg37D
Biosynthetic costgene_costs_v1 · Biophysics
Per-amino-acid ATP-equivalent biosynthetic cost rolled up per gene.v1.0 · 2025-11Polymer + AkashiCC BY 4.019,427 geneshg38D
CpG sitescpg_sites · CpG / methylation
All CpG dinucleotide positions genome-wide.2024-08GRCh38 / GRCh37Public domain29,439,827hg38+hg37M
CpG islandscpg_islands · CpG / methylation
Gardiner-Garden + Frommer CpG island calls.2024-08UCSCPublic domain235,847hg38+hg37K
EPIC v2 probesprobe_epic_v2 · CpG / methylation
Illumina EPIC v2 array manifest with probe context.v1.0 · 2023IlluminaIllumina937,690hg38M
EPIC v1 probesprobe_epic_v1 · CpG / methylation
Illumina EPIC v1 array manifest.v1.0 · 2016IlluminaIllumina865,918hg38+hg37M
450K probesprobe_450k · CpG / methylation
Illumina HumanMethylation450 BeadChip.v1.0 · 2011IlluminaIllumina485,577hg38+hg37M
Methylation atlasmethylation_atlas · CpG / methylation
Reference WGBS methylation across 12 tissues.v2.1 · 2024NIH RoadmapCC BY 4.012 tissueshg38M
Epigenetic clocksclocks_v1 · CpG / methylation
6 clocks: Horvath, Hannum, PhenoAge, GrimAge, Retro-Age, DunedinPACE.v1.2 · 2026-01Curated · PolymerCC BY-SA 4.06 clocks · 2,488 probeshg38D
Isochoresisochores · Structural
L1/L2/H1/H2/H3 isochore class per 100 kb bin.v1.4 · 2025-09Polymer GenomicsCC BY-SA 4.034,000 binshg38D
TAD domainstads_encode · Structural
TAD calls across 108 ENCODE cell types.2024ENCODECC BY 4.0347,914 TADshg38M
H3K4me3histone_h3k4me3 · Structural
Active promoter mark, ENCODE consolidated.v3 · 2024ENCODECC BY 4.078 cell typeshg38M
H3K27achistone_h3k27ac · Structural
Active enhancer mark.v3 · 2024ENCODECC BY 4.092 cell typeshg38M
H3K27me3histone_h3k27me3 · Structural
Polycomb repressive mark.v3 · 2024ENCODECC BY 4.064 cell typeshg38M
G-quadruplex predictionsg4_predicted · Structural
pqsfinder G4 motif predictions, per-strand.v1.0 · 2024pqsfinderCC BY 4.0739,201 G4shg38D
R-loop forming sequencesrloop_predicted · Structural
QmRLFS R-loop prediction.v1.0 · 2024QmRLFSCC BY 4.0212,488 RLFShg38D
RepeatMasker v4.1.9repeatmasker_v419 · Repeats
All annotated repeats (SINE/LINE/LTR/DNA/Satellite/Simple).v4.1.9 · 2024Smit, Hubley & GreenOpen5,632,191hg38+hg37K
SINE / Alusine_alu · Repeats
Alu family subset with subfamily call (AluY/AluS/AluJ).2024RepeatMaskerOpen1,094,000 Alushg38K
LINE-1line_l1 · Repeats
L1 family with subfamily age (L1HS, L1PA2-7, L2).2024RepeatMaskerOpen569,000 L1shg38K
LTR / ERVltr_erv · Repeats
Endogenous retroviruses with subfamily annotation.2024RepeatMasker + curatedOpen567,000 ERVshg38K
HERV subfamily curationherv_subfamilies · Repeats
Manual subfamily refinement (HERV-K/H/W/E/L).v1.2 · 2025-08Polymer GenomicsPolymer EULA6,464 ERVshg38K
TE age (MYA)te_age · Repeats
Per-element evolutionary age from divergence.v1.0 · 2025-10Polymer GenomicsPolymer EULA5.6M elementshg38D
TE awakening scorete_awakening · Repeats
Reactivation propensity score per TE family.v1.0 · 2026-02Polymer GenomicsPolymer EULA17 familieshg38S
ClinVarclinvar · Variants
Clinically reported variants with assertions.2024-09NCBIPublic domain2,847,212hg38+hg37K
GWAS Cataloggwas_catalog · Variants
EBI GWAS Catalog SNP-trait associations.2024-08EBICC0658,124hg38M
dbSNP build 156dbsnp_b156 · Variants
Reference SNPs for variant annotation.b156 · 2024NCBIPublic domain~1 Bhg38+hg37K
IMGT/HLAimgt_hla · Clinical
Curated HLA allele sequences (32,867 alleles across 6 loci).v3.55 · 2024IMGT/HLACC BY-ND 3.032,867 alleleshg38K
HLA non-coding divergencehla_ncds · Clinical
Per-allele NCDS score derived from non-coding biophysics.v1.0 · 2026-03Polymer GenomicsPolymer EULA32,867 alleleshg38D
TCGAtcga · Clinical
Cancer methylation + RNA-seq, planned ingestion.PendingTCGAdbGaP DUC11,000 sampleshg38M
Contextual biophysicsbiophysics_compute · Compute
8 property groups computable on-demand for any region ≤1 Mb.v3.0 · 2026-02Polymer GenomicsPolymer EULA8 property groupshg38+hg37D
Sequence evaluatorevaluate_compute · Compute
Physics linter with 13 flag codes for ≤100 kb sequences.v2.4 · 2026-01Polymer GenomicsPolymer EULA13 flag codesD
Aggregation serviceaggregation · Compute
Pre-binned layer counts, 1 kb–10 Mb resolution.v1.0 · 2026-01Polymer GenomicsPolymer EULAall layershg38+hg37D
Clock predictionclocks_predict · Compute
Apply DNAm clock coefficients to beta matrices.v1.0 · 2026-02Polymer GenomicsPolymer EULA6 clockshg38D
Exp 17 · recombinationexp17_recomb · Compute
Per-region crossover-hotspot score (AUROC 0.827).v1.0 · 2026-04Polymer GenomicsPolymer EULA1 score per regionhg38S
Exp 03 · TE silencingexp03_te · Compute
Per-TE silencing score with biophysical feature attribution.v1.0 · 2026-02Polymer GenomicsPolymer EULA5.6M scoreshg38S
Exp 22 · CpG erosionexp22_age · Compute
TE-age estimate from CpG erosion chronometer.v1.0 · 2026-05Polymer GenomicsPolymer EULA17 TE familieshg38S
Showing 46 of 46 layers · every layer is queryable through /v1/layers