feat: implement match_single for participant solution #4

ali-sefidmouy · 2025-12-18T11:50:08Z

No description provided.

tavallaie · 2025-12-18T18:03:10Z

Incorrect parent-child matching logic
Using shared allele overlap (q_set & d_set) instead of Mendelian subset rule. This causes massive false positives — unrelated profiles with common alleles rank highly.
No proper Likelihood Ratio (CLR) calculation
CLR set to number of "consistent" loci. Real forensic evaluation requires product of per-locus LRs using allele frequencies. Current ranking does not reflect true relationship strength.
No mutation handling
True parent-child pairs with even one ±1 mutation are incorrectly penalized or excluded.
No candidate pre-filtering / indexing
Full scan of ~500k profiles per query → extremely slow (likely times out in evaluation). Challenge requires efficient filtering for scalability.
Bidirectional logic not properly implemented
Subset check must work both ways (query as child or query as parent), but current overlap test fails this.
Mutated_loci always 0
Required output field not populated.
Inconclusive vs mismatch confusion
True mismatches counted as "inconclusive" instead of exclusions.

Score impact: True parent rarely appears in top 10 due to false positives and lack of statistical weighting.

feat: implement match_single for participant solution

9063772

tavallaie merged commit 6a3d1b3 into pyday-iran:main Dec 28, 2025
1 check failed

Provide feedback