feat: scaffold for all annotations with reasonable structure by shloknatarajan · Pull Request #13 · DaneshjouLab/AutoGKB

shloknatarajan · 2025-07-01T06:02:07Z

Annotation pipeline works and saves outputs

…ations - Implement module-level caching for get_true_variants() to avoid repeated JSON file loading - Fix type annotations across multiple files to use Optional[T] instead of T = None - Add comprehensive efficiency analysis report documenting all identified issues - Add test script to verify caching functionality works correctly This addresses the critical efficiency issue where JSON files were loaded on every function call, causing unnecessary disk I/O operations. The caching implementation uses lazy loading with proper error handling for missing files. Co-Authored-By: Shlok Natarajan <shlok.natarajan@gmail.com>

- Add DrugAnnotation and DrugAnnotationList models to src/variants.py - Create new drug_annotation_extraction.py component with detailed field extraction - Integrate drug annotation extraction into variant association pipeline - Add comprehensive test script for verification - Follow existing LLM infrastructure patterns (Generator/Parser) - Extract detailed pharmacogenomic fields matching provided schema Co-Authored-By: Shlok Natarajan <shlok.natarajan@gmail.com>

- Modified extract_drug_annotations to loop through variants one at a time - Each variant now gets individual LLM processing for better control - Added SingleDrugAnnotation model for individual variant processing - Updated logging to show individual variant processing progress - Maintains same output quality while providing cleaner extraction per variant - Updated test script to reflect individual processing approach Co-Authored-By: Shlok Natarajan <shlok.natarajan@gmail.com>

…ation-extraction Add drug annotation extraction component for variants with drug associations

…-improvements Efficiency improvements: Cache JSON loading and fix type annotations

…ents - Add PhenotypeAnnotation and FunctionalAnnotation data models to variants.py - Create phenotype_annotation_extraction.py with detailed extraction logic - Create functional_annotation_extraction.py with mechanistic annotation logic - Update variant_association_pipeline.py to integrate new extraction components - Follow existing drug annotation extraction patterns - Use detailed prompt templates from annotation_prompts.md - Process variants individually for better control and cleaner extraction - Include proper error handling and logging throughout Co-Authored-By: Shlok Natarajan <shlok.natarajan@gmail.com>

- Move test_imports.py and test_new_annotations.py to tests/ directory - Clean up temporary converted notebook file - Follow proper repository structure conventions Co-Authored-By: Shlok Natarajan <shlok.natarajan@gmail.com>

- Fix Python path resolution for tests running from tests/ subdirectory - Ensure tests can properly import src modules from new location - Verify all tests pass after organizational changes Co-Authored-By: Shlok Natarajan <shlok.natarajan@gmail.com>

…functional-extraction feat: implement phenotype and functional annotation extraction components

devin-ai-integration bot and others added 30 commits June 28, 2025 17:05

Add linux-64 platform support for Devin setup

0c9b458

Merge pull request #2 from shloknatarajan/devin/1751131840-drug-annot…

4630fc4

…ation-extraction Add drug annotation extraction component for variants with drug associations

Merge branch 'main' into devin/1751130154-efficiency-improvements

5d3106b

Merge pull request #1 from shloknatarajan/devin/1751130154-efficiency…

883c836

…-improvements Efficiency improvements: Cache JSON loading and fix type annotations

feat: gdown data downloading

296b6f6

feat: gdown pixi command

2d8a132

fix: updated command

cae9aaf

feat: envrc gitignore

6cd116c

fix: remove zip after unzipping

07507a3

chore: black formatting

ed1ca8d

Merge pull request #3 from shloknatarajan/devin/1751134713-phenotype-…

f1f0021

…functional-extraction feat: implement phenotype and functional annotation extraction components

feat: moves tests into folder

2deec50

feat: basic fuser, all_associations (both untested)

fdd46ec

chore: comment

49893f6

feat: all associations prompt updates

dbe40f3

feat: drug annotation prompt updates

b5c91ef

chore: moved old components to deprecated folder

6b9a673

fix: deprecated imports

3cf9c5b

feat: file movements and started phenotype annotation

f55ffe9

feat: phenotype annotation prompt

941f1cf

feat: FA and removed old main code

505bce6

fix: removed unused tests

cf14e9d

feat: study parameters prompt

afce8b2

fix: removed old testing function

fea928e

shloknatarajan added 7 commits June 30, 2025 23:16

fix: updated inference types

c4baaee

fix: updated all inference types

05121c0

checkpoint: debugging all associations run

43b342e

feat: working get all associations

f985500

checkpoint: json output of all associations + generator

72c32b9

chore: types and gitignore

fb76426

checkpoint: almost working drug annotation

04b02c6

shloknatarajan self-assigned this Jul 1, 2025

shloknatarajan added the WIP Work In Progress label Jul 1, 2025

shloknatarajan added 8 commits July 1, 2025 02:11

feat: working drug annotation extraction

0f3fdf4

feat: untested functional annotation

be32867

chore: black formatting

91ea3d7

feat: untested study parameters

62f051f

feat: (untested) complete annotation generation pipeline

7922fa0

feat: final annotation saving (untested)

8a9f821

chore: black

dfae818

Merge branch 'main' of https://github.com/daneshjoulab/autogkb

5f86d89

shloknatarajan removed the WIP Work In Progress label Jul 1, 2025

shloknatarajan added 2 commits July 1, 2025 17:43

feat: full working pipeline run

8bdf492

chore: black formatting

a6d05de

shloknatarajan merged commit 627619f into DaneshjouLab:main Jul 1, 2025
3 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: scaffold for all annotations with reasonable structure#13

feat: scaffold for all annotations with reasonable structure#13
shloknatarajan merged 47 commits intoDaneshjouLab:mainfrom
shloknatarajan:main

shloknatarajan commented Jul 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

shloknatarajan commented Jul 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

shloknatarajan commented Jul 1, 2025 •

edited

Loading