Skip to content

Segfault loading input SNP file with unexpected format #6

@yancylo

Description

@yancylo

I am trying to run RELI on a list of 3,635 genome-wide significant variants which are aggregated into 78 LD blocks, but after RELI loaded in the snp table, it failed when loading the LD table. The error I got is "Segmentation fault":

|---------------------------------------------------------------|
|                                                               |
|       Regulatory Element Locus Intersection (RELI) Analysis   |
|                       Current version: 0.90                   |
|                                                               |
|---------------------------------------------------------------|
Start Regulatory Element Locus Intersection (RELI) analysis.
Running arguements:
1) phenotype snp file: multiple_sclerosis.snp
2) phenotype LD structure file: multiple_sclerosis.ld
3) SNP matching mode: 0
4) null model file: ../data/Null/CommonSNP_MAFmatch
5) dbSNP table file: ../data/SNPtable/SNPtable
6) target chip-seq label: hg19_0302
7) chip-seq index file: ../data/ChIPseq.index
8) chip-seq data dir: ../data/ChIP-seq/
9) output dir name: Output/
10) genome build file:
11) statistics output file name: Output//hg19_0302.RELI.stats
12) overlapped locus numbers output file name: Output//hg19_0302.RELI.overlaps
13) overlapped snps output file name: Output//hg19_0302.RELI.rsids
14) provided phenotype name: MS
15) provided ancestry name: .
using default hg19 genome build
genome structure loaded.
chip-seq index file loaded.
target ChIP-seq file set.
target ChIP-seq file loaded, sorted, and width calculated.
null model loaded.
reading snp file completed.
snp table loaded.
snp MAF information queried.
phenotype LD file loaded.
Segmentation fault

I followed the format of SLE_EU.ld in the example/ folder but I am not sure if I am missing something. My .ld file contains 78 rows, each row starts with the top hit variant of the LD block, and is followed by ":" and a list of rs IDs in the same LD block with genome-wide significant pvalue. All rsIDs in the .ld file are listed in the .snp file (BED4 format).

I tried to run RELI without the .ld file (only run with .snp). It finished without any errors, but of course the results were not legitimate because LD was ignored - I got a Ratio of 1 (100% intersect) and p-value of 0.

Please advise on how to properly account for LD in the analysis. Thank you very much!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions