Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion R/colocboost.R
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@
#' In brief, this function fits a multiple linear regression model \eqn{Y = XB + E} in matrix form.
#' ColocBoost can be generally used in multi-task variable selection regression problem.
#'
#' @details The function \code{colocboost} implements the proximity smoothed gradient boosting method from Cao et al (2025).
#' @details The function \code{colocboost} implements the proximity smoothed gradient boosting method from Cao etc (2025).
#' There is an additional step to help merge the confidence sets with small \code{between_putiry}
#' (default is 0.8) but within the same locus. This step addresses potential instabilities in linkage disequilibrium (LD) estimation
#' that may arise from small sample sizes or discrepancies in minor allele frequencies (MAF) across different confidence sets.
Expand Down
82 changes: 82 additions & 0 deletions inst/WORDLIST
Original file line number Diff line number Diff line change
@@ -0,0 +1,82 @@
Biobank
Bioinformatics
CFB
COLOC
CoS
Codecov
ColocBoost
Colocalization
Colocalized
Conda
FineBoost
GTEx
GWAS
HyPrColoc
INDELs
Jager
KK
LD
MAF
Mazumder
Micromamba
NPC
Najar
Nealelab
PIPs
PLINK
Pre
Recalibrate
SEL
SuSiE
Sumstat
UKBB
VCP
VPA
Xcorr
YI
bioinformatics
chrom
cis
colocalization
colocalize
colocalized
conda
de
decayrate
eQTL
grey
iteratively
jk
ld
lfsr
lth
maf
medRxiv
modularity
nd
npc
omics
phenotypes
pixi
pos
pre
probabilistically
pvalue
qc
rcond
recalibrate
recalibrated
reconciliate
rss
sQTL
subsampled
sumstat
sumstats
tabix
uCoS
uS
ucos
uncolocalized
vcp
xQTL
xQTLs
2 changes: 1 addition & 1 deletion man/colocboost.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

16 changes: 8 additions & 8 deletions vignettes/Ambiguous_Colocalization.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ knitr::opts_chunk$set(

This vignette demonstrates an example of ambiguous colocalization from trait-specific effects using the `colocboost`.
Specifically, we will use the `Ambiguous_Colocalization`, which is output from `colocboost` analyzing GTEx release v8 and UK Biobank summary statistics
(see more details of the original data source in Acknowledgement section).
(see more details of the original data source in Acknowledgment section).

```{r setup}
library(colocboost)
Expand Down Expand Up @@ -160,12 +160,12 @@ names(res$ambiguous_cos[[1]])

**Explanation of results** For each ambiguous colocalization, the following information is provided:

- `ambiguous_cos`: Contains variants indices and names of the original trait-specific uCoS used to construct this ambiguouse colocalization.
- `ambiguous_cos_overlap`: Contains the overlapping variants information across the uCoS used to construct this ambiguouse colocalization.
- `ambiguous_cos_union`: Contains the union of variants information across the uCoS used to construct this ambiguouse colocalization.
- `ambiguous_cos_outcomes`: Contains the outcomes indices and names for uCoS used to construct this ambiguouse colocalization.
- `ambiguous_cos_weight`: Contains the trait-specific weights of the uCoS used to construct this ambiguouse colocalization.
- `ambiguous_cos_puriry`: Contains the purity of across uCoS used to construct this ambiguouse colocalization.
- `ambiguous_cos`: Contains variants indices and names of the original trait-specific uCoS used to construct this ambiguous colocalization.
- `ambiguous_cos_overlap`: Contains the overlapping variants information across the uCoS used to construct this ambiguous colocalization.
- `ambiguous_cos_union`: Contains the union of variants information across the uCoS used to construct this ambiguous colocalization.
- `ambiguous_cos_outcomes`: Contains the outcomes indices and names for uCoS used to construct this ambiguous colocalization.
- `ambiguous_cos_weight`: Contains the trait-specific weights of the uCoS used to construct this ambiguous colocalization.
- `ambiguous_cos_puriry`: Contains the purity of across uCoS used to construct this ambiguous colocalization.
- `recalibrated_cos_vcp`: Contains the recalibrated integrative weight to analogous to variant colocalization probability (VCP) from the ambiguous colocalization results.
- `recalibrated_cos`: Contains the recalibrated 95% colocalization confidence set (CoS) from the ambiguous colocalization results.

Expand Down Expand Up @@ -206,7 +206,7 @@ if researcher decides to investigate these ambiguous colocalization events.
- While we provide recalibrated weights as a suggested approach for interpreting ambiguous results, users can still choose between recalibrated weights and trait-specific weights based on their research context.
- The `colocboost_plot` function will not consider it as colocalized but still showing them as uncolocalized events, with overlapping variants color labeled.

# Acknowledgement
# Acknowledgment

- The eQTL data used for the analyses described in this example results were obtained from GTEx release v8 from [GTEx Portal](https://gtexportal.org/home/downloads/adult-gtex/qtl).
- The GWAS summary statistics used for the analyses described in this example results were obtained from UK Biobank (UKBB)
Expand Down
8 changes: 4 additions & 4 deletions vignettes/Interpret_ColocBoost_Output.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -298,12 +298,12 @@ res$ucos_details$cos_ucos_purity

#### 3.5.4. Other components

- **`ucos_weight`**: Integrative weights for each trait-specific (uncolocalized) trait, used to recalibrate UCoS when traits are filtered out.
- **`ucos_weight`**: Integrative weights for each trait-specific (uncolocalized) trait, used to recalibrate uCoS when traits are filtered out.
- **`ucos_top_variables`**: Indices and names of the top variable for each uCoS, which is the variable with the highest VCP.
- **`ucos_purity`**: Includes three lists, each containing an $uS \times uS$ matrix, where $uS$ is the number of uCoS:
- `min_abs_cor`: Minimum absolute correlation of variables within (diagonal) UCoS or between (off-diagonal) different uCoS.
- `median_abs_cor`: Median absolute correlation of variables within or between UCoS.
- `max_abs_cor`: Maximum absolute correlation of variables within or between UCoS.
- `min_abs_cor`: Minimum absolute correlation of variables within (diagonal) uCoS or between (off-diagonal) different uCoS.
- `median_abs_cor`: Median absolute correlation of variables within or between uCoS.
- `max_abs_cor`: Maximum absolute correlation of variables within or between uCoS.

By analyzing these components, you can gain a deeper understanding of trait-specific (uncolocalized) effects that are not colocalized,
providing additional insights into the data.
Expand Down
4 changes: 2 additions & 2 deletions vignettes/LD_Free_Colocalization.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -117,15 +117,15 @@ While this method is computationally efficient, it has limitations due to the st
Users should interpret the results with caution, especially in regions with complex LD structures or multiple causal variants.


ColocBoost also provides a flexibility to use Hyprcoloc compatible format for summary statistics without LD matrix.
ColocBoost also provides a flexibility to use HyPrColoc compatible format for summary statistics without LD matrix.

```{r hyprcoloc-compatible}
# Loading the Dataset
data(Ind_5traits)
X <- Ind_5traits$X
Y <- Ind_5traits$Y

# Coverting to Hyprcoloc compatible format
# Coverting to HyPrColoc compatible format
effect_est <- effect_se <- effect_n <- c()
for (i in 1:length(X)){
x <- X[[i]]
Expand Down
6 changes: 3 additions & 3 deletions vignettes/Summary_Statistics_Colocalization.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -174,17 +174,17 @@ res$cos_details$cos$cos_index
```


## 3.4. Hyprcoloc compatible format: effect size and standard error matrices
## 3.4. HyPrColoc compatible format: effect size and standard error matrices

ColocBoost also provides a flexibility to use Hyprcoloc compatible format for summary statistics with and without LD matrix.
ColocBoost also provides a flexibility to use HyPrColoc compatible format for summary statistics with and without LD matrix.

```{r hyprcoloc-compatible}
# Loading the Dataset
data(Ind_5traits)
X <- Ind_5traits$X
Y <- Ind_5traits$Y

# Coverting to Hyprcoloc compatible format
# Coverting to HyPrColoc compatible format
effect_est <- effect_se <- effect_n <- c()
for (i in 1:length(X)){
x <- X[[i]]
Expand Down