Add Anderson rubin inference by kv9898 · Pull Request #621 · lrberge/fixest

kv9898 · 2025-11-28T23:04:08Z

This pull request adds support for the Anderson-Rubin (AR) weak-instrument robust test for instrumental variable (IV) models in the fixest package. It introduces two new functions, ar_test() and ar_confint(), along with their print methods, and includes comprehensive tests to ensure correct behavior across a variety of scenarios.

This is requested as part of #442.

New IV Robustness Features:

Added the Anderson-Rubin (AR) test for IV models via the new ar_test() function, which provides weak-instrument robust inference for coefficients of endogenous regressors. The related ar_confint() function computes AR confidence intervals by inverting the test.
Exported the new ar_test and ar_confint functions in the NAMESPACE to make them available for users.
Registered S3 print methods for the new result classes fixest_ar and fixest_ar_confint in the NAMESPACE.

Testing and Validation:

Added extensive tests for the new AR test and confidence interval functions in tests/fixest_tests.R, covering correct output structure, distribution selection, error handling for invalid input, and integration with model features like clustering, weights, and multiple instruments.

Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

…an interval logic Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

…/numeric CI, unified API Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

* pass correct argument to calculate test * improve argument passing and modify the ci argument to take "numeric" * set ci to false when there is more than one endo var Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Refine doc Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * remove compute_ci * use vcov_arg_to_pass Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * set ci to false when conditions are not met Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Initial plan * Add weights support to exact AR CI calculation Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> * Add tests for weighted AR CI calculation Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> * Address code review feedback: fix comment and use object$weights Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

* Initial plan * Update ar_test tests to use realistic DGP validated against ivmodel package Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> * Fix test 2 * Fix up to tests 6 and 7

kylebutts · 2025-11-29T13:50:30Z

Happy to spend a few hours reviewing this code, but could you first create a small working example showing this matches the results from ivDiag? https://yiqingxu.org/packages/ivDiag/articles/iv_tutorial.html

Also related to #593

kv9898 · 2025-11-29T13:57:56Z

Happy to spend a few hours reviewing this code, but could you first create a small working example showing this matches the results from ivDiag? https://yiqingxu.org/packages/ivDiag/articles/iv_tutorial.html

Also related to #593

Hi @kylebutts . During my internal testing, I benchmarked it against both ivmodel and ivDiag. However, I believe that ivDiag lacks precision: for when a closed form solution is possible (iid se with no fixed effects), ivmodel (which has an exact solution algorithm) and my PR's exact and numeric solution yield same results, but ivDiag yields a different one. I will present a reproducible example shortly

My PR is superior to

ivDiag, for mine is more precise (to the 8 decimal places by default), and allows for closed-form solutions when possible. Mine is also not conceivably slower than ivDiag (I don't feel the difference)
ivmodel, as it allows for different vcov choices, fixed effects. It is also more user-friendly by also taking an existing feols object.

kv9898 · 2025-11-29T14:27:46Z

Here's a simple repex:

library(fixest)
library(ivmodel)
library(ivDiag)

# Data generation ----
set.seed(123)

gamma <- 0.5
beta <- 2
obs <- 100

e_1 <- rnorm(obs, mean = 0, sd = 1)
z <- rnorm(obs, mean = 0, sd = 1)
x <- gamma * z + e_1
w <- rnorm(obs, mean = 0, sd = 1)
e_2 <- rnorm(obs, mean = 0, sd = 1) + 0.8 * e_1
y <- beta * x + w + e_2

df <- data.frame(y, x, z, w)

# ivmodel (B0=0) ----
ivm <- ivmodel(y, x, z, w) # This uses an exact closed-form solution
summary(ivm) # AR: F=10.79273, df1=1, df2=97, p=0.0014187
# CI: [1.2562, 2.5130]

ivdiag <- AR_test(df, "y", "x", "z", "w")
summary(ivdiag) # AR: F=16.2581, CI: [1.4889, 2.4556] NOT PRECISE!

reg_iv <- feols(y ~ w | x ~ z, data = df) # our PR
(test0 <- ar_test(reg_iv)) # AR: F=10.79, df1=1, df2=97, p=0.001419
# CI: [1.256, 2.513]
(test0n <- ar_test(reg_iv, ci = "numeric"))  # try our grid search algorithm
# CI: [1.256, 2.513], the numeric results are designed to be accurate to the 8th decimal place

# ivmodel (B0=1) ----
ivm_1 <- ivmodel(y, x, z, w, beta0 = 1)
summary(ivm_1) # AR: F=5.647947, df1=1, df2=97, p=0.019439
# CI: [1.2562, 2.5130]

# fixest (IV) ----
(test1 <- ar_test(reg_iv, beta0 = 1)) # must use my custom function
(test2 <- ar_test(reg_iv, beta0 = 1, ci = "numeric"))
# AR: F=5.647947, df1=1, df2=97, p=0.019439
# CI: [1.2562, 2.5130], the numeric results are designed to be accurate to the 8th decimal place

Fix obs_selection handling in AR test functions * Initial plan * Apply obs_selection filtering in .ar_test_core and .ar_ci_exact_iid Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> * pass weights to feols

kv9898 · 2025-11-30T10:22:55Z

@kylebutts Just realized I have probably seen you in Positron discussions? It's nice to see you in another repo again. I'm also interested in adding valid t-ratio inference from by Lee et al. (2022) and Montiel Olea &
Pueger's (2013) effective first stage F-test. But they will be separate PRs and I want to get this done first.

lrberge · 2025-12-16T16:09:04Z

Hi @kv9898 and thanks for this PR!
I'm not sure creating new functions is that necessary (although I understand the appeal!).
I'd like also to think to the integration with, eg, etable. I'll have a look in January.
Thanks again and in the meantime merry Christmas!

kylebutts · 2025-12-16T20:42:52Z

I'm working through the code and have some changes I'll commit to this branch when I finish.

I agree with @lrberge that we should think more about how users would use this. On the one-hand, we form a p-value or a confidence interval for the endogenous variable. We could imagine plugging in AR-based p-value or CI in that row of the regression table. Would it make sense to have a different kind of CI for the endogenous variable only?

Alternatively, it could be treated similar to the other fitstats and go below the coefficients

What do we think?

grantmcdermott · 2025-12-17T16:25:48Z

Alternatively, it could be treated similar to the other fitstats and go below the coefficients

Without having looked at the code, this would have been my first inclination.

lrberge · 2025-12-19T16:38:00Z

Without having looked at the code, this would have been my first inclination.

+1!

Full integration with streamlined user experience needs to be thought carefully.
This would change the coefficient table, it can be a chance to make a template for future similar additions.

kv9898 · 2026-01-14T10:09:22Z

@kylebutts Happy New Year! Any updates?

lrberge · 2026-01-14T10:27:44Z

Hi @kv9898, to do it correctly it actually implies big and well thought internal changes. I'm sorry but this won't be direct. I beg for your patience.

kv9898 · 2026-01-14T10:39:34Z

Hi @kv9898, to do it correctly it actually implies big and well thought internal changes. I'm sorry but this won't be direct. I beg for your patience.

Ohh I thought we were just planning for some nice wrappers around the function. No rush, take your time! Thanks for the quick reply, though.

Copilot AI and others added 17 commits November 26, 2025 22:46

Initial plan

32c3ab3

Implement ar_test and ar_confint functions for Anderson-Rubin inference

092623b

Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

Fix code review comments: simplify validation, robust temp names, cle…

cc7c1ac

…an interval logic Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

Improve robustness: limit temp name attempts, escape regex special chars

9a34798

Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

Change ar_test header

3621eb4

pass correct vcov flag

c0f5296

Refactor AR test: add internal core, iid/non-iid distributions, exact…

1235f7e

…/numeric CI, unified API Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

Fix floating-point comparison and remove leftover code in numeric CI

38fdad2

Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com>

use correct degrees of freedom + formatting

cb17c64

Update AR test to use realistic DGP validated against ivmodel (#4)

c5e0c78

* Initial plan * Update ar_test tests to use realistic DGP validated against ivmodel package Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> * Fix test 2 * Fix up to tests 6 and 7

Fix tests 12-19

abb04d1

fix cluster

a053e04

Simplify vcov passing

3e11071

no exact CI calculated with fixed effects

d8f2dcf

update docs

eb945e5

kv9898 force-pushed the anderson-rubin-inference branch from 36a51be to eb945e5 Compare November 29, 2025 09:43

kv9898 added 2 commits November 29, 2025 09:50

Further refine doc

fc1b63f

Update markdowns and description

4ee3a95

Fix zero weight problem (#5)

8b489af

Fix obs_selection handling in AR test functions * Initial plan * Apply obs_selection filtering in .ar_test_core and .ar_ci_exact_iid Co-authored-by: kv9898 <105025148+kv9898@users.noreply.github.com> * pass weights to feols

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Add Anderson rubin inference#621

Add Anderson rubin inference#621
kv9898 wants to merge 20 commits intolrberge:masterfrom
kv9898:anderson-rubin-inference

kv9898 commented Nov 28, 2025 •

edited

Loading

Uh oh!

kylebutts commented Nov 29, 2025

Uh oh!

kv9898 commented Nov 29, 2025 •

edited

Loading

Uh oh!

kv9898 commented Nov 29, 2025 •

edited

Loading

Uh oh!

kv9898 commented Nov 30, 2025

Uh oh!

lrberge commented Dec 16, 2025

Uh oh!

kylebutts commented Dec 16, 2025

Uh oh!

grantmcdermott commented Dec 17, 2025

Uh oh!

lrberge commented Dec 19, 2025

Uh oh!

kv9898 commented Jan 14, 2026

Uh oh!

lrberge commented Jan 14, 2026

Uh oh!

kv9898 commented Jan 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Comments

Conversation

kv9898 commented Nov 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kylebutts commented Nov 29, 2025

Uh oh!

kv9898 commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kv9898 commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kv9898 commented Nov 30, 2025

Uh oh!

lrberge commented Dec 16, 2025

Uh oh!

kylebutts commented Dec 16, 2025

Uh oh!

grantmcdermott commented Dec 17, 2025

Uh oh!

lrberge commented Dec 19, 2025

Uh oh!

kv9898 commented Jan 14, 2026

Uh oh!

lrberge commented Jan 14, 2026

Uh oh!

kv9898 commented Jan 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

kv9898 commented Nov 28, 2025 •

edited

Loading

kv9898 commented Nov 29, 2025 •

edited

Loading

kv9898 commented Nov 29, 2025 •

edited

Loading