Skip to content
View gmcmacran's full-sized avatar

Block or report gmcmacran

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gmcmacran/README.md

My repos focus on machine learning, statistics, functional programming, puzzle solving, and notes to myself. I program in python and R. Some repos are full-fledged data science tools and are published outside of github. Others are personal projects.

Open Source

  • python: statsmodels provides classes and functions for the estimation of many different statistical models.
  • python: gpu_glm regularized GLM models running on a GPU.
  • R: dann is an implementation of Hastie and Tibshirani’s Discriminant Adaptive Nearest Neighbor Classification.
  • R: tidydann adds the 'dann' model and the 'sub_dann' model to the Tidymodels ecosystem.
  • R: extendedFamily adds new links to R’s generalized linear models.
  • R: LRTesteR is a collection of hypothesis tests and confidence intervals based on the likelihood ratio.
  • R: GlmSimulatoR allows the user to easily and quickly create data for the generalized linear model.

Example Work

  • python: microsoftLTR trains a M.L. model that directly optimizes gain.
  • R: survivoR builds time to event models.
  • python: time_series_M4 compares multiple models using the M4 dataset.
  • python: translator translates English to Spanish with tensorflow.
  • python: semi_supervised_two explores the usefulness of semi-supervised machine learning.
  • python: anomaly_detection trains multiple anomaly detection models on a simulated dataset.

Implementing ML Models From Scratch

  • python: glm_irls is an implementation of generalized linear models from the ground up using numpy.
  • python: coord-descent-glm is an implementation of generalized linear models using coordinate descent and functional programming.

Hypothesis Testing

  • R: LRTesteR is a collection of hypothesis tests and confidence intervals based on the likelihood ratio.
  • R: TypeOneTypeTwoSim is a simulation of asymptotic type I error rates, type II error rates, and coverage rates of functions in LRTesteR.
  • R: calibration studys calibration of p values from likelihood ratio tests when sample size is small.
  • R: geometric_likelihood_ratio explores a distribution where asymptotic theory does not apply.

Simulation Studies

  • R: normalTestsCompare compares power of Gaussian goodness of fit tests.
  • R: medianTestsCompare compares the sign test to the empirical likelihood quantile test.
  • R: muTestsCompare compares the wilcoxon rank sum to the empirical likelihood test for the mean.
  • R: bayesian_p_values studies how changing the prior distribution's parameters affects p value calculations.

Functional Programming

Puzzle Solving

  • python: backtracking solving puzzles using backtracking algorithms.
    • Sudoku puzzles
    • Knights tour problem
    • N queens problem
    • Pizza Hut's pi day challenge.

Notes

  • R: glm_notes is a collection of notes about generalized linear models.
  • python: pip_package is a simple pip package with software development best practices.
  • python: anaconda_package is a simple conda package with software development best practices.
  • pencil: proofs is a collection of math proofs.
  • Steam Deck: dg2_steam_deck contains scrips for getting community maps for defense grid two maps working on steam deck.

Pinned Loading

  1. statsmodels/statsmodels statsmodels/statsmodels Public

    Statsmodels: statistical modeling and econometrics in Python

    Python 11.3k 3.3k

  2. LRTesteR LRTesteR Public

    A collection of hypothesis tests and confidence intervals based on the likelihood ratio.

    R

  3. tidydann tidydann Public

    adds the 'dann' model and the 'sub_dann' model to the Tidymodels ecosystem.

    R 1

  4. glm_notes glm_notes Public

    A collection of notes about generalized linear models.

    R

  5. proofs proofs Public

    A collection of math proofs.

  6. TypeOneTypeTwoSim TypeOneTypeTwoSim Public

    A simulation of type I error rates, type II error rates, and coverage rates of functions in LRTesteR.

    R