Recent Blog Posts

More Posts

Introduction Problem Set-up Data Hierarchical Model Prior Predictive Simulation Posterior Distribution Shrinkage Conclusion Introduction The original motivation for this post was to recreate an analysis in BDA (chapter 8), but as I started working through it, I realized it is an opportunity to write about several interesting concepts within the same framework. The overarching theme of this post covers 1. How to set up a multivariate logistic regression (not a multinomial or multivariable)

CONTINUE READING

Motivation Background Bayesian Inference and ANOVA Simulation Set up Frequentist pairwise comparisons Naive Tukey adjusted Multilevel Model Conclusion Motivation They say the best way to learn something, is to teach it! And that’s exactly what I intend to do. Inspired by Solomon Kurz’s blog posts on power calcualtions in bayesian inference, and Dr. Gelman’s blogs, here’s an attempt to open the guts of a problem I’ve struggled with.

CONTINUE READING

Introduction Results Code for simulation Conclusion Hello 2020! I’m going to start off this new year with a post that I’ve been meaning to put up for a while. It has nothing to do with statistics, but very much to do with my other love, the language R and it’s data manipulation libraries tidyr and data.table. Introduction For most of my work, the tidyverse set of packages get me where I want to be with the added ease of syntax, readibility and all the good stuff.

CONTINUE READING

Introduction Simulate data Model Estimating trends Estimating Derivatives Conclusion - Introduction As the title suggests, this post explores the powerful generalized additive models to model time-series data. More often than not, we associate modeling time-series data with forecasting i.e, understanding the underlying data generating process to then forecast the future. However, we could also be interested in understanding the nature of the current time-series, and also, comparing different time series.

CONTINUE READING

I think everyone at somepoint, regardless of what industry they’re in, have folders which have files saved as _v2,_v3… v_final… That can only mean one thing down the line - chaos. I always though version control when it came to code was for a group of people working on a project, and for a lone statistician in the group as myself, I wouldn’t need one. But, Jenny bryan’s Excuse me, do you have a moment to talk about version control changed my mind.

CONTINUE READING

Selected Publications

Guida F, Sun N, Bantis LE, Muller DC, Li P, Taguchi A, Dhillon D, Kundnani DL, Patel NJ, Yan Q, Byrnes G, Moons KGM, Tjønneland A, Panico S, Agnoli C, Vineis P, Palli D, Bueno-de-Mesquita B, Peeters PH, Agudo A, Huerta JM, Dorronsoro M, Barranco MR, Ardanaz E, Travis RC, Byrne KS, Boeing H, Steffen A, Kaaks R, Hüsing A, Trichopoulou A, Lagiou P, La Vecchia C, Severi G, Boutron-Ruault MC, Sandanger TM, Weiderpass E, Nøst TH, Tsilidis K, Riboli E, Grankvist K, Johansson M, Goodman GE, Feng Z, Brennan P, Johansson M, Hanash SM Assessment of Lung Cancer Risk on the Basis of a Biomarker Panel of Circulating Proteins (JAMA Oncology 2018)

Shiels MS, Kirk GD, Drummond MB, Dhillon D, Hanash SM, Taguchi A, Engels EA HIV Infection and Circulating Levels of Prosurfactant Protein B and Surfactant Protein D (Journal of Infectious Diseases 2018)

Çeliktas M, Tanaka I, Tripathi SC, Fahrmann JF, Aguilar-Bonavides C, Villalobos P, Delgado O, Dhillon D, Dennison JB, Ostrin EJ, Wang H, Behrens C, Do KA, Gazdar AF, Hanash SM, Taguchi A Role of CPS1 in Cell Growth, Metabolism and Prognosis in LKB1-Inactivated Lung Adenocarcinoma (JNCI 2017)

Capello M, Bantis LE, Scelo G, Zhao Y, Li P, Dhillon D, Patel NJ, Kundnani DL, Wang H, Abbruzzese JL, Maitra A, Tempero MA, Brand R, Firpo MA, Mulvihill SJ, Katz MH, Brennan P, Feng Z, Taguchi A, Hanash SM Sequential Validation of Blood-Based Protein Biomarker Candidates for Early-Stage Pancreatic Cancer (JNCI 2017)

Contact