ALDA stands for the book, Applied Longitudinal Data Analysis, by Singer and Willett. I’ve had this lying around for a… Read more
Using a Bootstrap to Estimate Power and Significance Level
I’ve been reading Common Errors in Statistics (and How to Avoid Them) by Phillip Good and James Hardin. It’s a… Read more
Editing a lot of variable names in a R data frame
Someone I work with asked about how to easily update lots of variable names in a R data frame after… Read more
Simulation to Represent Uncertainty in Regression Coefficients
Working through Gelman and Hill’s book can be maddening. The exposition is wonderful but recreating their examples leads to new… Read more
Scraping Virginia Tech Football Data, Part 2
In an earlier post I described how I went about scraping football data off the Virginia Tech athletics web site…. Read more
Playing with R Markdown
I was playing with R Markdown in R Studio and thought I’d share my results. I can’t believe how easy… Read more
A Logistic Regression Checklist
I recently read The Checklist Manifesto by Atul Gawande and was fascinated by how relatively simple checklists can improve performance… Read more
The standard deviation of the sampling distribution of the mean
Someone on Reddit posted Chapter 1 of Howard Wainer’s book, Picturing the Uncertain World. The name of the chapter is… Read more
Using Simulation to Compute Confidence Intervals
I’ve been working through Gelman and Hill’s book, Data Analysis Using Regression and Multilevel/Hierarchical Models. I originally wanted to read… Read more
Buffon’s Needle Problem, or How to use Probability to Estimate Pi
I gave a presentation on Buffon’s Needle Problem in a job interview once. Here’s the presentation I gave in PDF… Read more