Statistics Resources for the Rehabilitation Researcher
This is a repository for resources related to learning and doing statistics and data analysis and open science and using R and other important things. These resources have been curated for (but certainly not limited to) new and seasoned researchers in rehabilitation research and related fields.
Before you get bogged down in this overwhelming list of resources, I suggest you entertain yourself with this highly enjoyable scientific article (or at least listen to it during your commute this week, there is an audio version narrated by the authors) and remember that your goal is to do science, and these are just tools you might find useful along the way.
Things Could Be Better by Adam Mastroianni
Getting started
My recommendation is that you use this list of resources in a very ad-hoc fashion. Some of the sections I have ordered so that the resource I find myself recommending most often for the total notice is at the top (indicated with an asterisk). Others are just lists of resources or people that I have learned a great deal from and that I think you might find useful. But if they’re on this list, then minimally I have found them helpful at some point or another.
If you don’t see what you’re looking for, please consider asking for suggestions or recommendations on the discussions page (Yes, you will have to make a github account. It’s free. Students get extra perks and so do faculty).
A word of advice
These resources are generally based around the statistical programming language R, but this is mostly because its what I was trained in, what I know the most about, and probably the scripted language used most often in CSD research. Python is cool too sometimes. I wish I knew Julia. Perhaps someone will add a section for Python and Julia. If you really don’t want to learn a scripted/statistical language like R (or python, Julia, SQL…), then I suggest you go check out JASP, which is a free and open-source GUI build on top of R.
Learning R or any of the stats below, especially for anyone who has never written any code before, can be a daunting task. But it is a skill that is worth learning. The best way to learn R is to use it. The best way to use it is to have a project that you need to use it for. The resources below are fantastic - but I highly suggest you use them in tandem with a project you are highly motivated to do in R. For example, you do a few chapters or sections of one of these resources, then you go try to apply that knowledge in your own project and with your own data.
Also, its really helpful to have a mentor who knows a little bit more R or stats than you, even if only to check in with occasionally or give yourself some external accountability. If you don’t have that person, then its worth spending the time and effort to go find them (small plug for ASHA’s MARC program, which I have benefited greatly from, and the many mentors who have and continue to help me learn).
Some thoughts on AI
Generative AI often provides wrong, inefficient, or overly complex code (especially in R). It is often frequently wrong when asked questions about statistical concepts. It uses R packages that are often not well suited to your problem, outdated, or poorly maintained. While it can provide correct solutions to many problems, in many cases I think genAI can and will impede learning and retention of new skills. Moreover, copying and pasting content from genAI reduces the cognitive load during learning. For these reasons, I recommend anyone new to R or learning a statistical method be wary of using generative AI.
However, generative AI can be extremely useful for learning to code (e.g., learning why some code works and other code doesn’t or to explain a frustrating error message or to explain function documentation). I think this is particularly true for people who are completely new to programming/coding - many tutorials assume too much background knowledge and this can make the initial learning process extremely frustrating and result in new learners abandoning their goals, feeling like they are not skilled or knowledgeable enough to learn R, coding, or statistics.
Some appropriate and potentially beneficial uses of generative AI for your learning could be:
- Asking genAI to explain an error message to you
“Example prompt: I’m trying to read this CSV file into R, but I’m getting this error message that I pasted below. I’ve tried X, Y, and Z to fix the issue but none seem to be working. Can you explain the error message to me so that I can learn how to fix it?”
- Asking genAI to explain why one coding approach works while another does not
- Asking genAI to explain unclear documentation
- Asking genAI to give you feedback on a paragraph you wrote for your results
- Asking genAI to help you outline an approach to solving a problem without proving any code
Contributing
Please contribute!! You don’t need to be an expert in stats/quantitative methods. In fact, I’m particularly keen on learning what resources are helpful for students who are learning (re rediscovering) statistics.
Here’s how:
Go open an issue in the github repository (the “issues” tab) and include 3 things:
- a link or reference
- which section it belongs in
- and how you found it useful
If any links are broken or out of date, the github issues are a good place to let me know that so I can fix it or delete the resource.
Contributors: