## To students of statistics

Most of my posts are directed at teachers and how to teach statistics. The blog this week and next is devoted to students. I present principles that will help you to learn statistics. I’m turning them into a poster, which I will make available for you to printing later. I’d love to hear from other teachers as I add to my list of principles.

## 1. Statistics is learned by doing

One of the best predictors of success in any subject is how much time you spent on it. If you want to learn statistics, you need to put in time. It is good to read the notes and the textbook, and to look up things on the internet and even to watch Youtube videos if they are good ones. But the most important way to learn statistics is by doing. You need to practise at the skills that are needed by a statistician, which include logical thinking, interpretation, judgment and writing. Your teacher should provide you with worthwhile practice activities, and helpful timely feedback. Good textbooks have good practice exercises. On-line materials have many practice exercises.

Given a choice, do the exercises that have answers available. It is very important that you check what you are doing, as it is detrimental to practise something in the wrong way. Or if you are using an on-line resource, make sure you check your answers as you go, so that you gain from the feedback and avoid developing bad habits.

So really the first principle should really be “statistics is learned by doing **correctly**”.

## 2. Understanding comes with application, not before.

Do not wait until you understand what you are doing before you get started. The understanding comes as you do the work. When we learn to speak, we do not wait until we understand grammatical structure before saying anything. We use what we have to speak and to listen, and as we do so we gain an understanding of how language works. I have found that students who spent a lot of time working through the process of calculating conditional probabilities for screening tests grew to understand the “why” as well as the “how” of the process. Repeated application of using Excel to fit a line to bivariate data and explaining what it meant, enabled students to understand and internalise what a line means. As I have taught statistics for two decades, my own understanding has continued to grow.

There is a proviso. You need to think about what you are doing, and you need to do worthwhile exercises. For example, mechanically calculating the standard deviation of a set of numbers devoid of context will not help you understand standard deviation. Looking at graphs and trying to guess what the standard deviation is, would be a better exercise. Then applying the value to the context is better still.

Applying statistical principles to a wide variety of contexts helps us to discern what is specific to a problem and what is general for all problems. This brings us to the next principle.

## 3. Spend time exploring the context.

In a statistical analysis, context is vital, and often very interesting. You need to understand the problem that gave rise to the investigation and collection of the data. The context is what makes each statistical investigation different. Statisticians often work alongside other researchers in areas such as medicine, psychology, biology and geology, who provide the contextual background to the problem. This provides a wonderful opportunity for the statistician to learn about a whole range of different subjects. The interplay between the data and context mean that every investigation is different.

In a classroom setting you will not have the subject expert available, but you do need to understand the story behind the data. These days, finding out is possible with a click of a Google or Wikipedia button. Knowing the background to the data helps you to make more sensible judgments – and it makes it more interesting.

## 4. Statistics is different from mathematics

In mathematics, particularly pure mathematics, context is stripped away in order to reveal the inner pure truth of numbers and logic. There are applied areas involving mathematics, which are more like statistics, such as operations research and engineering. At school level, one of the things that characterises the study of maths is right and wrong answers, with a minimum of ambiguity. That is what I loved about mathematics – being able to apply an algorithm and get a correct answer. In statistics, however, things are seldom black-and-white. In statistics you will need to interpret data from the perspective of the real world, and often the answer is not clear. Some people find the lack of certainty in statistics disturbing. There is considerable room for discussion in statistics. Some aspects of statistics are fuzzy, such as what to do with messy data, or which is the “best” model to fit a time series. There is a greater need for the ability to write in statistics, which makes if more challenging for students for whom English is not their native language.

## 5. Technology is essential

With computers and calculators, all sorts of activities are available to help learn statistics. Graphs and graphics enable exploration that was not possible when graphs had to be drawn by hand. You can have a multivariate data set and explore all the possible relationships with a few clicks. You should always look at the data in a graphical form before setting out to analyse.

Sometimes I would set optional exercises for students to explore the relationship between data, graphs and summary measures. Very few students did so, but when I led them through the same examples one at a time I could see the lights go on. When you are given opportunities to use computing power to explore and learn – do it!

## But wait…there’s more

Here we have the first five principles for students learning statistics. Watch this space next week for some more. And do add some in the comments and I will try to incorporate your ideas as well.

That’s where my learning anxiety is right in between the “how” and the “why”, and my stubbornness sometimes prevents me from doing before understanding.

From a first year grad student, thank you for the advice Dr Nic. Keep the student-centered posts coming, pls 🙂

Pingback: Lies, Damned Lies, and Statistics (43): Cherry Picking Time Frames | P.a.p.-Blog // Human Rights Etc.

Pingback: Stat Blogs | Pearltrees

thank you so much Dr. Nic. i find information above is very helpful for me. i am an international student studying in Canada, and certainly i choose statistics as my major. as sophomore i am taking some courses about probability, and i sometimes get stuck to understand. i wonder if you have any website good sources to help stat problems? if so would you mind giving me ?

Statistics is not easy to understand but it is very interesting .

Any way I need some technical assistance on how to learn the basic principles and application of statistics.

Excuse My lack of knowledge but please help guide a sister in need.

The following sounds like Greek to Me

My research is about implementing student-centered learning approach to change the focus from teaching to learning. Every part prior to this point is well covered.

I’m stuck on how to value the null or alternative hypotheses

What descriptive and inferrential statistics to use

Null hypothesis- SCL approach will have no effect on how primary school students learn English skills compared to when they’re taught using a teacher-centered approach

Alternative hypothesis – SCL will have a significance effect on how primary school students learn English skills compared to when they’re taught using a teacher-centered approach

Population is 400

Sample size is 280

How does one go about CALCULATING the p-values carry out t tests or z test etc using the information above. What am I missing here

TIA

Hi. Your null and alternative hypotheses look fine. You will need to put your sample data (280 observations) into a computer package and get it to calculate the p-value. Because your population is so small compared with your sample, you may need to do a finite population correction, but don’t worry about that to start with. A good start is to graph your data. First do a dotplot of your measure of learning, for the two groups – with SCL and without SCL. If you do not have one measure of learning, you will need to work out how to get one.

Good luck

Nic