This is a short post, sometimes called e-begging!
I had been toying with the idea of a Kickstarter project, as a way for supporters of my work to help us keep going. Kickstarter is a form of crowd-sourcing, which lets a whole lot of people each contribute a little bit to get a project off the ground.

But we don’t really have one big project, but rather a stream of videos and web-posts to support the teaching and learning of statistics. Patreon provides a more incremental way for appreciative fans to support the work of content creators.

Rather than producing for one big publishing company, who then hold the rights to our material, we would love to keep making our content freely available to all. You can help, with just a few dollars per video.

The phrase I despise more than any in popular use (and believe me there are many contenders) is “Those who can, do, and those who can’t, teach.” I like many of the sayings of George Bernard Shaw, but this one is dismissive, and ignorant and born of jealousy. To me, the ability to teach something is a step higher than being able to do it. The PhD, the highest qualification in academia, is a doctorate. The word “doctor” comes from the Latin word for teacher.

Teaching is a noble profession, on which all other noble professions rest. Teachers are generally motivated by altruism, and often go well beyond the requirements of their job-description to help students. Teachers are derided for their lack of importance, and the easiness of their job. Yet at the same time teachers are expected to undo the ills of society. Everyone “knows” what teachers should do better. Teachers are judged on their output, as if they were the only factor in the mix. Yet how many people really believe their success or failure is due only to the efforts of their teacher?

For some people, teaching comes naturally. But even then, there is the need for pedagogical content knowledge. Teaching is not a generic skill that transfers seamlessly between disciplines. You must be a thinker to be a good teacher. It is not enough to perpetuate the methods you were taught with. Reflection is a necessary part of developing as a teacher. I wrote in an earlier post, “You’re teaching it wrong”, about the process of reflection. Teachers need to know their material, and keep up-to-date with ways of teaching it. They need to be aware of ways that students will have difficulties. Teachers, by sharing ideas and research, can be part of a communal endeavour to increase both content knowledge and pedagogical content knowledge.

There is a difference between being an explainer and being a teacher. Sal Khan, maker of the Khan Academy videos, is a very good explainer. Consequently many students who view the videos are happy that elements of maths and physics that they couldn’t do, have been explained in such a way that they can solve homework problems. This is great. Explaining is an important element in teaching. My own videos aim to explain in such a way that students make sense of difficult concepts, though some videos also illustrate procedure.

Teaching is much more than explaining. Teaching includes awakening a desire to learn and providing the experiences that will help a student to learn. In these days of ever-expanding knowledge, a content-driven approach to learning and teaching will not serve our citizens well in the long run. Students need to be empowered to seek learning, to criticize, to integrate their knowledge with their life experiences. Learning should be a transformative experience. For this to take place, the teachers need to employ a variety of learner-focussed approaches, as well as explaining.

It cracks me up, the way sugary cereals are advertised as “part of a healthy breakfast”. It isn’t exactly lying, but the healthy breakfast would do pretty well without the sugar-filled cereal. Explanations really are part of a good learning experience, but need to be complemented by discussion, participation, practice and critique. Explanations are like porridge – healthy, but not a complete breakfast on their own.

Why statistics is so hard to teach

“I’m taking statistics in college next year, and I can’t wait!” said nobody ever!

Not many people actually want to study statistics. Fortunately many people have no choice but to study statistics, as they need it. How much nicer it would be to think that people were studying your subject because they wanted to, rather than because it is necessary for psychology/medicine/biology etc.

In New Zealand, with the changed school curriculum that gives greater focus to statistics, there is a possibility that one day students will be excited to study stats. I am impressed at the way so many teachers have embraced the changed curriculum, despite limited resources, and late changes to assessment specifications. In a few years as teachers become more familiar with and start to specialise in statistics, the change will really take hold, and the rest of the world will watch in awe.

In the meantime, though, let us look at why statistics is difficult to teach.

Students generally take statistics out of necessity.

Statistics is a mixture of quantitative and communication skills.

It is not clear which are right and wrong answers.

Statistical terminology is both vague and specific.

One of the basic procedures, hypothesis testing, is counter-intuitive.

Because the teaching of statistics is comparatively recent, there is little developed pedagogical content knowledge. (Though this is growing)

Technology is forever advancing, requiring regular updating of materials and teaching approaches.

On the other hand, statistics is also a fantastic subject to teach.

Statistics is immediately applicable to life.

It links in with interesting and diverse contexts, including subjects students themselves take.

Studying statistics enables class discussion and debate.

Statistics is necessary and does good.

The study of data and chance can change the way people see the world.

Technlogical advances have put the power for real statistical analysis into the hands of students.

Because the teaching of statistics is new, individuals can make a difference in the way statistics is viewed and taught.

I love to teach. These days many of my students are scattered over the world, watching my videos (for free) on YouTube. It warms my heart when they thank me for making something clear, that had been confusing. I realise that my efforts are small compared to what their teacher is doing, but it is great to be a part of it.

I grew up reading Donald Duck comics. I love the Junior Woodchucks, and their Junior Woodchuck Guidebook. The Guidebook is a small paperback book, containing information on every conceivable subject, including geography, mythology, history, literature and the Rubaiyat of Omar Khayyam. In our family, when we want to know something or check some piece of information, we talk about consulting the Junior Woodchuck Guidebook. (Imagine my joy when I discovered that a woodchuck is another name for a groundhog, the star of my favourite movie!) What we are referring to is the internet, the source of all possible information! Thanks to search engines, there is very little we cannot find out on the internet. And very big thanks to Wikipedia, to which I make an annual financial contribution, as should all who use it and can afford to.

You can learn just about anything on the internet. Problem is, how do you know what is good? And how do you help students find good stuff? And how do you use the internet wisely? And how can it help us as learners and teachers of statistics and operations research? These questions will take more than my usual 1000 words, so I will break it up a bit. This post is about the ways the internet can help in teaching and learning. In a later post I will talk about evaluating resources, and in particular multimedia resources.

Context

Both the disciplines in which I am interested, statistics and operations research, apply mathematical and analytic methods to real-world problems. In statistics we are generally trying to find things out, and in operations research we are trying to make them better. Either way, the context is important. The internet enables students to find background knowledge regarding the context of the data or problem they are dealing with. It also enables instructors to write assessments and exercises that have a degree of veracity to them even if the actual raw data proves elusive. How I wish people would publish standard deviations as well as means when reporting results!

If you are struggling to understand a concept, or to know how to teach or explain it, do a web search. I have found some great explanations, and diagrams especially, that have helped me. Or I have discovered a dearth of good diagrams, which has prompted me to make my own.

Video

Videos can help with background knowledge, with explanations, and with inspiring students with the worth of the discipline. The problem with videos is that it takes a long time to find good ones and weed out the others. One suggestion is to enlist the help of your students. They can each watch two or three videos and decide which are the most helpful. The teacher then watches the most popular ones to check for pedagogical value. It is great when you find a site that you can trust, but even then you can’t guarantee the approach will be compatible with your own.

Social support

I particularly love Twitter, from which I get connection with other teachers and learners, and ideas and links to blogs. I belong to a Facebook group for teachers of statistics in New Zealand, and another Facebook group called “I love Operations Research”. These wax and wane in activity, and can be very helpful at times. Students and teachers can gain a lot from social networking.

Software

There is good open-source software available, and 30-day trial versions for other software. Many schools in New Zealand use the R-based iNZight collection of programmes, which provide purpose-built means for timeseries analysis, bootstrapping and line fitting.

Answers to questions

The other day I lost the volume control off my toolbar. (Windows Vista, I’m embarrassed to admit). So I put in the search box “Lost my volume control” and was directed to a YouTube video that took me step-by-step through the convoluted process of reinstating my volume control! I was so grateful I made a donation. Just about any computer related question can be answered through a search.

Interactive demonstrations

I love these. There are two sites I have found great:

A problem with some of these is the use of Flash, which does not play on all devices. And again – how do we decide if they are any good or not?

On-line textbooks

Why would you buy a textbook when you can get one on-line. I routinely directed my second-year statistical methods for business students to “Concepts and Applications of Inferential Statistics”. I’ve found it just the right level. Another source is Stattrek. I particularly like their short explanations of the different probability distributions.

Practice quizzes

There aren’t too many practice quizzes around for free. Obviously, as a provider of statistical learning materials, I believe quizzes and exercises have merit for practice with immediate and focussed feedback. However, it can be very time-consuming to evaluate practice quizzes, and some just aren’t very good. On the other hand, some may argue that any time students spend learning is better than none.

Live help

There are some places that provide live, or slightly delayed help for students. I got hooked into a very fun site where you earned points by helping students. Sadly I can’t find it now, but as I was looking I found vast numbers of on-line help sites, often associated with public libraries. And there are commercial sites that provide some free help as an intro to their services. In New Zealand there is the StudyIt service, which helps students preparing for assessments in the senior high school years. At StatsLC we provide on-line help as part of our resources, and will be looking to develop this further. From time to time I get questions as a result of my YouTube videos, and enjoy answering them ,unless I am obviously doing someone’s homework! I also discovered “ShowMe” which looks like a great little iPad app, that I can use to help people more.

This has just been a quick guide to how useful the internet can be in teaching and learning. Next week I will address issues of quality and equity.

Khan academy probability videos and exercises aren’t good either

Dear Mr Khan

You have created an amazing resource that thousands of people all over the world get a lot of help from. Well done. Some of your materials are not very good, though, so I am writing this open letter in the hope that it might make some difference. Like many others, I believe that something as popular as Khan Academy will benefit from constructive criticism.

I fear that the reason that so many people like your mathematics videos so much is not because the videos are good, but because their experience in the classroom is so bad, and the curriculum is poorly thought out and encourages mechanistic thinking. This opinion is borne out by comments I have read from parents and other bloggers. The parents love you because you help their children pass tests. (And these tests are clearly testing the type of material you are helping them to pass!) The bloggers are not so happy, because you perpetuate a type of mathematical instruction that should have disappeared by now. I can’t even imagine what the history teachers say about your content-driven delivery, but I will stick to what I know. (You can read one critique here)

Just over a year ago I wrote a balanced review of some of the Khan Academy videos about statistics. I know that statistics is difficult to explain – in fact one of the hardest subjects to teach. You can read my review here. I’ve also reviewed a selection of videos about confidence intervals, one of which was from Khan Academy. You can read the review here.

Consequently I am aware that blogging about the Khan Academy in anything other than glowing terms is an invitation for vitriol from your followers.

However, I thought it was about time I looked at the exercises that are available on KA, wondering if I should recommend them to high school teachers for their students to use for review. I decided to focus on one section, introduction to probability. I put myself in the place of a person who was struggling to understand probability at school.

Here is the verdict.

First of all the site is very nice. It shows that it has a good sized budget to use on graphics and site mechanics. It is friendly to get into. I was a bit confused that the first section in the Probability and Statistics Section is called “Independent and dependent events”. It was the first section though. The first section of this first section is called Basic Probability, so I felt I was in the right place. But then under the heading, Basic probability, it says, “Can I pick a red frog out of a bag that only contains marbles?” Now I have no trouble with humour per se, and some people find my videos pretty funny. But I am very careful to avoid confusing people with the humour. For an anxious student who is looking for help, that is a bit confusing.

So I watched the first video about probability and it was like any other KA video I’ve viewed, with primitive graphics and a stumbling repetitive narration. It was correct enough, but did not take into account any of the more recent work on understanding probability. It used coins and dice. Big yawn. It wastes a lot of time. It was ok. I do like that you have the interactive transcript so you can find your way around.

It dawned on me that nowhere do you actually talk about what probability is. You seem to assume that the students already know that. In the very start of the first video it says,

“What I want to do in this video is give you at least a basic overview of probability. Probability, a word that you’ve probably heard a lot of and you are probably just a little bit familiar with it. Hopefully this will get you a little deeper understanding.”

Later in the video there is a section on the idea of large numbers of repetitions, which is one way of understanding probability. But it really is a bit skimpy on why anyone would want to find or estimate a probability, and what the values actually mean. But it was ok.

The first video was about single instances – one toss of a coin or one roll of a die. Then the second video showed you how to answer the questions in the exercises, which involved two dice. This seemed ok, if rather a sudden jump from the first video. Sadly both of these examples perpetuate the common misconception that if there are, say, 6 alternative outcomes, they will necessarily be equally likely.

Exercises

Then we get to some exercises called “Probability Space” , which is not an enormously helpful heading. But my main quest was to have a go at the exercises, so that is what I did. And that was not a good thing. The exercises were not stepped, but started right away with an example involving two dice and the phrase “at least one of”. There was meant to be a graphic to help me, but instead I had the message “scratchpad not available”. I will summarise my concerns about the exercises at the end of my letter. I clicked on a link to a video that wasn’t listed on the left, called Probability Space and got a different kind of video.

This video was better in that it had moving pictures and a script. But I have problems with gambling in videos like this. There are some cultures in which gambling is not acceptable. The other problem I have is with the term “exact probability”, which was used several times. What do we mean by “exact probability”? How does he know it is exact? I think this sends the wrong message.

Then on to the next videos which were worked examples, entitled “Example: marbles from a bag, Example: Picking a non-blue marble, Example: Picking a yellow marble.” Now I understand that you don’t want to scare students with terminology too early, but I would have thought it helpful to call the second one, “complementary events, picking a non-blue marble”. That way if a student were having problems with complementary events in exercises from school, they could find their way here. But then I’m not sure who your audience is. Are you sure who your audience is?

The first marble video was ok, though the terminology was sloppy.

The second marble video, called “Example: picking a non-blue marble”, is glacially slow. There is a point, I guess in showing students how to draw a bag and marbles, but… Then the next example is of picking numbers at random. Why would we ever want to do this? Then we come to an example of circular targets. This involves some problem-solving regarding areas of circles, and cancelling out fractions including pi. What is this about? We are trying to teach about probablity so why have you brought in some complication involving the area of a circle?

The third marble video attempts to introduce the idea of events, but doesn’t really. By trying not to confuse with technical terms, the explanation is more confusing.

Now onto some more exercises. The Khan model is that you have to get 5 correct in a row in order to complete an exercise. I hope there is some sensible explanation for this, because it sure would drive me crazy to have to do that. (As I heard expressed on Twitter)

What are circular targets doing in with basic probability?

The first example is a circular target one. I SO could not be bothered working out the area stuff so I used the hints to find the answer so I could move onto a more interesting example. The next example was finding the probability of a rolling a 4 from a fair six sided die. This is trivial, but would have been not a bad example to start with. Next question involve three colours of marbles, and finding the probability of not green. Then another dart-board one. Sigh. Then another dart board one. I’m never going to find out what happens if I get five right in a row if I don’t start doing these properly. Oh now – it gave me circumference. SO can’t be bothered.

And that was the end of Basic probability. I never did find out what happens if I get five correct in a row.

Venn diagrams

The next topic is called “Venn diagrams and adding probabilities “. I couldn’t resist seeing what you would do with a Venn diagram. This one nearly reduced me to tears.

As you know by now, I have an issue with gambling, so it will come as no surprise that I object to the use of playing cards in this example. It makes the assumption that students know about playing cards. You do take one and a half minutes to explain the contents of a standard pack of cards. Maybe this is part of the curriculum, and if so, fair enough. The examples are standard – the probability of getting a Jack of Hearts etc. But then at 5:30 you start using Venn diagrams. I like Venn diagrams, but they are NOT good for what you are teaching at this level, and you actually did it wrong. I’ve put a comment in the feedback section, but don’t have great hopes that anything will change. Someone else pointed this out in the feedback two years ago, so no – it isn’t going to change.

This diagram is misleading, as is shown by the confusion expressed in the questions from viewers. There should be a green 3, a red 12, and a yellow 1.

Now Venn diagrams seem like a good approach in this instance, but decades of experience in teaching and communicating complex probabilities has shown that in most instances a two-way table is more helpful. The table for the Jack of Hearts problem would look like this:

Jacks

Not Jacks

Total

Hearts

1

12

13

Not Hearts

3

36

39

Total

4

48

52

(Any teachers reading this letter – try it! Tables are SO much easier for problem solving than Venn diagrams)

But let’s get down to principles.

The principles of instruction that KA have not followed in the examples:

Start easy and work up

Be interesting in your examples – who gives a flying fig about two dice or random numbers?

Make sure the hardest part of the question is the thing you are testing. This is particularly violated with the questions involving areas of circles.

Don’t make me so bored that I can’t face trying to get five in a row and not succeed.

My point

Yes, I do have one. Mr Khan you clearly can’t be stopped, so can you please get some real teachers with pedagogical content knowledge to go over your materials systematically and make them correct. You have some money now, and you owe it to your benefactors to GET IT RIGHT. Being flippant and amateurish is fine for amateurs but you are now a professional, and you need to be providing material that is professionally produced. I don’t care about the production values – keep the stammers and “lellows” in there if you insist. I’m very happy you don’t have background music as I can’t stand it myself. BUT… PLEASE… get some help and make your videos and exercises correct and pedagogically sound.

Dr Nic

PS – anyone else reading this letter, take a look at the following videos for mathematics.

Most students who enrol in an initial course in statistics at university level do so because they have to. I did some research on attitudes to statistics in my entry level quantitative methods course, and fewer than 1% of the students had chosen to be in that course. This is a little demoralising, if you happen to think that statistics is worthwhile and interesting.

Teaching a service course in statistics is one of the great challenges of teaching. A “Service Course” is a course in statistics for students who are majoring in some other subject, such as Marketing or Medicine or Education. For some students it is a terminating course – they will never have to look at a p-value again (they hope). For some students it is the precursor to further applied statistics such as marketing research or biological research. Having said that, statistics for citizens is important and interesting and engaging if taught that way. And we might encourage some students to carry on.

Yet the teachers and textbook writers seem to do their best to remove the joy. Statistics is a difficult subject to understand. Often the way the instructor thinks is at odds with the way the students think and learn. The mathematical nature of the subject is invested with all sorts of emotional baggage.

Here are some of the challenges of teaching a statistics service course.

Limited mathematical ability

It is important to appreciate how limited the mathematical understanding is of some of the students in service courses. In my first year quantitative methods course, I made sure my students knew basic algebra, including rearranging and solving equations. This was all done within a business context. Even elementary algebra was quite a stumbling block to some students, for whom algebra had been a bridge too far at school. There were students in a postgrad course I taught who were not sure which was larger, out of 0.05 and 0.1, and talked about crocodiles with regard to greater than and less than signs. And these were schoolteachers! Another senior maths teacher in that group had been teaching the calculation of confidence intervals, without actually understanding what they were.

I realised a few years ago that I am not a mathematician. I do not have the ability to think in the abstract that is part of a true mathematician. Operations Research was my thing, because I was good at mathematics, but my understanding was concrete. This has been a surprising gift for me as a teacher, as it has meant that I can understand better what the students find difficult. Formulas do not tell them anything. Calculating by hand does not lead to understanding. It is from this philosophy that I approach the production of my videos. I am particularly pleased with my recent video about confidence intervals, which explains the ideas, with nary a formula in sight, but plenty of memorable images.

Software

One of my more constantly accessed posts is Excel, SPSS, Minitab or R?. This consistent interest indicates that the course of software is a universal problem. People are very quick to say how evil Excel is, and I am under no illusions as to many of the shortcomings. The main point of my post was, however, that it depends on the class you are teaching.

As I have taught mainly business students, I still hold that for them, Excel is ideal. Not so much for the statistical aspects, but because they learn to use Excel. Last Saturday the ideas for today’s posts were just forming in my mind when the phone rang, and despite my realising it was probably a telemarketer (we have caller ID on our phone) I answered it. It was a nice young woman asking me to take part in a short survey about employment opportunities for women in the Christchurch Rebuild. After I’d answered the questions, explaining that I was redundant from the university because of the earthquakes and that I had taught statistics, she realised that I had taught her. (This is a pretty common occurrence for me in our small town-city – even when I buy sushi I am served by ex-students). So I asked her about her experience in my course, and she related how she would never have taken the course, but enjoyed it and passed. I asked about Excel, and she told me that she had never realised what you could do with Excel before, and now still used it. This is not an isolated incident. When students are taught Excel as a tool, they use it as a tool, and continue to do so after the course has ended.

When business students learn using Excel, it has the appearance of relevance. They are aware that spreadsheets are used in business. It doesn’t seem like time wasted. So I stand by my choice to use Excel. However if I were still teaching at University, I would also be using iNZight. And if I taught higher levels I would continue to use SPSS, and learn more about R.

Textbooks

As I said in a previous post Statistics Textbooks suck out all the fun. Very few textbooks do no harm. I wonder if this site could provide a database of statistics texts and reviews. I would be happy to review textbooks and include them here. My favourite elementary textbook is, sadly, out of print. It is called “Taking the Fear out of Data Analysis”, by the fabulously named Adamantis Diamantopoulos and Bodo Schlegelmilch. It takes a practical approach, and has a warm, nurturing style. It lacks exercises. I have used extracts from it over the years. The choice of textbook, like the choice of software, is “horses for courses”, but I think there are some horses that should not be put anywhere near a course. I do wonder how many students use textbooks as anything other than a combination lucky charm and paper weight.

In comparison with the plethora of college texts of varying value, at high-school level the pickings for textbooks are thin. This probably reflects the newness of the teaching of statistics at high-school level. A major problem with textbooks is that they are so quickly out of date, and at school level it is not practical to replace class sets too often.

Perhaps the answer is online resources, which can be updated as needed, and are flexible and give immediate feedback. ;-)

Emotional baggage

I was less than gentle with a new acquaintance in the weekend. When asked about my business, I told him that I make on-line materials to help people teach and learn statistics. He proceeded to relate a story of a misplaced use of a percentage as a reason why he never takes any notice of statistics. I have tired of the “Lies, damned lies, and statistics” jibe and decided not to take it lying down. I explained that the world is a better place because of statistical analysis. Much research, including medical would not be possible in the absence of methods for statistical analysis. An understanding of the concepts of statistics is a vital part of intelligent citizenship, especially in these days of big and ubiquitous data.

I stopped at that point, but have pondered since. What is it that makes people so quick to denigrate the worth of statistics? I suspect it is ignorance and fear. They make themselves feel better about their inadequacies by devaluing the things they lack. Just a thought.

This is not an isolated instance. In fact I was so surprised when a lighthouse keeper said that statistics sounded interesting and wanted to know more, that I didn’t really know what to say next! You can read about that in a previous post. Statistics is an interesting subject – really!

But the students in a service course in statistics may well be in the rather large subset of humanity who have yet to appreciate the worth of the subject. They may even have fear and antipathy towards the subject, as I wrote about previously. Anxiety, fear and antipathy for maths, stats and OR.

People are less likely to learn if they have negative attitudes towards the subject. And when they do learn it may well be “learning to pass” rather than actual learning which is internalised.

So what?

Keep the faith! Statistics is an important subject. Keep trying new things. If you never have a bad moment in your teaching, you are not trying enough new things. And when you hear from someone whose life was changed because of your teaching, there is nothing like it!

There is much talk about “flipped classrooms” and the wonders of Khan Academy, YouTube abounds with videos about everything…really! Even television news reports show YouTube clips. Teachers and instructors use videos in their teaching, and get their students to watch them at home, ready to build on in class time. A well put-together video can provide a different way of looking at a problem that helps a student to learn. Videos are endlessly patient and can be paused and watched at the students’ pace. (See my earlier post on multimedia for a fuller discourse on good multimedia.) The problem is: How is a teacher to know what is good and what is not? This seems to be especially difficult in an area like statistics.

I decided this week to see what was on offer and summarise for you all. To narrow it down I chose the topic of Confidence intervals. This topic is pretty universal to statistics courses, and is conceptually tricky. I wanted to see if there was a quick way of working out if a video is any good or not, without having to watch them. I was prepared to suffer so that my readers would not have to.

Videos about confidence intervals are mostly awful

And suffer I did. Not to beat around the bush – many of the videos I watched were awful. There is no other word for it. Not only were they slow, boring, mathematically based, unscripted and unedited, but many of them were just plain wrong. Back in 2008 I went looking for a video about confidence intervals for my students, and realised I had to make my own. It is still true. I do not doubt that the videos are well intentioned. Many of them may have been made (as mine were originally) for a specific class (or family member), and thus were not intended for a larger audience. Maybe those ones should come with disclaimers. “I’m not sure I really know what I am talking about – view at your own risk.”

I put “Confidence Intervals” into the YouTube search engine and examined nine of the top offerings. Mostly I went for the videos with the most views, as this would appear to be a way of filtering out poor material. (wrong again, as you will see) I also included two of my own videos.

Most these videos should not be seen by a wider audience. No – that’s not right – most of them should not be seen – by anyone. The impression they give of statistics is of a bumbling professor talking about formulas and looking up tables. Nothing in them gives a single hint about how interesting, applied and relevant the subject of statistics is. Maybe there needs to be a wikipedia approach to supposedly educational videos to provide quality control.There is just one video other than my own two that I approve of – by Keith Bower. (Biased, I know – feel free to respond.)

A possible way if you wish to find useful videos, might be to get the students to find a video they think is good, then you check that it is correct. Trying to find a good video about statistics is not a good use of your time – unless of course, you just go straight to the Statistics Learning Centre channel or Keith Bower. :-)

If any of you gentle readers have a video you think is worth a second look, please put the link in the comments.

Brief reviews of ten videos on Confidence Intervals in no especial order except that I left the three good ones until last

I started with the videos that appeared at the top. They have paid to be first in the list, so I thought they might be good. As it turns out they are very similar to each other and from the same stable, it seems. I found them lacklustre, though not totally harmful.

1. Statistics – Confidence Intervals
Channel: EducatorVids2 Videos on the channel: 1192
17268 views Loaded 24 Oct 2011 (32 views per day) Duration 3:25

This video and the next one are part of some sort of course. This video seemed to start in the middle of a lecture “Now let’s go on to some examples”. The layout was utilitarian, with a talking head, and a screen showing the working. The video, like most of them, was not scripted. The content was based on a Mathematical example with no context. I don’t really know what she was talking about. But at least it was short.

Very similar format to EducatorVids2. The bulk of the content was around a medical example with 7 subjects. Again it was not scripted. The computation was tedious so that I had to fast forward.

And here is the one you have all been waiting for: Khan Academy. I should know better than to suggest that the mighty Khan is less than perfect (my previous post about KA continues to provoke defensive vitriol.) But here goes:

3. Confidence Interval 1
Khan Academy Videos on the channel: 3492
167, 213 views, loaded 28 Oct 2010 (186 views per day) Duration 14:03
246 likes 20 dislikes.

Like all Khan Academy videos (as far as I know) the format is very simple with a black screen with printed example. Again the video is not scripted, and consists of a lot of repetition as Khan doesn’t like empty air while he is writing.It is actually a lead into confidence intervals, doing a theoretical exercise involving the sampling distribution. Thus it talks about probabilities. It would have been better to entitle it, preparation for confidence intervals, as it doesn’t actually teach about confidence intervals, and includes probability. Khan included steps to using tables to find a t value. This video was really not nice. And it took 14 minutes! That is 14 minutes I will never have again. It is also a long time to find out that it doesn’t actually teach about confidence intervals. This video is one of the worst of the ten I viewed, and has far more views than it ought.

4. Statistics is easy: Confidence Interval
aghasemi4u Videos on the channel: 4 about statistics
a remarkable 296,456 views, loaded 23 March 2007, (136 per day) Duration 5:00
186 likes and 83 dislikes

This video was simple and reasonably well put together, with nice diagrams, but only three slides in its five minutes. The narration is unscripted and uses probability to describe the confidence interval (wrong!). There was a focus on the mathematical formula.

5. . Confidence Intervals
Madonna USI Videos on the channel: 22
18,403 views Uploaded 9 Nov 2009 (15 per day) Duration 9:42
102 likes, 2 dislikes

A brief description of what confidence intervals are as well as a couple examples.Live person working on a whiteboard. Refers to a textbook. Very slow. Definition wrong – Says that we are 95% confident that the value that we found is within the range. I’m hoping this is just a slip of the tongue, but it should have been editted out.

Summarised before and after, which can be tedious. Mathematically based. Slick graphics, but glacially slow in parts. Gives an example with no context. This is not statistics! Tedious. To be fair, there are lots of positive comments, and as the title says “how to calculate confidence intervals” there is no requirement to explain what they are when you get them. The channel is all about “how to calculate” and nothing about context, so I think it is a bit of a misnomer to call it “Statistics is fun”.

7. Confidence Intervals Part1 YouTube Larry Shrewsbury Videos on the channel: 15
82,006 views. Uploaded on 12 Jul 2009 (128 per day) Duration 7:42
136 likes 53 dislikes

Part of an enterprise “Taking the Sadistics out of learning Statistics”
I found the voice irritating as it seems patronising. However some people find my accent distracting, (wot eksent?) so I can’t really be too hard on that. Very formula based, looking at the mathematics rather than the interpretation. The best part was an interesting animation – very nice way of looking at traditional confidence intervals that I hadn’t seen before.

Keith’s presentation isn’t visually exciting, but he is correct and clear and that goes a long way. His is just a talking head – but he is an interesting presenter and very fluent. His video has branching, such that you can click to go to another video if needed. I’ve found all his videos sound and sensible. (I got “p is low, null must go” from one of his videos.)

9. Confidence Intervals in Excel
UCMSCI Videos on the channel
17797 views Loaded 25 Dec 2008 (11 per day)
26 likes 0 dislikes

This was one of my earlier videos. It is scripted with visuals to help in comprehension. It takes the classical approach to confidence intervals and puts emphasis on the idea of level of confidence. Addresses the aspects that affect the width of confidence intervals. Discusses the formula for confidence intervals, and shows how to use Excel to calculate them. (I don’t think I really loaded it on Christmas day! Maybe some strange dateline thing)

This video will disappoint the mathematicians, as there are no formulas. But students love it. The point of the video is to explain what a confidence interval is, and what things affect the size of the interval. It makes use of diagrams and examples to help students understand the concepts. It is tightly scripted and edited with no wasted time. People can always pause if they need to, but it is difficult to speed up a slow presentation.

Epilogue (Obituary?)

And there you have it, folks – there is no easy way to work out which videos are going to be useful for your students without watching them all. Sorry. And if you expect me to go through this again with another topic, you clearly didn’t get the subtext.

“Less is More” is a bit of a funny title for a mathematical blog!

Garlic bread and Ice Cream Sundaes

Back in the seventies, garlic bread became very popular in our household. I loved its buttery, salty, garlicky goodness, and made it quite often. One time I decided that if a little bit of garlic was yummy, then lots of garlic would be even more delicious. I was wrong! The garlic bread was barely edible, and the house and its occupants gave off a distinctive aroma for several days. More garlic did not mean “better”. From then on whenever I used garlic, I would recite in my head “More is not always better.”

Similarly it is fun to see children given a whole range of ice cream flavours, sauces and toppings and watch them create a dessert with EVERYTHING. From experience we know that there are only so many different forms of sugar and fat that should be added to ice cream at one time. If we are smart, we have several bowls, one with chocolate and nuts, one with caramel and crunchy toffee, one with raspberry and biscuit crumbs. That way we can appreciate the different flavours, without having them overridden. Having said that, we then discover that there comes a point of diminishing or even negative returns on investment. The final bowl of ice cream is often regretted.

Enough of food!

“Less is more” applies to teaching

The statement “Less is more” applies to teaching, and particularly subjects like Statistics and Operations Research.

As I learned with the garlic bread, we need to be careful not to give students too much. It is tempting, when developing on-line resources to keep including every possible activity, video and link that is relevant. However we have found that too many activities become overwhelming. It is tempting, as instructors to want to give plenty of practice and every possible resource. We assume that students can pick which items are useful for them. Instead we found that conscientious students want to complete EVERYTHING, and get discouraged when there is so much to do. They possibly don’t need to do all the activities, and waste their time on the easy ones.

We need to be selective about how we use our students’ time. Unless the homework or activity is going to help them learn and accomplish the goals of the course, it should not be there. I am reminded of the hell that was homework for my older son and me when he was going through middle-school. The teacher believed that more homework was better, and the result was misery in our family. Eventually I cried, “Enough!” and arranged an interview with her. I asked her for the specific learning objectives of the “worksheet”, which I know was an unfair question. Clearly the objective of worksheets is to keep the parents of conscientious girls (and the very uncommon conscientious boys) happy because their children were getting homework to do. She never did come up with learning objectives that satisfied me, so William (or rather, I) ceased to do her homework sheets, concentrating instead on times-tables, reading and handwriting. (Or generally nothing at all!)

But I digress. The point is – don’t waste student timeon “busy” work. If students understand the process and internalise a skill after ten examples, then they do not need another ten. I DO believe in drill or practice, but it needs to be well developed and practising the skills we wish students to develop. For example there is no need for students to calculate by hand the standard deviation of ten sets of numbers devoid of context. However there is great value in large numbers of questions getting students to determine which test is appropriate in a given scenario.

If you really want to make more resources, rather than making more tests, provide a larger question bank for the current quizzes. That way students can do the quiz multiple times to achieve mastery, but those who have mastered the material immediately can move on.

We should not teach all we know

And as with the ice cream sundaes, when choosing content, what we leave out is as important than what we put in. We should not attempt to teach all we know. When writing the scripts for my videos I find it is important to stick to the main ideas and get them well explained. Sometimes total accuracy is sacrificed in the interests of comprehensibility. I come back to the dreaded question, “Where do babies come from?”, the answer to which depends enormously on the source of the question and context. Seldom is a full biological explanation required or even desirable.

Leonardo Da Vinci is purported to have said, “Simplicity is the ultimate sophistication.” It is the art of the true teacher to be able to reduce complex ideas into a simple form. Bill Bryson is the master of this. In his book, “A Short History of Nearly Everything”, Bryson puts forth complex ideas in ways that a layperson can understand. This is a skill I seek after as a teacher, and try to use in my videos and resources.

Choosing the statistical test – in simple terms

I was unhappy with the branching diagrams commonly used to teach how to choose a statistical test. I felt that there was a more integrative way to express this that would also help peoples understanding. I came up with quite a different diagram that is featured in our most popular video to date.

The students love it. But there are aspects about the diagram which could be looked at a different way. For example I ask “How many samples?”, and say that an independent samples t-test is used on two separate samples. Really it could also be defined as one sample with two variables – the measurement variable and another variable for group membership. When people are just coming to grips with new ideas, they don’t need to see multiple ways of doing things. If they are at the stage to see the other way of looking at it, they aren’t going to need the diagram.

Another very cool thing Da Vinci said was “Art is never finished, only abandoned.” On that note, I will stop now.

Confidence intervals are needed because there is variation in the world. Nearly all natural, human or technological processes result in outputs which vary to a greater or lesser extent. Examples of this are people’s heights, students’ scores in a well written test and weights of loaves of bread. Sometimes our inability or lack of desire to measure something down to the last microgram will leave us thinking that there is no variation, but it is there. For example we would check the weights of chocolate bars to the nearest gram, and may well find that there is no variation. However if we were to weigh them to the nearest milligram, there would be variation. Drug doses have a much smaller range of variation, but it is there all the same.

You can see a video about some of the main sources of variation – natural, explainable, sampling and due to bias.

When we wish to find out about a phenomenon, the ideal would be to measure all instances. For example we can find out the heights of all students in one class at a given time. However it is impossible to find out the heights of all people in the world at a given time. It is even impossible to know how many people there are in the world at a given time. Whenever it is impossible or too expensive or too destructive or dangerous to measure all instances in a population, we need to take a sample. Ideally we will take a sample that gives each object in the population an equal likelihood of being chosen.

You can see a video here about ways of taking a sample.

When we take a sample there will always be error. It is called sampling error. We may, by chance, get exactly the same value for our sample statistic as the “true” value that exists in the population. However, even if we do, we won’t know that we have.

The sample mean is the best estimate for the population mean, but we need to say how well it is estimating the population mean. For example, say we wish to know the mean (or average) weight of apples in an orchard. We take a sample and find that the mean weight of the apples in the sample is 153g. If we only took a few apples, it is only a rough idea and we might say we are pretty sure the mean weight of the apples in the orchard is between 143g and 163g. If someone else took a bigger sample, they might be able to say that they are pretty sure that the mean weight of apples in the orchard is between 158g and 166g. You can tell that the second confidence interval is giving us better information as the range of the confidence interval is smaller.

There are two things that affect the width of a confidence interval. The first is the sample size. If we take a really large sample we are getting a lot more information about the population, so our confidence interval will be more exact, or smaller. It is not a one-to-one relationship, but a square-root relationship. If we wish to reduce the confidence interval by a factor of two, we will need to increase our sample size by a factor of 4.

The second thing to affect the width of a confidence interval is the amount of variation in the population. If all the apples in the orchard are about the same weight, then we will be able to estimate that weight quite accurately. However, if the apples are all different sizes, then it will be harder to be sure that the sample represents the population, and we will have a larger confidence interval as a result.

Three ways to find confidence intervals

Traditional (old-fashioned?) Approach

The standard way of calculating confidence intervals is by using formulas developed on the assumptions of normality and the Central Limit Theorem. These formulas are used to calculate the confidence intervals of means, proportions and slopes, but not for medians or standard deviations. That is because there aren’t nice straight-forward formulas for these. The formulas were developed when there were no computers, and analytical methods were needed in the absence of computational power.

In terms of teaching, these formulas are straight-forward, and also include the concept of level of confidence, which is part of the paradigm. You can see a video teaching the traditional approach to confidence intervals, using Excel to calculate the confidence interval for a mean.

Rule of Thumb

In the New Zealand curriculum at year 12, students are introduced to the concept of inference using an informal method for calculating a confidence interval. The formula is median +/- 1.5 times the interquartile range divided by the square-root of the sample size. There is a similar formula for proportions.

Bootstrapping

Bootstrapping is a very versatile way to find a confidence interval. It has three strengths:

It can be used to calculate the confidence interval for a large range of different parameters.

It uses ALL the information the sample gives us, rather than the summary values

It has been found to aid in understanding the concepts of inference better than the traditional methods.

There are also some disadvantages

Old fogeys don’t like it. (Just kidding) What I mean is that teachers who have always taught using the traditional approach find it difficult to trust what seems like a hit-and-miss method without the familiar theoretical underpinning.

Universities don’t teach bootstrapping as much as the traditional methods.

The common software packages do not include bootstrap confidence intervals.

The idea behind a bootstrap confidence interval is that we make use of the whole sample to represent the population. We take lots and lots of samples of the same size from the original sample. Obviously we need to sample with replacement, or the samples would all be identical. Then we use these repeated samples to get an idea of the distribution of the estimates of the population parameter. We chop the tails off at a given point, and we give the confidence interval. Voila!

Answers to the disadvantages (burn the straw man?)

There is a sound theoretical underpinning for bootstrap confidence intervals. A good place to start is a previous blog about George Cobb’s work. Either that or – “Trust me, I’m a Doctor!” (This would also include trusting far more knowledgeable people such as Chris Wild and Maxine Pfannkuch, and the team of statistical educators led by Joan Garfield.

We have to start somewhere. Bootstrap methods aren’t used at universities because of inertia. As an academic of twenty years I can say that there is NO PAY OFF for teaching new stuff. It takes up valuable research time and you don’t get promoted, and sometimes you even get made redundant. If students understand what confidence intervals are, and the concept of inference, then learning to use the traditional formulas is trivial. Eventually the universities will shift. I am aware that the University of Auckland now teaches the bootstrap approach.

There are ways to deal with the software package problem. There is a free software interface called “iNZight” that you can download. I believe Fathom also uses bootstrapping. There may be other software. Please let me know of any and I will add them to this post.

In Summary

Confidence intervals involve the concepts of variation, sampling and inference. They are a great way to teach these really important concepts, and to help students be critical of single value estimates. They can be taught informally, traditionally or using bootstrapping methods. Any of the approaches can lead to rote use of formula or algorithm and it is up to teachers to aim for understanding. I’m working on a set of videos around this topic. Watch this space.

This post is about the What and Why of Twitter. I’ll leave more about the “How” for another day. Thanks to Priscilla Allan (whom I met on Facebook) for asking!

Twitter

I love Twitter. Given the choice between Facebook and Twitter, there is no competition. I once read a really good summary (in 140 characters) of the difference, but can’t find it. The essence is that on Facebook we lose friends we’ve had for years, whereas on Twitter we make friends with people we have never met. Twitter was there for me in the year of earthquake aftershock hell that we in Christchurch lived through, and hope is over. When I was woken in the night with my pulse racing by the bed and room rocking, preceded by the sound of a large truck or train barrelling towards me, Twitter, and all my #eqnz friends were there. When I have something to say, I have an audience. No one may be listening – but sometimes they are. I have become a better and more connected educator as a result of Twitter. And often I just lol at the funny things people tweet.

I can imagine much what I just wrote made little sense to the uninitiated, so let me back up a little.

The fundamentals of Twitter explained

For the technical stuff and how Twitter came about, just Google it. Or look at the official Twitter site. To me Twitter is like a river of communication from myriad sources. Anyone can read what anyone writes, in 140 characters or fewer, which is called a Tweet. The river of tweets flows continually, and you use a web application or a phone application to gain access to it. You can use search terms like a fishing net to find any recent tweets about any subject of interest. When you find a person you think is interesting, you click on “Follow”, and from now on anything they tweet will be channelled off automatically to your Home tab. They will also be able to send you a direct message privately. I currently follow 830 people, which gives me a fairly large amount to read. I skim a lot, or just don’t care if I miss something. Often tweets include links, which fortunately are shortened automatically by Twitter so you don’t use up your 140 characters on a really long url. I always tweet when I upload a blogpost.

I started out following app developers, as we had just brought out our game app, Rogo, and had met several developers at an iOS barcamp. I also tried to build a customer base for Rogo using Twitter, but it wasn’t successful. I follow some celebrities until they annoy or bore me and then I stop following them. I like to follow QI because it is funny and informative. I follow Jon Winokur @AdviceToWriters because one day I would like to write something longer and more gripping than a statistics teaching blog. I follow several other bloggers, because what they say interests and inspires me. I follow several news sources because you get the news through Twitter faster than any other way. Sometimes I follow people because I want them to follow me – it often works. I follow some inspirational sites, but generally only for a short time, and then I get bored and unfollow them.

I’ve ended up with a rather eclectic mixture, though generally it has left-leaning tendencies as I unfollow people whom I vehemently disagree with.

Twitter is delightfully democratic. When I bought a Pilates DVD, I tweeted to @suzannecbowen to say how much I liked the stretching part. And she tweeted back. I love it when people tweet to me, particularly in praise of my blog, videos, app or website.

When I post my blog each week, I love seeing that people “Favorite” or “Retweet” my tweet. When they do this it gets the blog out for more people to see.

Twitter for educators

It is good to hear what other people are up to and connect and share ideas. Sometimes a certain time is set aside for a discussion, such as #mathchat. Someone posts a discussion question, and people respond. It can be a bit confusing as the comments pass each other, but also invigorating. Having to concentrate your thoughts into 140 characters is an interesting discipline.

Through #mathchat I found some teacher blogs, which are so inspiring. I’ve also had almost heated exchanges with maths teachers about the nature of statistics in relation to mathematics. Sometimes you can attend a conference vicariously through your twitter friends. I’m looking forward to hearing about the NCTM meeting in April, via the twitter stream.

One blogger describes Twitter as like being at a party with some of the great minds of education, and just listening in.

Here is a link to a handbook on Twitter for teachers:

This morning I asked the following into the ether:

Writing a blogpost about Twitter today. How do you feel about Twitter, as an educator? As a human being? #mathchat#edchat#orms#stem

I got the following responses: Gregory Taylor @mathtans As educator, get to see new perspectives along with some echos of my own struggles. Also see there is life outside teaching.

Also, for both, get to share my interests (and sense of humour) in hope that others find it useful. Teaches being concise too!

Mar 2 Billy Connelly on Route 66. Magic! Maybe one day M, J and I will travel Route 66 and I’ll blog, Mark take photos, Jonathan perform.

I will leave you with some of my favourite tweets from others:

Dan Rockwell @Leadershipfreak All meetings should move from complexity and confusion to simplicity and action. Leading meetings means finding simplicity.

Patrick Honner @MrHonner @RogoNic My question is: how can someone really understand statistics without understanding probability and set theory? I don’t see how.

Renee @approx_normal @RogoNic I love your blogs and videos. I show them in my stats classes all the time. And you have great things to retweet!

Stat Fact @StatFact ‘To understand God’s thoughts we must study statistics, for these are the measure of His purpose.’ — Florence Nightingale

Shaun Dakin @IsCool In one year, GUNS murdered 35 in Australia, 39 in England and Wales, 194 in Germany, 200 in Canada, and 9,484 in the United States.

Quaker Quotes @QuakerQuotes Success is the ability to go from one failure to another with no loss of enthusiasm – Churchill

Sesame Street @sesamestreet Oscar: First day of summer. For most that means it’s time to dust off the ol’ grill. Me? I prefer to leave the dust on!

For people who understand them, graphs tell a story. To the initiated, even a p-value, and some summary statistics can help to tell a story. Part of the role of a statistician is to extract the story from the data. The role of a statistics teacher is to enable students first to recognise that there is a story, then to enable them to tell the story through the tools of analysis and communication.

Time series data, especially seasonal time series data, yields its story abundantly. For this reason I changed my mind about the teaching of time series analysis at high school. I used to think that it was far too complex for high school students and should be left to higher education. In a way that is true, but if you stick to the basic concepts, it is a contextually rich area of study.

Time series data is full of little hazards, not the least being auto-correlation. We can use moving averages to take out the bumps and exponential smoothing to be more responsive to more recent data. We can deseasonalise and fit a trend line, predict and then put the seasonality back in. There are weighty (in more ways than one) volumes dedicated to time series analysis and the various discoveries and inventions that have helped us draw meaning from the past and forecast the future.

Because of the inherent complexity of time series analysis, I used to think that time series was not an appropriate part of the high school curriculum.

However, if a storytelling approach is used, backed up by appropriate software, then time series is a wonderful introduction to statistics. It is a good example of modelling, it has clear purpose, and the contexts can be fascinating.

Time series analysis is a clear example of the concept of a model, as there are so many different ways that it is possible to model a set of time series data. In contrast, when you teach linear regression with only one possible predictor variable, on data that is nicely behaved, there is generally one sensible model to use. This gives students the idea that you are trying to find “the right model”. This is not the case with time series, as models change, depending on how we choose to define the model.

Another selling-point for time series analysis is that its main function is forecasting. We all want to have crystal balls that can predict the future. The main reason we study a time series is to understand the patterns of data so that we can project into the future, usually for economic reasons. There is no question of “Why are we doing this, Miss?”, as the purpose of the analysis is self-evident.

There are numerous economic time series available from official statistics sites. In New Zealand I went to Infoshare and in the US there is Economagic. Some of the series are fascinating. (I like the three peaks per year in jewellery sales in the US – December, February and May.)

Analysis can be difficult, and Excel is hideous for time series graphing and deseasonalising. There has been a free front end for R set up, called iNZight, which enables straight-forward time series analysis. One drawback is that it only allows for one model, which I fear perpetuates the “there is one model” mindset.

But the opportunities for storytelling are there. You can talk about trend, seasonality, variation, the relative contribution of each. As teachers and students are exposed to more and more time series graphs, they are better able to tell stories. The graphs of the seasonal shape are rich with story-telling potential.

To support this we have made four videos about time series analysis, and an app, which is still in the pipeline. We hope that these will help develop the confidence of teachers and students to tell stories about time series data. We also have further quizzes and step-by-step guide to writing up a time series analysis. You can get much of this for free from our Free Resources page on StatsLC.com.

For teachers where there is limited access to computer resources, I have an earlier post with some ideas of how to overcome this problem and emphasise the story in time series data: Teaching Time Series with Limited Computer access.

Understanding Time series analysis

Time Series analysis using iNZight:

How to write up a time series report:

and an example of a time series report (aimed at Year 13 students in New Zealand, but a good general framework for report writing.)