On Tuesday, my good friend Dr Michelle Dalrymple won this year’s Prime Minister’s Science Teacher award. It was so great to be able to fly down to Christchurch with Maxine Pfannkuch to watch the live streaming of the award ceremony with Michelle, her family and her colleagues at Cashmere High School. Michelle was the first mathematics and statistics teacher to win the prize, and it couldn’t have gone to a more deserving teacher!

You can read more about the awesomeness of Michelle in the links below:

- https://www.pmscienceprizes.org.nz/2019-prime-ministers-science-teacher-winner/
- https://www.stuff.co.nz/national/education/121954122/maths-teacher-takes-out-pms-top-science-prize-with-help-from-her-dogs

In her acceptance speech, Michelle thanked me for being her “statistics hero”. Well, turns out she’s also mine and here’s just one example of why!

A year or so after I moved from teaching high school statistics to teaching a very large introductory statistics course, I had a conversation with Michelle where I complained about how much I missed doing the kinds of hands-on interactive activities that are so important for teaching statistics. I told her what I was being told by others at the university level: that you just can’t do those kinds of things with large lectures, there are too many students, it won’t work, things could go wrong, not all the students will want to do this, etc.

Michelle listened to me first and then suggested that I try doing something small initially. She told me about one of her activities – comparing how long it takes to eat M&Ms using a plastic fork vs chopsticks – and suggested doing this with just 10 of my 500 students. She explained that I could ask for volunteers, bring them down to the front of the lecture theatre, record the data live, and then use this within the same lecture. I tried this activity out and it worked brilliantly – just imagine a whole lecture theatre of students cheering on students eating M&M’s!

In her pragmatic way, Michelle helped me remember that there’s always a way to do what you know is best for teaching and learning. Her encouragement and attitude to “make it happen” inspired the first of many interactive activities I have since developed to use in my teaching of intro stats. It’s natural to focus on the limitations that a teaching environment or system presents, especially for very large introductory statistics classes of over 300 students. But what Michelle helped me re-affirm in terms of my teaching approach for “large scale teaching” is that it can be more helpful and rewarding to think of the opportunities that working with such a large group of students offers.

Which is one of the reasons why we (Rhys Jones, Emma Lehrke and I) have set up a new sub blog that focuses specifically on teaching large introductory statistics courses. It’s called “Go big or go home!”. In this blog we will share our experiences with trying to build more interactivity and engagement within our very large lecture-based classes. I know that many people reading this blog are statistics teachers based at the school level, so I haven’t assumed you will want to receive emails about new posts for this sub blog. Check out the Go big or go home! blog if you’re interested in reading more and subscribing to this new blog.

Actually, it’s not a new tool exactly, more a re-working of the existing modelling tool I’ve already shared on this blog, but with a new name and web location – the *probability distribution explorer*!

I developed the *probability distribution explorer* as part of my Masters research into teaching probability distribution modelling. The proposed teaching framework and the tool were developed in response to the use of data for distribution modelling for AS91586, in particular the need for students to demonstrate use of methods related to the *distribution of true probabilities* versus *distribution of model estimates of probabilities* versus *distribution of experimental estimates of probabilities*.

The tool was developed primarily to support comparisons of the “distribution of experimental estimates of probabilities” and “distribution of model estimates of probabilities”. When reviewing research literature, I found limited examples of how to teach this comparison using an informal approach i.e. not using a Chi-square goodness-of-fit test. Consequently, I also found a lack of statistically sound criteria to enable drawing of conclusions in such resources as textbooks, workbooks and assessment exemplars.

This led to my research, which involved a small group of New Zealand high school statistics teachers. Focusing on the Poisson distribution, I investigated the criteria used by ten Grade 12 teachers for informally testing the fit of a probability distribution model. I found that the criteria the teachers currently used were unreliable, as they could not correctly assess model fit; in particular, sample size was not taken into account.

After exploring the goodness-of-fit using my visual inference tool, teachers reported a deeper understanding of model fit. In particular, that the tool had allowed them to take into account sample size when testing the fit of the probability distribution model through the visualisation of expected distributional shape variation. I’ve re-developed the tool this year to support NZQA as they explore opportunities for assessment within a digital environment. A team of teachers are developing prototype assessment activities for AS91586 and these will be trialled with students in schools later in the year.

The video below gives a general introduction to the tool, using data on how many times I say “um” when I’m teaching. The video itself provides another source of data because, um … well, you’ll see if you watch!

More videos, teaching notes and related resources can be found here: stat.auckland.ac.nz/~fergusson/prob_dist_explorer/teachers/

Just a quick post to let you know that the mathstatic.co.nz site is hopefully only temporarily down, and I am working with my hosting company to get it back online ASAP. This affects the random redirect tool, the BYOP sampler tool and the experiment lab page, which will not be available until this gets sorted. I’ll update this post soon with a progress update!

**UPDATE ONE**

It seems the issue is that some overseas dodgy folk have been using the random redirect tool for fraudulent things like phishing scams. So, I’m going to restrict the URLs that can be used – which means analysis time to identify which sites/URL patterns to accept e.g. Google forms, Survey Monkey etc.
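For what it’s worth, the kind of domain allowlist check described here can be sketched in a few lines of Python. The domains below are placeholders for illustration, not the actual approved list:

```python
from urllib.parse import urlparse

# Hypothetical allowlist of acceptable redirect domains (illustrative only)
APPROVED_DOMAINS = {"docs.google.com", "forms.gle", "www.surveymonkey.com"}

def is_approved(url):
    """Accept a redirect target only if its domain is on the approved list."""
    host = urlparse(url).netloc.lower()
    return host in APPROVED_DOMAINS

print(is_approved("https://docs.google.com/forms/d/e/abc/viewform"))  # True
print(is_approved("https://totally-legit-phishing.example/login"))    # False
```

Checking the parsed domain against an explicit allowlist, rather than pattern-matching the whole URL, avoids tricks like `docs.google.com.evil.example`.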

**UPDATE TWO**

mathstatic.co.nz is back up and running! It probably was a couple of hours ago, but I have been rewriting the code that processes the random redirect requests. Below are the main changes to the random redirect tool to better prevent issues in the future.

*Due to abuse of this tool by dodgy folk, **only links with domains on the approved list will now be accepted!** Please **complete this form** to request a domain to be added to the approved list, but don’t expect any new additions to happen any time soon (this is a free tool, remember, and was created for simple classroom-based randomised experiments with Google forms).*

*Any random redirect URLs created using this tool can be disabled at any time. If this has happened to you and you are a legitimate teacher, educator or researcher, then send me an email and I might be able to help you.*

**UPDATE THREE**

After emailing me this morning to say everything was sorted with mathstatic.co.nz, my webhosting company then decided to set my site to “maintenance” mode this afternoon and remove some crucial code used to redirect the URLs to the right locations on my website. I’m now trying to get things reset back to the way they were.

**UPDATE FOUR**

Well, I had been meaning to retire the old mathstatic.co.nz website anyway! I’m not sure when mathstatic will be online again, so:

- You can now set up random redirects here: https://allocate.monster (this will be the place to do this from now on!)
- You can access the BYOP sampler here: https://statistics-is-awesome.org/BYOP (this will be the place to do this from now on!)
- At some point soon I’ll move the experiment lab to https://learning.statistics-is-awesome.org but not today, so for now it is unavailable.

I think that’s everything. If there is something else not working, then please let me know!

Today, at a teaching and learning symposium, I demonstrated some in-class interactive activities that I had developed for my super large intro statistics lectures. I’ve shared a summary of the activities and the data below.

If you haven’t already, check out learning.statistics-is-awesome.org/different_strokes/, where you can sample some cat (and other) drawings and learn more about how people draw in the Google game Quick, Draw!

I also get students to draw things in class and use their drawings as data. Below are all the drawings of cats made from the demonstration today, and also from the awesome teachers who helped me out last night. If you click/touch and hold a drawing you will be able to drag it around. How many different ways can you sort the drawings into groups?

It was awesome to be back in Wellington, as not only did I complete a BMus/BSc double degree at Victoria University, I actually taught music at Hutt Valley High School (the venue for the conference) while I was training to become a high school teacher (in maths/stats and music). I didn’t talk much in my keynote about the relationship between music and data analysis, but I did describe my thoughts a few years ago (see below):

All music has some sort of structure sitting behind it, but the beauty of music is in the variation. When you learn music, you learn about key ideas and structures, but then you get to hear how these same key ideas and structures can be used to produce so many different-sounding works of art. This is how I think we need to help students learn statistics – minimal structure, optimal transfer, maximal experience. Imagine how boring it would be if students learning music only ever listened to Bach.

https://www.stat.auckland.ac.nz/en/about/news-and-events-5/news/news-2017/2017/08/the-art-of-teaching-statistics-to-teenagers.html

Due to some unforeseen factors, I ended up Zooming my slides from one laptop at the front of the hall to another laptop in the back room which was connected to the data projector. Since I was using Zoom, I decided to record my talk. However, the recording is not super awesome due to not really thinking about the audio side of things (ironically). If you want to try watching the video, I’ve embedded it below:

You can also view the slides here: bit.ly/followthedataNZAMT. I’m not sure they make a whole lot of sense by themselves, so here’s a quick summary of some of what I talked about:

- Currently, we pretty much choose data to match the type of analysis we want to teach, and then “back fit” the investigative problem to this analysis. This is not totally a bad thing, we do it in the hope that when students are out there in the real world, they think about all the analytical methods they’ve learned and choose the one that makes sense for the thing they don’t know and the data they have to learn from. But, there’s a whole lot of data out there that we don’t currently teach students about how to learn from, which comes from the computational world our students live in. If we “follow the data” that students are interacting with, what “new” ways of thinking will our students need to make sense of this data?
- Album covers are a form of data, but how do we take something we can see visually and turn this into “data”? For the album covers I used from one week of 1975 and one week of 2019, we can see that the album covers from 1975 are not as bright and vibrant as those from 2019, similarly we can see that people’s faces feature more in the 1975 album covers. We could use the image data for each album cover, extract some overall measure of colour and use this to compare 1975 and 2019. But what measure should we use? What is luminosity, saturation, hue, etc.? How could we overfit a model to predict the year of an album cover by creating lots of super specific rules? What pre-trained models can we use for detecting faces? How are they developed? How well do they work? What’s this thing called a “confusion matrix”?
- An intended theme across my talk was to compare what humans can do (and to start with this), with what we could try to get computers to do, and also to emphasise how important human thinking is. I showed a video of Joy Buolamwini talking about her Gender Shades project and algorithmic bias: https://www.youtube.com/watch?v=TWWsW1w-BVo and tried to emphasise that we can’t teach about fun things we can do with machine learning etc. without talking about bias, data ethics, data ownership, data privacy and data responsibility. In her video, Joy uses faces of members of parliament – did she need permission to use these people’s faces for her research project since they were already public on websites? What if our students start using photos of our faces for their data projects?
- I played the song that was number one the week I was born (tragedy!) as a way to highlight the calendar feature of the nztop40 website – as long as you were born after 1975, you can look up your song too. Getting students to notice the URL and how it changes as you navigate a web page is a useful skill – in this case, if you navigate to different chart weeks, you can notice that the “chart id” number changes. We could “hack” the URL to get the chart data for different weeks of the years available. If the website terms and conditions allow us, we could also use “web scraping” to automate the collection of chart data from across a number of weeks. We could also set up a “scheduler” to copy the chart data as it appears each week. But then we need to think about what each row in our super data set represents and what visualisations might make sense to communicate trends, features, patterns etc. I gave an example of a visualisation of all the singles that reached number one during 2018, and we discussed things I had decided to do (e.g. reversing the y axis scale) and how the visualisation could be improved [data visualisation could be a whole talk in itself!!!]
- There are common ways we analyse music – things like key signature, time signature, tempo (speed), genre/style, instrumentation etc. – but I used one that I thought would not be too hard to teach during the talk: whether a song is in the major or minor key. However, listening to music first was really just a fun “gateway” to learn more about how the Spotify API provides “audio features” about songs in its database, in particular supervised machine learning. According to Spotify, the Ed Sheeran song *Beautiful People* is in the minor key, but me and guitar chords published online think that it’s in the major key. What’s the lesson here? We can’t just take data that comes from a model as being the truth.
- I also wanted to talk more about how songs make us feel, to extend thinking about the modality of the song (major = happy, minor = sad), to the lyrics used in the song as well. How can we take a set of lyrics for a song and analyse these in terms of overall sentiment – positive or negative? There’s lots of approaches, but a common one is to treat each word independently (“bag of words”) and to use a pre-existing *lexicon*. The slides show the different ways I introduce this type of analysis, but the important point is how common it is to transfer a model trained within one data context (for the *bing* lexicon, customer reviews online) and use it for a different data context (in this case, music lyrics). There might just be some issues with doing this though!
- Overall, what I tried to do in this talk was not to showcase computer programming (coding) and mathematics, since often we make these things the “star attraction” in talks about data science education. The talk I gave was totally “powered by code” but do we need to start with code in our teaching? When I teach statistics, I don’t start with pulling out my calculator! We start with the data context. I wanted to give real examples of ways that I have engaged and supported all students to participate in learning data science: by focusing on what humans think, feel and see in the modern world first, then bringing in (new) ways of thinking statistically and computationally, and then teaching the new skills/knowledge needed to support this thinking.
- We have an opportunity to introduce data science in a real and meaningful way at the school level, and we HAVE to do this in a way that allows ALL students to participate – not just those in enrichment/extension classes, coding clubs, and schools with access to flash technology and gadgets. While my focus is the senior levels (Years 11 to 13), the modern world of data gives so many opportunities for integrating statistical and computational thinking to learn from data across all levels. We need teachers who are confident with exploring and learning from modern data, and we need new pedagogical approaches that build on the effective ones crafted for statistics education. We need to introduce computational thinking and computer programming/coding (which are not the same things!) in ways that support and enrich statistical thinking.
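As one concrete illustration of the “bag of words” approach from the lyrics discussion, here’s a minimal Python sketch using a tiny hand-made lexicon. The words and scores below are made up for the example (a real lexicon like *bing* has thousands of entries):

```python
import re

# A tiny hand-made lexicon standing in for a real one:
# word -> +1 for positive, -1 for negative (illustrative only)
LEXICON = {"beautiful": 1, "love": 1, "happy": 1,
           "sad": -1, "lonely": -1, "cry": -1}

def sentiment_score(lyrics):
    """Bag-of-words sentiment: treat each word independently,
    sum its lexicon score, and ignore words not in the lexicon."""
    words = re.findall(r"[a-z']+", lyrics.lower())
    return sum(LEXICON.get(w, 0) for w in words)

print(sentiment_score("I love beautiful people"))   # 2
print(sentiment_score("So sad and lonely, I cry"))  # -3
```

Note how much context this throws away: “not happy” scores as positive because each word is scored independently, which is exactly the kind of issue worth discussing with students when a model built for customer reviews is transferred to song lyrics.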

If you are a NZ-based teacher, and you are interested in learning more about teaching data science, then please use the “sign-up” form at undercoverdata.science (the “password” is *datascience4everyone*). I’ll be sending out some emails soon, probably starting with learning more about APIs (for an API in action, check out learning.statistics-is-awesome.org/popularity_contest/ ).

I’m pretty excited about the talks and workshops I’m doing over the next month or so! Below are the summaries or abstracts for each talk/workshop and when I get a chance I’ll write up some of the ideas presented in separate posts.

**Keynote: Searching for meaningful sampling in apple orchards, YouTube videos, and many other places!** (AMA, Auckland, September 14, 2019)

In this talk, I shared some of my ideas and adventures with developing more meaningful learning tasks for sampling. Using the “Apple orchard” exemplar task, I presented some ideas for “renovating” existing tasks and then introduced some new opportunities for teaching sample-to-population inference in the context of modern data and associated technologies. I shared a simple online version of the apple orchard and also talked about how my binge watching of DIY YouTube videos led to my personal (and meaningful) reason to sample and compare YouTube videos.

**Workshop: Expanding your toolkit for teaching statistics** (AMA, Auckland, September 14, 2019)

In this workshop, we explored some tools and apps that I’ve developed to support students’ statistical understanding. Examples were: an interactive dot plot for building understanding of mean and standard deviation, a modelling tool for building understanding of distributional variation, tools for carrying out experiments online and some new tools for collecting data through sampling.

The slides for both the keynote and workshop are embedded below:

**Talk: Introducing high school statistics teachers to code-driven tools for statistical modelling** (VUW/NZCER, Wellington, September 30, 2019)

**Abstract:** The advent of data science has led to statistics education researchers re-thinking and expanding their ideas about tools for teaching and learning statistical modelling. Algorithmic methods for statistical inference, such as the randomisation test, are typically taught within NZ high school classrooms using GUI-driven tools such as VIT. A teaching experiment was conducted over three five-hour workshops with six high school statistics teachers, using new tasks designed to blend the use of both GUI-driven and code-driven tools for learning statistical modelling. Our findings from this exploratory study indicate that teachers began to enrich and expand their ideas about statistical modelling through the complementary experiences of using both GUI-driven and code-driven tools.

**Keynote: Follow the data** (NZAMT, Wellington, October 3, 2019)

**Abstract:** Data science is transforming the statistics curriculum. The amount, availability, diversity and complexity of data that are now available in our modern world require us to broaden our definitions and understandings of what data is, how we can get data, how data can be structured and what it means to teach students how to learn from data. In particular, students will need to integrate statistical and computational thinking and to develop a broader awareness of, and practical skills with, digital technologies. In this talk I will demonstrate how we can *follow the data* to develop new learning tasks for data science that are inclusive, engaging, effective, and build on existing statistics pedagogy.

**Workshop: Just hit like! Data science for everyone, including cats (and maybe dogs)** (NZAMT, Wellington, October 2, 2019)

**Abstract:** Data science is all about integrating statistical and computational thinking with data. In this hands-on workshop we will explore a collection of learning tasks I have designed to introduce students to the exciting world of image data, measures of popularity on the web, machine learning, algorithms, and APIs. We’ll explore questions such as “Are photos of cats or dogs more popular on the web?”, “What makes a good black and white photo?”, “How can we sort photos into a particular order?”, “How can I make a cat selfie?” and many more. We’ll use familiar statistics tools and approaches, such as data cards, collaborative group tasks and sampling activities, and also try out some new computational tools for learning from data. Statistical concepts covered include features of data distributions, informal inference, exploratory data analysis and predictive modelling. We’ll also discuss how each task can also be extended or adapted to focus on specific aspects and levels of the statistics curriculum. Please bring along a laptop to the workshop.

I’m also presenting a workshop at NZAMT with Christine Franklin on what makes a good statistical task. I’ve been assisting Maxine Pfannkuch and members of the NZSA education committee to set up a new teaching journal, which we will be launching at the workshop!!

Since I already had a tool that creates data cards from the *Quick, Draw!* data set, I’ve created a prototype for the kind of tool that would support this approach using the same data set.

I’ve written about the Quick, Draw! data set already:

- http://teaching.statistics-is-awesome.org/cat-and-whisker-plots-sampling-from-the-quick-draw-dataset/
- http://teaching.statistics-is-awesome.org/you-say-data-i-say-data-cards/
- http://teaching.statistics-is-awesome.org/the-power-of-pixels-modelling-with-images/

For this new tool, called different strokes, users sort drawings into two or more groups based on something visible in the drawing itself. Since you have to drag the drawings around to manually “classify” them, the larger the sample you take, the longer it will take you.

There’s also the novelty and creativity of being able to create your own rules for classifying drawings. I’ll use cats for the example below, but from a teaching and assessment perspective there are SO many drawings of so many things and so many variables with so many opportunities to compare and contrast what can be learned about how people draw in the *Quick, Draw!* game.

Here’s a precis of the kinds of questions I might ask myself to explore the general question **What can we learn from the data about how people draw cats in the Quick, Draw! game?**

- Are drawings of cats more likely to be heads only or the whole body? [I can take a sample of cat drawings, and then sort the drawings into heads vs bodies. From here, I could bootstrap a confidence interval for the population proportion].
- Is how someone draws a cat linked to the game time? [I can use the same data as above, but compare game times by the two groups I’ve created – head vs bodies. I could bootstrap a confidence interval for the difference of two population means/medians]
- Is there a relationship between the number of strokes and the pause time for cat drawings? [And what do these two variables actually measure – I’ll need some contextual knowledge!]
- Do people draw dogs similarly to cats in the *Quick, Draw!* game? [I could grab new samples of cat and dog drawings, sort all drawings into “heads” or “bodies”, and then bootstrap a confidence interval for the difference of two population proportions]
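The bootstrap idea in the first question can be sketched in a few lines of code. This is an illustrative Python sketch with made-up counts (38 “heads only” out of 60 cat drawings), not real *Quick, Draw!* data:

```python
import random

random.seed(2019)

# Hypothetical sample: 1 = "head only" cat drawing, 0 = "whole body"
sample = [1] * 38 + [0] * 22  # 38 heads out of 60 drawings (made up)

def bootstrap_ci(data, reps=5000, level=0.95):
    """Percentile bootstrap confidence interval for a proportion:
    resample with replacement, recompute the proportion each time,
    then read off the middle `level` of the resampled proportions."""
    n = len(data)
    props = sorted(sum(random.choices(data, k=n)) / n for _ in range(reps))
    lower = props[int(((1 - level) / 2) * reps)]
    upper = props[int((1 - (1 - level) / 2) * reps) - 1]
    return lower, upper

lower, upper = bootstrap_ci(sample)
print(f"Sample proportion: {sum(sample) / len(sample):.2f}")
print(f"95% bootstrap CI: ({lower:.2f}, {upper:.2f})")
```

The same resampling machinery covers the other questions too, just with a different statistic each time (difference of means, difference of proportions, etc.).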

**Check out the tool and explore for yourself here: ** http://learning.statistics-is-awesome.org/different_strokes/

A little demo of the tool in action!

Here’s a scenario. You buy a jumbo bag of marshmallows that contains a mix of pink and white colours. Of the 120 in the bag, 51 are pink, which makes you unhappy because you prefer the taste of pink marshmallows.

Time to write a letter of complaint to the company manufacturing the marshmallows?

The thing we work so hard to get our statistics students to believe is that there’s this crazy little thing called chance, and it’s something we’d like them to consider for situations where random sampling (or something like that) is involved.

For example, let’s assume the manufacturing process overall puts equal proportions of pink and white marshmallows in each jumbo bag. This is not a perfect process, there will be variation, so we wouldn’t expect exactly half pink and half white for any one jumbo bag. But how much variation could we expect? We could get students to flip coins, with each flip representing a marshmallow, and heads representing white and tails representing pink. We then can collate the results for 120 marshmallows/flips – maybe the first time we get 55 pink – and discuss the need to do this process again to build up a collection of results. Often we move to a computer-based tool to get more results, faster. Then we compare what we observed – 51 pink – to what we have simulated.
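The coin-flip simulation described above can also be run in code once the physical flipping has done its job. A minimal Python sketch, assuming the 50% pink model (the rep count is arbitrary):

```python
import random

random.seed(1)
BAG_SIZE = 120
OBSERVED_PINK = 51

# Simulate many bags under the model "each marshmallow is pink with
# probability 0.5", and count how often a simulated bag is at least
# as far from the expected 60 pink as the observed 51 pink.
reps = 10000
extreme = 0
for _ in range(reps):
    pink = sum(random.random() < 0.5 for _ in range(BAG_SIZE))
    if abs(pink - BAG_SIZE / 2) >= abs(OBSERVED_PINK - BAG_SIZE / 2):
        extreme += 1

print(f"Proportion of simulated bags at least as extreme as 51 pink: {extreme / reps:.2f}")
```

Results like 51 pink out of 120 turn out to be fairly unremarkable under the 50% model, which is the whole point of the chance conversation: no letter of complaint required.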

I use these kinds of activities with my students, but I wanted something more so I made a very simple app earlier this year. You can find it here: learning.statistics-is-awesome.org/threethings/. You can only do three things with it (in terms of user interactions) but in terms of learning, you can do way more than three things. Have a play!

In particular, you can show that models other than 50% (for the proportion of pink marshmallows) can also generate data (simulated proportions) consistent with the observed proportion. So, not being able to reject the model used for the test (50% pink) doesn’t mean the 50% model is the **one true thing**. There are others. Like I told my class – just because my husband and I are compatible (and I didn’t reject him), doesn’t mean I couldn’t find another husband similarly compatible.
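To see this in code rather than in the app, we can run the same style of simulation under several different models. A hedged Python sketch (the rep counts and model values are arbitrary, and this is not the app’s actual code):

```python
import random

random.seed(7)
BAG_SIZE, OBSERVED_PINK = 120, 51

def tail_proportion(p, reps=10000):
    """Proportion of simulated bags (pink probability p) that land at
    least as far from the model's expected count as the observed 51."""
    expected = BAG_SIZE * p
    hits = 0
    for _ in range(reps):
        pink = sum(random.random() < p for _ in range(BAG_SIZE))
        if abs(pink - expected) >= abs(OBSERVED_PINK - expected):
            hits += 1
    return hits / reps

# Several different models generate data consistent with 51 pink:
for p in (0.40, 0.45, 0.50):
    print(f"model p = {p}: tail proportion = {tail_proportion(p):.2f}")
```

Models of 40%, 45% and 50% pink all produce simulated results like the observed 51 reasonably often, so none of them can be ruled out by this one bag: not rejecting 50% doesn’t crown it the one true model.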

*Note: **The app is in terms of percentages, because that aligns to our approach with NZ high school students when using and interpreting survey/poll results. However, I first use counts for any introductory activities before moving to percentages, as demonstrated with this marshmallow example. The app rounds percentages to the closest 1% to keep the focus on key concepts rather than focusing on (misleading) notions of precision. I didn’t design it to be a tool for conducting formal tests or constructing confidence intervals, more to support the reasoning that goes with those approaches.*

If you’ve been keeping track of my various talks & workshops over the last year or so, you will have noticed that I’ve become a little obsessed with analysing images (see power of pixels and/or read more here). As part of my PhD research, I’ve been using images to broaden students’ awareness of what is data, and data science, and it’s been so much fun!

If you’re in the Auckland area next week, you could come along to a workshop I’m running for R-Ladies and have some fun yourself using the statistical programming language R to explore images. The details for the workshop and how to sign up are here: https://www.meetup.com/rladies-auckland/events/255112995/

This is not a teaching-focused workshop, it’s more about learning fun and cool things you can do with images, like making GIFs like the one below….

…. and other cool things, like classifying photos as cats or dogs, or finding the most similar drawing of a duck!

It will be at an introductory level, and you don’t need to be a “lady” to come along, just supportive of gender diversity in the R community (or more broadly, data science)! **If you’ve never used R before, don’t worry – just bring yourself along with a laptop and we’ll look after you!**

This long weekend (in Auckland anyway!), I spent some time updating the Quick, Draw! sampling tool (read more about it here: Cat and whisker plots: sampling from the Quick, Draw! dataset). You may need to clear your browser cache/data to see the most recent version of the sampling tool.

One of the motivations for doing so was a visit to my favourite kind of store – a stationery store – where I saw (and bought!) this lovely gadget:

It’s a circle punch with a 2″/5 cm diameter. When I saw it, my first thought was “oh cool I can make dot-shaped data cards”, like a normal person right?

Using data cards to make physical plots is not a new idea – see censusatschool.org.nz/resource/growing-scatterplots/ by Pip Arnold for one example:

But I haven’t seen dot-shaped ones yet, so this led me to re-develop the Quick, Draw! sampling tool to be able to create some!

I was also motivated to work some more on the tool after the fantastic Wendy Gibbs asked me at the NZAMT (New Zealand Association of Mathematics Teachers) writing camp if I could include variables related to the times involved with each drawing. I suspect she has read this super cool post by Jim Vallandingham (while you’re at his site, check out some of his other cool posts and visualisations) which came out after I first released the sampling tool and compares strokes and drawing/pause times for different words/concepts – including cats and dogs!

So, with the Quick, Draw! sampling tool you can now get the following variables for each drawing in the sample:

The drawing and pause times are in seconds. The drawing time captures the time taken for each stroke from beginning to end and the pause time captures all the time between strokes. If you add these two times together, you will get the total time the person spent drawing the word/concept before either the 20 seconds was up, or Google tried to identify the word/concept. Below the word/concept drawn is whether the drawing was correctly recognised (true) or not (false).

I also added three ways to use the data cards once they have been generated using the sampling tool (scroll down to below the data cards). You can now:

- download a PDF version of the data cards, with circles the same size as the circle punch shown above (2″/5cm)
- download the CSV file for the sample data
- show the sample data as a HTML table (which makes it easy to copy and paste into a Google sheet for example)

In terms of options (2) and (3) above, I had resisted making the data this accessible in the previous version of the sampling tool. One of the reasons for this is because I wanted the drawings themselves to be considered as data, and as a human would be involved in developing this variable, there was a need to work with just a sample of all the millions of drawings. I still feel this way, so I encourage you to get students to develop at least one new variable for their sample data that is based on a feature of the drawing. For example, whether the drawing of a cat is the face only, or includes the body too.

There are other cool things possible to expand the variables provided. Students could create a new variable by adding **drawing_time** and **pause_time** together. They could also create a variable which compares **number_strokes** to **drawing_time** e.g. average time per stroke. Students could also use the **day_sketched** variable to classify sketches as weekday or weekend drawings. Students should soon find that the **hemisphere** variable is not that useful for comparisons, so could explore another country-related classification like continent. More advanced manipulations could involve working with the time stamps, which are given for all drawings using UTC time. This has consequences for the variable **day_sketched** as many countries (and places within countries) will be behind or ahead of the UTC time.
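Once you have the CSV data, these derived variables take only a line or two each. A small illustrative Python sketch with made-up values (the field names follow the variables above):

```python
# Hypothetical rows from a Quick, Draw! sample; values are made up
drawings = [
    {"word": "cat", "number_strokes": 8,  "drawing_time": 4.2,
     "pause_time": 2.1, "day_sketched": "Saturday"},
    {"word": "cat", "number_strokes": 12, "drawing_time": 6.0,
     "pause_time": 3.5, "day_sketched": "Tuesday"},
]

for d in drawings:
    # total time spent before the 20 seconds was up or Google guessed
    d["total_time"] = d["drawing_time"] + d["pause_time"]
    # average drawing time per stroke
    d["time_per_stroke"] = d["drawing_time"] / d["number_strokes"]
    # weekday vs weekend classification (UTC caveat applies!)
    d["weekend"] = d["day_sketched"] in ("Saturday", "Sunday")

print(round(drawings[0]["total_time"], 1))       # 6.3
print(round(drawings[1]["time_per_stroke"], 2))  # 0.5
```

The same manipulations work just as well in a spreadsheet or in R; the point is that each new variable is a simple function of the ones the tool already provides.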

**If you’ve made it this far in the post… why not play with a little R!**

I wonder which common household pet Quick!drawers tend to use the most strokes to draw? Cats, dogs, or fish?

Have a go at modifying the R code below, using the iNZightPlots package by Tom Elliott and my [very-much-in-its-initial-stages-of-development] iNZightR package, to see what we can learn from the data! If you’re feeling extra adventurous, why not try modifying the code to explore the relationship between number of strokes and drawing time!
