This post is about a prediction model I’m working on that aims to predict the gender of a teacher based on a few questions.
On World Statistics Day, I posted about a survey for high school teachers. Thanks to all the teachers who completed the survey! There is not quite enough data to share yet but I have used the data to make a start on a prediction model.
The plan is that, after asking you some questions, I can use your answers to predict your gender. Share with your teacher friends and let them be impressed with statistics too!
Once I get more data, I will “share the secrets” including the data so you can use this context for some exploratory data analysis of your own with your students.
This post features a quiz asking you to predict the gender of someone based on what they write about themselves on a dating website.
How does the quiz work?
You will be given 20 different “About me” descriptions taken from public profiles displayed on a dating website. For each description, you will need to select whether you think this description is from a male or female profile. Note: Your choices will be recorded and this data may be used for a future post. No personal information about you will be recorded.
Time limit: 0
0 of 20 questions completed
For this quiz, you will be given 20 different “About me” descriptions taken from public profiles displayed on a dating website. For each description, you will need to select whether you think this description is from a male or female profile.
You have already completed the quiz before. Hence you can not start it again.
Quiz is loading...
You must sign in or sign up to start the quiz.
You have to finish following quiz, to start this quiz:
0 of 20 questions answered correctly
Time has elapsed
You have reached 0 of 0 points, (0)
Question 1 of 20
I am loyal, trustworthy, honest laid back, logical, clean, dd free, can iron, clean and cook. Even tempered confident, long suffering and spontaneous. Motivated kind and an old fashioned romantic.
Question 2 of 20
Im one of a kind, quiet but understanding. like cats and guitars and 70-80s music. Easy on the eye.
Question 3 of 20
I am a fun, outgoing kinda person who loves to travel and am passionate about youth work in New Zealand.
Question 4 of 20
Im fun, outgoing, love good times love to travel and party. especially love the Irish and Americans, they have neat acsents.
Question 5 of 20
Fun loving and playful looking for like minded no strings attached if your keen leave a message and we can have some fun.
Question 6 of 20
I am an easy going person who enjoys the simple things in life like good company, sunshine, nature, nice food and fun exercise. To relax I like to go on walks and to have outdoor adventures.
Question 7 of 20
Hi I down to earth person. I love cracking jokes and such. Love sight seeing and going to the cinemas. I really don’t like smoking just don’t like the it. But I love drinking a bit.
Question 8 of 20
Im loyal trustworthy and kind. I will do anything I can to make you laugh and smile and cheer you up when you are down.
Question 9 of 20
Hi, I’m interested to meet people willing to share ideas and experiences with. I like free and frank communication.
Question 10 of 20
My friends would say I’m loyal, fun and a bit quirky… .I like food, sports, walking, pets, music and much more!!
Question 11 of 20
I am a honest caring trusting loving person I enjoy making people happy and laugh helping people support my friends
Question 12 of 20
I’m kind and sincere person. I try to see positive moments in each situation. My motto is “everything that doesn’t kill us, makes us stronger!”. Sometimes I am inquisitive like a child.
Question 13 of 20
I am honest, loyal, faithful, trusting, kind, caring, loving and fun, with compassionate and generous nature. Some time I have a wild adventurous side of me, I have a wonderful sense of humor.
Question 14 of 20
I like going to the beach ,pools ,bowling , I’ve skating and going to the movies in my spare time . I like reading manga and watching anime and movies and am into watching TV…
Question 15 of 20
I am a calm, honest and open minded person. I like to be happy and make others happy as much as i can.
Question 16 of 20
Kind, caring, animal lover who loves to travel and explore new things. Not keen on pubs , prefer a nice barbeque instead. Intermittently reading when time allows… .
Question 17 of 20
I’m a sociable, outgoing, fun loving and easy going person, looking for someone to share adventures with. It’s sometimes difficult to meet new people and I thought this would be a great way to find…
Question 18 of 20
Im fun type of person love movies, sports, and want to try new things. im honest, kind and respectful, hope I will find my match here.
Question 19 of 20
Oh I hate these things… I’m outgoing… Energetic. Into fitness… Healthy eating… Love to get out kayaking or out walking/running my dogs… Want to try new things in life… Love…
Question 20 of 20
Like to make new freinds pretty easy going chill with freinds and family very understanding have a warm heart like to help out.
I haven’t heard anything from anyone with any problems, and there seems to be a bit of traffic to the challenge page, so hopefully this is going well. I’ll allow checking of the first list of reserved words tomorrow. Students should put in what they predict the readability score will be for each word. These predicted scores will be checked against the actual readability scores and students will be given an overall result e.g. 85%. Oh, and just because you’re a teacher too you’ll get this idea for an investigative question/problem……. How long does it take a student to submit a swear word into a text analysis tool?
Related “reading themed” statistical investigation ideas
Check out http://josephrocca.com/randomsentence/ where you can generate “random” sentences from books that are no longer under U.S.A. copyright restrictions – so books generally published before the early 20th century. You could compare the process for random sampling sentences from digital books to processes for random sampling sentences from physical books (so much here with different sampling methods). You could give students an actual physical book and challenge them to estimate the total word count (check using the digital version!), or get students to devise a way to compare the “readability” of two books, or….?
So what was so surprising?
Recap: I got 10 dominoes from a supermarket recently and was surprised to find that all 10 were different (there are 50 different dominoes to collect). Ok, so on the face of it this may look like a familiar (and not super awesome) starter. Collecting cereal cards, ice block sticks, seed packets…….. But I was surprised to see this because I was thinking that a random process like this would mean I should expect to see at least one double up e.g. like seeing runs of heads when you flip a coin. When I thought about it more, I realised I wasn’t taking into account there were 50 dominoes – this makes a difference.
More about SOLO
SOLO stands for the Structure of the Observed Learning Outcomes. It’s a model/taxonomy for defining different levels of understanding or thinking and was developed by J. Biggs and K. Collis in 1982. I’ve been using SOLO in my teaching of statistics since around 2006 and think it’s awesome. It fits so well with building conceptual understandings of statistics rather than just procedural ones. I use SOLO in (at least) two ways: (1) to structure good questions for students to use when working with data, questions to make them think at different levels and (2) to plan my teaching of a topic e.g. what are the key ideas (not skills)?
The prices increased from Jan to Feb and then decreased from Mar to May and then increased again…..
I think I like this answer on Quora re how to explain over-fitting of models. Some of the language is a bit off – I think if you swap the word “hypothesis” for “model” and remove “experiments” and replace with “observations” it reads better. But I like the idea of how to explain to students that a model is not about getting a perfect fit to the observed data and that simpler can be better (e.g. go for the minimum number of trend lines as possible that tell the general story of what is happening……).