# Central limit theorem | Inferential statistics | Probability and Statistics | Khan Academy

2597 ratings | 1220346 views
Html code for embedding videos on your blog
you guys are shaping history. thank you.
임상일 (2 months ago)
Your video is easy to understand. Thank you^^
Atharva #breakthrough (3 months ago)
but i have heard that once you get a 2.75 it is impossible to get exactly the same value again. then how can we get a bell curve?
John Landon Miller (3 months ago)
Too much droning goddamit
First explanation, that I saw, that gets the dimensions of the variables right. Interesting though is, that the CLT is differently defined in Japan.... In Japan every Sample S_i must have a mean of u.
kinsa khan (4 months ago)
Where can I get the proof to this theorem ???
Roger Syversen (6 months ago)
he is sampling by Benford's law
An insurance company has 25000 customers if the annual claim of the company is a random variable with mean 320 and standard deviation 540. approximate the probability that the total annual claim of the company exceeds 8.3million naira
Jin Tao (7 months ago)
I don't think that if the sample goes infinity,then the mean will be very close to 2.75 as he assumed in the video. I think it will be 3.3333…for 1*2/6+3*1/6+4*1/6+6*2/6=10/3.Anyone agrees with me?
Thank you for this explanation. I would like to ask you a question. If we got a distribution (probably heavy tailed) and take many samples with different size (for example the first sample will have 30 samples, the second 40, the third 34, the fourth 51 and so on) will then the distribution of those samples be normal?
HeyImRod (8 months ago)
Fantastic!
Firas Kais (8 months ago)
Did you mean to right the symbol of the mean instead of variance in the center of graph toward the beginning?
La Ri (8 months ago)
I nearly cried because there's no single thing that I understand about central limit theorem.But you sir made it so easy for me.Thank you 😭
trainee2018 (9 months ago)
that's super cool
5555555555555555555555555
John Mare (11 months ago)
what is p0hat? my professor keeps using it and i havent seen it in the other videos either, please, anyone explain this
Suzanna Reinhardt (10 months ago)
It's the empirical probability. So basically it's how many successes you have in your experiment over your total number of attempts
Just Mimi (11 months ago)
cramming for my stats exam tomorrow
Joe Knight (11 months ago)
what a good looking 4
UNH Psychology (1 year ago)
This is painful
hovenmoz (1 year ago)
what if your crazy original distribution was not symmetric. Would your repeated sampling averages still result in a normal distribution?
Laubotje (1 year ago)
Centril li.imit theory is big If tru.......
William JSS (1 year ago)
Jeez, thanks for driving it home! You need to get with a publisher and go wide, you explain in the most basic, and common fundamental way for easy learning.
Kristian R. Brasel (1 year ago)
if your sample size was infinite wouldn't you get the same mean every time?
Pharaoh on LFS (1 year ago)
I just felt like this central limit theorem thing is super important to learn along with standard deviation... and the 80-20 rule :) I was thinking, is this whole "normal distribution" thing really as simple as this? - The data keeps coming from the same "odds" or the same source, aka the original sample you drew. Because the odds haven't changed, and you're originating the data from the same set that hasn't been modified half way through the experiment, isn't it sort of expected that you should end up with a normal distribution? Like when you play the same arcade game, or you're at the circus, no matter who plays and for how many times, the odds are the same? Just seems like a complicated way of saying "the odds haven't changed", so "on average" you'll end up with "X" at a certain rate or whatever. Unless I'm missing something lol
Gihan Panditha (1 year ago)
cool
TheImaxify (1 year ago)
You never mentioned that n (sample size) should go to infinity before this happens! please correct it.
kushagra deep (1 year ago)
thanks for your existence i have a question if the sample size is six, but instead of numbers we have categorical variables. E.g, S¹=[pc, laptop, pc, mobile, mobile, mobile]. now we can't take the mean. how can central limit theorem help us now?
Uranus (1 year ago)
Fantastic explanation. A shame that most teachers are not educated enough to be able to understand and explain things like this to their students.
Victor Serra (1 year ago)
He gets 5 samples and then averages them. What is the size of the original samples? 5 as well?
MD Abir Choudhury (1 year ago)
Thanks for the explanation!
Jonathan Cavazos (1 year ago)
Show me the Data!
AbsorbIt (1 year ago)
Is that a Greek 4? lol
centreal limit theorem for dummies huh? I sure am a dummy so no problem with that for sure.
Sarnen (1 year ago)
This is a boring, rambling, over-explanation that never seemed to get to the point until the last minute. I was better off google'ing the definition than listening to his longwinded anecdotes. GET. TO. THE. POINT. I'm not here for a comedian or to listen to a verbal flood. The reason people are here is because they're having a hard time learning with their peers. Meaning they need the bottom line, faster, with less distractions from concepts, anecdotes, and words that aren't directly and immediately related to the topic.
NaNa Naomi (1 year ago)
Hi do you do tutoring offline?
Errin Bass (1 year ago)
Great video!
Megan Taylor (1 year ago)
How would you calculate the mean and standard of the sample distribution that would result from take n number of samples from a distribution?
GirlGirlicious (1 year ago)
You, sir, are The Real MVP!
Justine Tenney (1 year ago)
I swear to god he keeps saying meme and not mean. The internet is ruining me XD
Ramachandra jr (1 year ago)
Does that mean normally distributed data is a characteristic of not the population distribution but the act of drawing a simple random sample itself?
Junisha Kaul (1 year ago)
The man is so not right about the concept. For the central limit theory to apply n >30. He constructed a normally distributed curve using a sample size of 4, i.e n=4. Are the other viewers delusional ?? I think the guy might have been high on acid.
Junisha Kaul (1 year ago)
Man there's an error. You cant have more than 256 samples. Do the math.
Ramachandra jr (1 year ago)
@Junisha Sampling is different from possible combinations. We draw a sample here, that means we are going to roll a die and note down the result. The same way we note down the result of 10000 such samples. Sal here is assuming that he'd get 4 with some blah blah probability and 5 with some... probability(just assumptions).
Junisha Kaul (1 year ago)
If x = 1,3,4 or 6 and the sample size is 4, there would be 4*4*4*4 possibilities i.e. 4^4 possibilities =a maximum of 256 possible outcomes so by taking 10,000 samples you will be repeating each 1 about 40 times.
Ramachandra jr (1 year ago)
And why is that?
Roberto Martini (1 year ago)
Cool
i6mi6 (1 year ago)
Drinking game: drink whenever you hear the word "sample"
Skye Paul (3 months ago)
I'm trying to study here shhh
Sam Dave Pollard (3 months ago)
Are you intoxicating that I'm insinuated?
nelly aviles (5 months ago)
lets just kill braincells before the exam..
Roger Syversen (6 months ago)
no
Jonathan Lunenfeld (10 months ago)
i6mi6 I might need to play that game to get over my probability score... D;
Mark Rudis (1 year ago)
in one word, gazilion!
Sera Chrysanthemum (1 year ago)
Heck yeah, this is a great motivating video... gives an outline of the idea and why it's so cool and important!
Obinna Daniel (2 years ago)
I mean this is brilliant! Got me thinking and understanding deeply.
Henna George (2 years ago)
If x = 1,3,4 or 6 and the sample size is 4, there would be 4*4*4*4 possibilities i.e. 4^4 possibilities =a maximum of 256 possible outcomes so by taking 10,000 samples you will be repeating each 1 about 40 times.
brenda pham (2 years ago)
how do you denote the number of times you have samples? example: so you say sample size n=4, but 100 of those samples, how do we denote that?
notthatbl (2 years ago)
He says that with an infinite sample size, it approaches a normal distribution, but shouldn't it just be a straight line at the mean? (albeit this is a normal distribution with a standard deviation of 0)
Steve Buscemi (8 months ago)
notthatbl what he means is not the size of each sample (if you had one sample of an infinite size, yeah, it'd be one precise point), but if you had an infinite number of samples (infinite samples with only 4 in each sample)
Kristian R. Brasel (1 year ago)
I had the same question, I think what he is trying to say is that as the sample size grows it gets closer to a normal distribution. However there must be a point of diminishing return, because once the sample size reaches the number of events you will not get an approximation but the actual mean, which would be the same every time. Another interpretation could be if each sample was of increasing sample size until it reached the number of events. That would likely give you a normal distribution.
Junisha Kaul (1 year ago)
If the SD is 0 there wouldn't be a normal distribution.

Simple is beautiful
Rachita Dehury (2 years ago)
Thank u so very much u helped me a lot
Koen Vos (2 years ago)
Best explanation out there. Thanks, Sal!
craft with NIKKI (6 months ago)
It's only explanation.... I need it's proof, but explanation is well defined.
Love Live Life (2 years ago)
THANK YOU!!!
Quazi Nuzhat (2 years ago)
you nailed it. thanx a lot
esmeralda4884 (2 years ago)
You explain this better than a textbook. You are a great!
Vishnu Vardhan (2 years ago)
Matthew wroblewski (2 years ago)
Very, very cool stuff. Also, you're obviously a smart guy, Sal. But at the same time, you're incredibly accommodating to us students. Thank you for your sincerity and empathy, sir.
baldy hardnut (3 months ago)
smart guy is an understatement lol
I love statistics!
Shareef (2 years ago)
damn this is actually really cool
James (2 years ago)
Yeah fuck this I'm going to bed
Q Q (10 months ago)
If its a sign wouldn't that mean it is the comment you needed to see? Would imply you should work your ass off so you're not cramming last day...
Renee Knezovich (11 months ago)
This is not the comment I needed to see at 11:14pm with my exam in two days (not tomorrow, but the day after) hahahaha XD Perhaps its a sign lol
Don The Best (2 years ago)
What happens to the standart deviation if we work on means of sample instead of means of one large sample ? Someooone? :)
Dont you want to make a video about Dunnett test, Bonnferroni-Holm es correction etc.?
Asther Phoenix (2 years ago)
check out their channel . they have it
you are amazing , thank you , not only for this video , but for all your videos that i have been using for 3 years :)
Tilak Chand Dhital (2 years ago)
U r helping me a lot :D
Megan Maloney (2 years ago)
These videos have been tremendously helpful! Thank you SO MUCH for making them! The concepts make so much more sense when I can see them being worked out.
Khamis Buol (3 years ago)
;)
Kieran Sibley (3 years ago)
trollololololol
cherrychapstickgurl (3 years ago)
this helped me with psychology! thank u sal
Eugene deschamps (1 year ago)
yha there is is how crazy do you think you patient is and if he going to kill anybody there is a general equation is to start guessing the medication that you think that fits the med book for that diagnoses then again guess the doses you gonna cram down his esophagus the more the better because the more patients you overdose is better for the bank book at the end
cherrychapstickgurl (2 years ago)
+General Dash that's right! you need math in psychology too
General Dash (2 years ago)
+cherrychapstickgurl wut?lol
Joe Dunbar (3 years ago)
khan is awesome ! Im in this course that could not explain this well. I need to know the principle and Khan blew it out of the water ! I know the principle and the APPLICATION ! sweet
Oisin Hughes (3 years ago)
thanks very much for putting this together  - your a role model in helping folks understand the world around them.
Sera Chrysanthemum (1 year ago)
There are many people like Sal on YouTube, though he's one of the OG's, which is something that's friggin awesome about this world!
atiger92 (3 years ago)
At 7:24 the narrator says that a sample size of 4 and a sample size of 20 would have the same mean.  That is not correct!  They might be close, but it highly unlikely that the mean of any sample sizes would have exactly the same mean.  Khan Academy...you posted something misleading.  I'd suggest vetting the narrators more closely and make sure they don't tarnish your reputation.
Bobby Bullock (2 years ago)
+atiger92 I think you misunderstood the comment - he meant that the means of the sampling distributions are not a function of the number of observations in a sample with the same probability distribution (i.e., equal chances of rolling a 1, 3, 4, 6).  In statistics it is very common to refer to the idea of an "infinite sample" to make points and in that case, taking an infinite number of observations from a sample size of 4 and a sample size of 20 would indeed produce the exact same mean (again given the same chances as before).  Of course he understands that if he rolled his dice 4 times and took the mean of that, then rolled the dice 20 times and took the mean of those rolls, that they would not likely be equal.  However, if he took the mean of an infinite number of 4-rolls, it would be equal to the mean of an infinite number of 20-rolls.
Amol Buch (3 years ago)
You are just awesome..!
TheSnow720 (3 years ago)
I watched this video and didn't understand a thing at all. Then I went to class and read my textbook and it all became crystal clear!
ronak shah (3 years ago)
Great explanation!!
great vid!
latesq1 (3 years ago)
What does the central limit theorem tell us about samples from a bi-modal distribution ?
tiivc (3 years ago)
The whole point of the central limit theorem is that it applies to all distributions at all times as long as they have a well-defined mean and variance.
TheSevenofMine (3 years ago)
I like it that it's called Khan Academy. KHAAAAAAN!!
KWS ATL (3 years ago)
went to lecture today and read the chapter and was clueless. I watched the first 7 minutes of this and the concept is crystal clear!!
Hannibal EnemyofRome (3 years ago)
Very instructive video!
Gopika Rajanikanth (3 years ago)
Honestly, this almost 10 minute video helped me understand something we were learning in class for like 2 weeks! Thank you so much!
crni195 (1 year ago)
lol, thats not bad.. we did this in 1 lesson and they expect you to know this on oral exam and im not even studying mathematics, im in civil engeneering
Nils Berg (3 years ago)
You're saving my ass! Thank you!
Nithin Reddy (3 years ago)
felixthainsomniac (3 years ago)
thank you
Eric Antolovic (3 years ago)
Khan Academy you are the bomb dig.
danielle4823 (3 years ago)
Its unclear if we need to increase the sample size of Si or if we need to increase the number of samples S that we take from the population
Dustin Watson (3 years ago)
You need to increase the sample size of Si. The rule of thumb is if you took a sample size of >=40  and plotted their means then you would get a normal dist curve.  (30 can also work if the population variance is known, but its usually not, hence why you are using the CLT)
Musliston (4 years ago)
Great! That was very clear ^^
rafaelaprende (4 years ago)
Thank you! :-)
YUNOS RIDUWAN (4 years ago)
eric (4 years ago)
HAHAHAHAHA
Glorified Truth (4 years ago)
WOW!!!!  That's hilarious!  Please, do you have any other gems to share???
Trainwhrek (4 years ago)
SNORE FEST loljk good lesson -.-
Richard Guzman (4 years ago)
Idiot
Fronte999 (4 years ago)
I don't understand. That sampling to determine the mean tends to cluster in a pattern that looks like a bell is not a surprise. It doesn't tell us that a '2' or a '5' are impossible or not or what the first chart looked like, except as to it's mathematical mean. So what was all of the stuff at the beginning of the video about?  All of the elaboration that it is not a regular pattern or that '2' and '5' are possible or not possible. Every set of data has a mean, and taking potshots at it would of course look like a bell curve. No matter what the data looks like. So what was the point of telling us what it looked liked?
Dustin Watson (3 years ago)
Regression Analysis (and statisitics) is fundamental to any scientific experiment as well.
Marcos Ponce (4 years ago)
When you have a bell curve, a normal distribution, you are then capable of meeting assumptions in another statistical concept known as regression analysis. Regression analysis, to me, is absolutely amazing. It is widely used in economics but also in a plethora of other fields as well.
MrHav1k (4 years ago)
Well explained, but the arithmetic for the actual numbers is a bitch.
Rissa (4 years ago)
It is good to know that your brain malfunctions from time to time, too lol Thanks for the great videos!!!
LeahInTheRye (5 years ago)
Thanks! Got a test that includes this section next week and it had me stumped
karafofubuntu (5 years ago)
Thank you so much for a clear explanation.
Valentin Tihomirov (5 years ago)
Ok. Varying the sample size between 1 and ∞ we get the transit distribution between our original one and the normal distribution.
Youn-Jee Rengnez (5 years ago)
OMG, I'm taking a grad level biostats class and you are so saving my ass right now.
atom (5 years ago)
Why can't teachers explain things this clearly, why do they have to act all scholarly?
Alex Boorman (5 years ago)
I heard about the elegance of math: think I just got it!
Lorassa (5 years ago)
THANK YOU!! Your videos are amazing, such an amazing and intelligent man!
Quinn Culver (5 years ago)
Very nice explanation. Hats off.