Lately I have become very interested in probabilities, variance and standard deviation (as you may notice from my other threads!)
My current thoughts are regarding 'streaks' in even money games.
If a streak of say 7 or more blacks in roulette happens approx 0.0064 of the time.
- because this is quite a low number is there a high variance of this event occuring?
- how would I calculate this?
Thanks in advance for anyone who may be able to help!
to 8 in a row is just the same as it staying at 7.
From many of your other posts your understanding of the basics is very limited IMO. (Do not fear, you are in the majority)Quote: Walkinshaw30tHi,
Lately I have become very interested in probabilities, variance and standard deviation (as you may notice from my other threads!)
Start here
http://stattrek.com/probability/set-and-subset.aspx?Tutorial=Stat
or here
http://stattrek.com/probability-distributions/binomial.aspx?Tutorial=Stat
Teach yourself statistics
You are asking an advanced question and need to understand more basics (like what is variance and how is it calculated and what does that result mean...)Quote: Walkinshaw30tMy current thoughts are regarding 'streaks' in even money games.
If a streak of say 7 or more blacks in roulette happens approx 0.0064 of the time.
- because this is quite a low number is there a high variance of this event occuring?
- how would I calculate this?
to learn how to solve this.
for 18/38 = 0.005350844897313
18/37 = 0.006449045889425 (1 in 155.061 trials but each TRIAL is NOT 1 spin in length for your example)
I gather you want 18/37 which you did not even specify.
That figure is only for the next 7 spins.
Not in 10 spins: 0.016384062529891
or
in 100 spins: 0.275468908341602
# of trials increase so does the probability of seeing your streak (run)
start here
http://www.pulcinientertainment.com/info/Streak-Calculator-enter.html
(there are also a few WoV threads about this linked at that page. I think it still is accurate.)
https://wizardofvegas.com/forum/questions-and-answers/math/8141-on-average-how-many-trials-will-it-take-to-see-a-streak-of-8-qs-for-fun/#post119117
average # of spins = 300.01
variance = 86649.92307
standard deviation = 294.4
where p = probability of success (18/37 in your example)
q = 1-p
r = length of the streak (run)
The distribution (for this "wait" time) is NOT a normal distribution
It is not the same as the # of reds in 300 spins
Good Luck.
The one mistake I see you always making in your other threads is
mixing up the standard deviation for the number of successes in N trials (say # of reds in 100 spins)
and the standard deviation of the wagers over N trials (say # of bets on red in 100 spins).
They are totally two different animals but it appears to me (I may be 100% wrong)
you consider them both the same.
Maybe others will reply there too
Quote: EvenBobWhat difference does it make. The chance of it going
to 8 in a row is just the same as it staying at 7.
Yes I understand that- that is not what Im interested in. Im interested in standard deviation- how much it can be relied on and how many trials would be required to see the results fall within the expected standard deviations eg 68% with in 1, 95% within 2 etc
Quote: 7crapsFrom many of your other posts your understanding of the basics is very limited IMO. (Do not fear, you are in the majority)
Start here
http://stattrek.com/probability/set-and-subset.aspx?Tutorial=Stat
or here
http://stattrek.com/probability-distributions/binomial.aspx?Tutorial=Stat
Teach yourself statistics
You are asking an advanced question and need to understand more basics (like what is variance and how is it calculated and what does that result mean...)
to learn how to solve this.
for 18/38 = 0.005350844897313
18/37 = 0.006449045889425 (1 in 155.061 trials but each TRIAL is NOT 1 spin in length for your example)
I gather you want 18/37 which you did not even specify.
That figure is only for the next 7 spins.
Not in 10 spins: 0.016384062529891
or
in 100 spins: 0.275468908341602
# of trials increase so does the probability of seeing your streak (run)
start here
http://www.pulcinientertainment.com/info/Streak-Calculator-enter.html
(there are also a few WoV threads about this linked at that page. I think it still is accurate.)
https://wizardofvegas.com/forum/questions-and-answers/math/8141-on-average-how-many-trials-will-it-take-to-see-a-streak-of-8-qs-for-fun/#post119117
average # of spins = 300.01
variance = 86649.92307
standard deviation = 294.4Var(N) = 1−(p^(1+2r)−qpr(1+2r)/q^2p^2r
where p = probability of success (18/37 in your example)
q = 1-p
r = length of the streak (run)
The distribution (for this "wait" time) is NOT a normal distribution
It is not the same as the # of reds in 300 spins
Good Luck.
The one mistake I see you always making in your other threads is
mixing up the standard deviation for the number of successes in N trials (say # of reds in 100 spins)
and the standard deviation of the wagers over N trials (say # of bets on red in 100 spins).
They are totally two different animals but it appears to me (I may be 100% wrong)
you consider them both the same.
Maybe others will reply there too
Cheers thanks for reply and sending through the links,
Im probably not explaining myself clear enough but what I want to know is: if a streak of 7 reds happens on average 1/155. What is the variance associated with this happening? Would it be possible to go 300 spins without seeing a 7 streak? What about 2500?
And how many standard deviations away from average these would be?
I understand this is quite a difficult question.
Thanks
This shows your lack of understanding the concept of standard deviation.Quote: Walkinshaw30tIm interested in standard deviation- how much it can be relied on and how many trials would be required to see the results fall within the expected standard deviations eg 68% with in 1, 95% within 2 etc
standard deviation estimates (quite well with a good sample size) the Binomial Probability Distribution a range of possible outcomes.
It really is as simple as that. 1SD is always 1SD when we talk about a normal distribution.
Your streak question is NOT a normal distribution as seen from the photo I posted.
Let us start with a simple example.
Copy and Paste here
Say Bet RED 18/37 for 74 spins
How many REDS in 74 spins
EV = 36
Is this correct??
I could be wrong here.
But we know we will not always get exactly 36 reds in 74 spins.
There is variance.
We should just calculate a table using the Binomial Probability Distribution and you can add all the values up and see what is what.
There is math to that unless you use a program to do it for you.
Here we are
http://stattrek.com/probability-distributions/binomial.aspx?Tutorial=Stat
x prob[X=x] prob[X<x] prob[X>=x] prob[X<=x] prob[X>x]
21 0.0001854 0.0001207 0.9998793 0.0003061 0.9996939
22 0.0004230 0.0003061 0.9996939 0.0007291 0.9992709
23 0.0009061 0.0007291 0.9992709 0.0016353 0.9983647
24 0.0018242 0.0016353 0.9983647 0.0034594 0.9965406
25 0.0034563 0.0034594 0.9965406 0.0069157 0.9930843
26 0.0061710 0.0069157 0.9930843 0.0130867 0.9869133
27 0.0103932 0.0130867 0.9869133 0.0234799 0.9765201
28 0.0165275 0.0234799 0.9765201 0.0400074 0.9599926
29 0.0248363 0.0400074 0.9599926 0.0648437 0.9351563
30 0.0352937 0.0648437 0.9351563 0.1001374 0.8998626
31 0.0474577 0.1001374 0.8998626 0.1475951 0.8524049
32 0.0604149 0.1475951 0.8524049 0.2080101 0.7919899
33 0.0728448 0.2080101 0.7919899 0.2808549 0.7191451
34 0.0832190 0.2808549 0.7191451 0.3640739 0.6359261
35 0.0901018 0.3640739 0.6359261 0.4541756 0.5458244
36 0.0924729 0.4541756 0.5458244 0.5466485 0.4533515
37 0.0899736 0.5466485 0.4533515 0.6366221 0.3633779
38 0.0829950 0.6366221 0.3633779 0.7196171 0.2803829
39 0.0725787 0.7196171 0.2803829 0.7921958 0.2078042
40 0.0601639 0.7921958 0.2078042 0.8523596 0.1476404
41 0.0472661 0.8523596 0.1476404 0.8996257 0.1003743
42 0.0351830 0.8996257 0.1003743 0.9348088 0.0651912
43 0.0248047 0.9348088 0.0651912 0.9596135 0.0403865
44 0.0165562 0.9596135 0.0403865 0.9761697 0.0238303
45 0.0104566 0.9761697 0.0238303 0.9866263 0.0133737
46 0.0062452 0.9866263 0.0133737 0.9928715 0.0071285
47 0.0035247 0.9928715 0.0071285 0.9963963 0.0036037
48 0.0018783 0.9963963 0.0036037 0.9982746 0.0017254
49 0.0009442 0.9982746 0.0017254 0.9992188 0.0007812
50 0.0004473 0.9992188 0.0007812 0.9996660 0.0003340
51 0.0001994 0.9996660 0.0003340 0.9998654 0.0001346
Here is an easier way
Binomial Standard Deviation = for 1 trial = square root of (P * (1-P)) = 0.49981
This is in your studies
You know how to use that value?
For trials less than about 30 we do not even think of using it.
But for 74 spins we do (the data is very close to being normally distributed, right?? There still will be a small error)
what is the square root of 74?
Multiply that by the SD of 1 trial = 0.49981
about 4.3?
So a 1SD range = about 4.3(about 68% or look here: nice table about half way down)
http://en.wikipedia.org/wiki/Standard_deviation
EV = 36
SD = 4.3
That 1SD range of 31.7 to 40.3 IS a 68% probability. (68.2689492%)
It does not matter how many trials there are, 500, 5000 or 5 million
A 1 SD range = 68.2689492%. That means about a 32% probability to be outside of 1SD, equally too high or too low.
A 2SD range is wider, as is a 3SD and a 4SD range. There are probabilities related to each range.
of course it is IMPOSSIBLE to get exactly 31.7 Reds in 74 spins, using my example
so we can see we have approximated the Binomial distribution.
How close are we to the 1SD range??
try 32 to 40 = 70.5% so we gots a small error.
But with a small sample size, the error is to be expected. A larger sample size and the smaller the error.
There is a formula for that too but you need to really understand the Binomial Probability Distribution
and variance and standard deviation first.
Then see how to use the central limit theorem to solve common math related problems.
So, there must be a very specific question you want answered other than the streak question above.
Fire away and see where your understanding now lies.
Continued Good Luck
yes. Does this not seem right??Quote: Walkinshaw30tWould it be possible to go 300 spins without seeing a 7 streak?
only a 0.632729561221891 probability of seeing at least 1 such streak.
That is about a 36.8% chance (probability) of NOT seeing such a streak.
a handy streak calculator here
http://www.pulcinientertainment.com/info/Streak-Calculator-enter.html
expected # of runs of length 7 or more in 2500 spins = about 8.26Quote: Walkinshaw30tWhat about 2500?
This is just the average. Here is part of that distribution using a computer program in Excel
Event Run Probability 1 in
0 runs of length 7 or more 0.000208547 4,795.08
at least 1 run of length 7 or more 0.999791452787651 1.00
at least 2 runs of length 7 or more 0.997977803412864 1.00
at least 3 runs of length 7 or more 0.990139483734160 1.01
at least 4 runs of length 7 or more 0.967693431018414 1.03
at least 5 runs of length 7 or more 0.919781386396954 1.09
at least 6 runs of length 7 or more 0.838469580627413 1.19
at least 7 runs of length 7 or more 0.724186199048575 1.38
at least 8 runs of length 7 or more 0.587364455037691 1.70
at least 9 runs of length 7 or more 0.444931004198057 2.25
at least 10 runs of length 7 or more 0.313958401980447 3.19
at least 11 runs of length 7 or more 0.206251693272468 4.85
at least 12 runs of length 7 or more 0.126240487898944 7.92
Now you go from asking about the chance (probability) of an event to the standard deviation over many trialsQuote: Walkinshaw30tAnd how many standard deviations away from average these would be?
I understand this is quite a difficult question.
Thanks
I gave the formula for that in my first post. I guess it went past you too quickly.
If not what sample size would be required to achieve the expected SD results? What would the variance be for 100 samples of 74 spins?
[spoiler]
x prob[X=x] prob[X<x] prob[X>=x] prob[X<=x] prob[X>x]
21 0.0001854 0.0001207 0.9998793 0.0003061 0.9996939
22 0.0004230 0.0003061 0.9996939 0.0007291 0.9992709
23 0.0009061 0.0007291 0.9992709 0.0016353 0.9983647
24 0.0018242 0.0016353 0.9983647 0.0034594 0.9965406
25 0.0034563 0.0034594 0.9965406 0.0069157 0.9930843
26 0.0061710 0.0069157 0.9930843 0.0130867 0.9869133
27 0.0103932 0.0130867 0.9869133 0.0234799 0.9765201
28 0.0165275 0.0234799 0.9765201 0.0400074 0.9599926
29 0.0248363 0.0400074 0.9599926 0.0648437 0.9351563
30 0.0352937 0.0648437 0.9351563 0.1001374 0.8998626
31 0.0474577 0.1001374 0.8998626 0.1475951 0.8524049
32 0.0604149 0.1475951 0.8524049 0.2080101 0.7919899
33 0.0728448 0.2080101 0.7919899 0.2808549 0.7191451
34 0.0832190 0.2808549 0.7191451 0.3640739 0.6359261
35 0.0901018 0.3640739 0.6359261 0.4541756 0.5458244
36 0.0924729 0.4541756 0.5458244 0.5466485 0.4533515
37 0.0899736 0.5466485 0.4533515 0.6366221 0.3633779
38 0.0829950 0.6366221Time will tell
Nice question.Quote: Walkinshaw30tFor the example you gave of 74 spins-
if i was to make 74 spins and write down results of which sd bracket the results fell into and repeat this 100 times,
can i guarantee that 68% will be within 1 standard deviation, 95% within 2 SD etc
No guarantee when we deal with
the probability of a random event or many random events
unless the probability for success or failure equals 1 or 0.
This is an advanced question.Quote: Walkinshaw30tIf not what sample size would be required to achieve the expected SD results?
The answer depends on what 'error' one is willing to accept because we are dealing, in these gambling examples,
with Independent and identically distributed random variables
http://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables
68% +/- 10% error (58% - 78%)
68% +/- 1% error (67% - 69%)
requires a 100 times larger sample size than the 10% error
68% +/- .1% error (67.9% - 68.1%)
requires a 100 times larger sample size than the 1% error
100 sessions of 74 spins each.Quote: Walkinshaw30tWhat would the variance be for 100 samples of 74 spins?
We track red at 18/37
You want to know how many sessions would fall in that 1SD range.
We keep it simple and use 68% as the probability for 1SD for 1 event.
This has been proven in many prob and stats book. Used here without any proof.
Again, this is a basic binomial probability distribution type question.
where
N = # of trials
P = Prob of success (is a constant) = 68%
N*P = EV = 68 (sessions expected to fall into the 1SD range)
Variance = N*P*(1-P)
I get 21.76
SD = square root of the variance I get 4.664761
Your 1SD range over 100 sessions would be 68 + and - 4.664761
63.335238 to 72.664761
I even ran a simulation using WinStats
Thanks again Rick Parris
I used 68.2% as the probability because I can
group middle freq freq/100
----------------------------------------------
45.50 <= x < 46.50 46.00 2 0.00%
46.50 <= x < 47.50 47.00 5 0.00%
47.50 <= x < 48.50 48.00 11 0.00%
48.50 <= x < 49.50 49.00 30 0.00%
49.50 <= x < 50.50 50.00 57 0.01%
50.50 <= x < 51.50 51.00 134 0.01%
51.50 <= x < 52.50 52.00 273 0.03%
52.50 <= x < 53.50 53.00 535 0.05%
53.50 <= x < 54.50 54.00 1012 0.10%
54.50 <= x < 55.50 55.00 1831 0.18%
55.50 <= x < 56.50 56.00 3085 0.31%
56.50 <= x < 57.50 57.00 5197 0.52%
57.50 <= x < 58.50 58.00 7891 0.79%
58.50 <= x < 59.50 59.00 12500 1.25%
59.50 <= x < 60.50 60.00 18344 1.83%
60.50 <= x < 61.50 61.00 25693 2.57%
61.50 <= x < 62.50 62.00 34605 3.46%
62.50 <= x < 63.50 63.00 44998 4.50%
63.50 <= x < 64.50 64.00 55828 5.58%
64.50 <= x < 65.50 65.00 65865 6.59%
65.50 <= x < 66.50 66.00 75514 7.55%
66.50 <= x < 67.50 67.00 81316 8.13%
67.50 <= x < 68.50 68.00 85331 8.53%
68.50 <= x < 69.50 69.00 84564 8.46%
69.50 <= x < 70.50 70.00 80606 8.06%
70.50 <= x < 71.50 71.00 72816 7.28%
71.50 <= x < 72.50 72.00 63378 6.34%
72.50 <= x < 73.50 73.00 51571 5.16%
73.50 <= x < 74.50 74.00 40655 4.07%
74.50 <= x < 75.50 75.00 30246 3.02%
75.50 <= x < 76.50 76.00 21284 2.13%
76.50 <= x < 77.50 77.00 14459 1.45%
77.50 <= x < 78.50 78.00 9049 0.90%
78.50 <= x < 79.50 79.00 5382 0.54%
79.50 <= x < 80.50 80.00 2954 0.30%
80.50 <= x < 81.50 81.00 1605 0.16%
81.50 <= x < 82.50 82.00 767 0.08%
82.50 <= x < 83.50 83.00 353 0.04%
83.50 <= x < 84.50 84.00 148 0.01%
84.50 <= x < 85.50 85.00 61 0.01%
85.50 <= x < 86.50 86.00 29 0.00%
86.50 <= x < 87.50 87.00 10 0.00%
87.50 <= x < 88.50 88.00 4 0.00%
88.50 <= x < 89.50 89.00 2 0.00%
----------------------------------------------
grouped data
items: 1000000
minimum value: 46.00
first quartile: 65.00
median: 68.00
third quartile: 71.00
maximum value: 89.00
mean value: 68.21
midrange: 67.50
range: 43.00
interquartile range: 6.00
mean abs deviation: 3.72
sample variance (n): 21.68
sample variance (n-1): 21.68
sample std dev (n): 4.66
sample std dev (n-1): 4.66
----------------------------------------------
cumulative
----------------------------------------------
45.50 <= x < 46.50 46.00 2 0.00%
46.50 <= x < 47.50 47.00 7 0.00%
47.50 <= x < 48.50 48.00 18 0.00%
48.50 <= x < 49.50 49.00 48 0.00%
49.50 <= x < 50.50 50.00 105 0.01%
50.50 <= x < 51.50 51.00 239 0.02%
51.50 <= x < 52.50 52.00 512 0.05%
52.50 <= x < 53.50 53.00 1047 0.10%
53.50 <= x < 54.50 54.00 2059 0.21%
54.50 <= x < 55.50 55.00 3890 0.39%
55.50 <= x < 56.50 56.00 6975 0.70%
56.50 <= x < 57.50 57.00 12172 1.22%
57.50 <= x < 58.50 58.00 20063 2.01%
58.50 <= x < 59.50 59.00 32563 3.26%
59.50 <= x < 60.50 60.00 50907 5.09%
60.50 <= x < 61.50 61.00 76600 7.66%
61.50 <= x < 62.50 62.00 111205 11.12%
62.50 <= x < 63.50 63.00 156203 15.62%
63.50 <= x < 64.50 64.00 212031 21.20%
64.50 <= x < 65.50 65.00 277896 27.79%
65.50 <= x < 66.50 66.00 353410 35.34%
66.50 <= x < 67.50 67.00 434726 43.47%
67.50 <= x < 68.50 68.00 520057 52.01%
68.50 <= x < 69.50 69.00 604621 60.46%
69.50 <= x < 70.50 70.00 685227 68.52%
70.50 <= x < 71.50 71.00 758043 75.80%
71.50 <= x < 72.50 72.00 821421 82.14%
72.50 <= x < 73.50 73.00 872992 87.30%
73.50 <= x < 74.50 74.00 913647 91.36%
74.50 <= x < 75.50 75.00 943893 94.39%
75.50 <= x < 76.50 76.00 965177 96.52%
76.50 <= x < 77.50 77.00 979636 97.96%
77.50 <= x < 78.50 78.00 988685 98.87%
78.50 <= x < 79.50 79.00 994067 99.41%
79.50 <= x < 80.50 80.00 997021 99.70%
80.50 <= x < 81.50 81.00 998626 99.86%
81.50 <= x < 82.50 82.00 999393 99.94%
82.50 <= x < 83.50 83.00 999746 99.97%
83.50 <= x < 84.50 84.00 999894 99.99%
84.50 <= x < 85.50 85.00 999955 100.00%
85.50 <= x < 86.50 86.00 999984 100.00%
86.50 <= x < 87.50 87.00 999994 100.00%
87.50 <= x < 88.50 88.00 999998 100.00%
88.50 <= x < 89.50 89.00 1000000 100.00%
Say one flips a fair coin 10 times.
We expect 5 Heads and 5 Tails, most would agree.
But if 1 million people did this experiment we would find that less than 25% of them actually got exactly 5 Heads.
More got 6 or higher or 4 or less
Distributions of random events
"Many events can't be predicted with total certainty.
The best we can say is how likely they are to happen, using the idea of probability."
http://www.mathsisfun.com/data/probability.html
Excellent
More tools in the tool box