top of page
  • Writer's pictureJackson Curtis

The Conditional Birthday Problem

Most people who have recently taken a statistics class have heard of ‘The Birthday Problem,’ or the surprising fact that it only takes a group of 23 people to have a >50% chance of two people sharing a birthday. However, the mathematical framing of The Birthday Problem isn’t quite the same as the logistical problem that might occur with a quickly growing Latter-Day Saint family when two kids from different sides of the family share a birthday. In the family case, you’re not so much interested in “will a random group of x people have a birthday problem?” as “How many more people can we add until we create a birthday problem?” This second question conditions on the fact that your existing family does not yet have any birthday collisions. As I have gotten married and siblings on both sides have been starting and growing their families, the logistical issues that might arise when we have a “birthday problem” need to be considered.

My immediate family currently consists of 22 people, so it might seem like the birthday problem is right around the corner. However, that’s not quite right. We need to condition on the fact that we have not yet experienced the birthday problem in order to calculate how long we can expect to avoid it. For example, the next person to join our family only has a 22/365= 6% chance of causing the birthday problem, so we are a long way away from having a 50% chance of a birthday problem in our family by the time we reach 23 people. We can create a simple formula to calculate the probability of experiencing the birthday problem in x additions to the family given n members of the family (who haven’t doubled up) as follows:

We can then use the standard birthday problem formula to calculate the numerator and denominator:


The chart above shows that we’ll need 10 additional people (31 total) before we have a >50% chance of a birthday problem.

Accounting for all known information

However, in stats, you should always account for (condition on) all the most relevant data you have, so all our calculations that assume random, evenly-distributed birthdates go out the window once you have a due date for a baby. For example, if a baby is due on January 13th, the most appropriate calculation to make is to look specifically at the days around January 13th that might cause a birthday problem, in our case: the 6th, 14th, and 20th. If we make a simplifying assumption that the baby is equally likely to come in the seven days before or after its due date, our new probability of creating a birthday problem is 3/15 = 20%.

In conclusion, the 23rd addition to the family, Monica and Jackson’s new baby, has a 20% chance of causing a birthday problem.


Editor’s note: After writing this blog, our doctor informed us that the baby's due date is not actually until January 28th. This was what I originally had my parents read to break the news to them that we were expecting. It went even better than expected, and they did not see it coming at all.

0 comments

Recent Posts

See All

コメント


Post: Blog2_Post
bottom of page