Dave Rogers
Bandaged ice that stampedes inexpensively through
This is a problem in statistics that's cropped up for me yesterday. I have a feel for the numbers, but I'd like some other opinions just to satisfy my curiosity. This is one for the maths and statistics geeks.
As I've mentioned before, I was diagnosed with chronic myeloid leukemia (CML) in December 2023. It's under control with drugs, though there are some issues with side effects at the moment, and the likelihood is that I'll die with it, not of it. In cancer treatment that's a win.
Very soon after diagnosis, I found that my wife knows someone, also in Ipswich, who was diagnosed with CML a few months before me. I also found out yesterday that a co-worker has a relative who was recently diagnosed with CML. The question that occurred to me is, how likely is it, given the incidence rate, that I should have two people in the same town, with the same disease, diagnosed in the same three-year period, within two degrees of separation of me?
There are some useful numbers available. Incidence of CML is about 850 new cases per year in the UK (numbers from 2017-19 NHS records). UK population is about 66 million, and Ipswich population about 133,000, suggesting about 1.7 average new cases per year in Ipswich. That means there should be 5 people diagnosed with CML in 2023, 2024 and 2025. There are no known causes or associations for CML, only a correlation with age, so it seems to behave like random chance. Degrees of separation is more tricky; I've heard it asserted that most people have about 100 friends and associates, and overlaps mean that about 1000 people are within 2 degrees.
So we have the situation that a sample of order 1000 people, in a town of 133,000 people, contains three out of five randomly selected individuals from that population. We can remove one of those (me) from the statistics, avoiding the Texas Sharpshooter fallacy, but the associations with the other two have no causal connection with CML (these people were within 2 degree of separation before any of us were diagnosed). The expectation, I think, would be a few per cent chance that I'm within 2 degrees of one other CML sufferer, but less than 1% chance of two. So the question is, how unlikely is this situation? And, if it is, does that suggest the CML incidence rates are higher than expected?
Sorry this came out so lengthy, but statistical questions are rarely simple. Thoughts, anyone?
Dave
As I've mentioned before, I was diagnosed with chronic myeloid leukemia (CML) in December 2023. It's under control with drugs, though there are some issues with side effects at the moment, and the likelihood is that I'll die with it, not of it. In cancer treatment that's a win.
Very soon after diagnosis, I found that my wife knows someone, also in Ipswich, who was diagnosed with CML a few months before me. I also found out yesterday that a co-worker has a relative who was recently diagnosed with CML. The question that occurred to me is, how likely is it, given the incidence rate, that I should have two people in the same town, with the same disease, diagnosed in the same three-year period, within two degrees of separation of me?
There are some useful numbers available. Incidence of CML is about 850 new cases per year in the UK (numbers from 2017-19 NHS records). UK population is about 66 million, and Ipswich population about 133,000, suggesting about 1.7 average new cases per year in Ipswich. That means there should be 5 people diagnosed with CML in 2023, 2024 and 2025. There are no known causes or associations for CML, only a correlation with age, so it seems to behave like random chance. Degrees of separation is more tricky; I've heard it asserted that most people have about 100 friends and associates, and overlaps mean that about 1000 people are within 2 degrees.
So we have the situation that a sample of order 1000 people, in a town of 133,000 people, contains three out of five randomly selected individuals from that population. We can remove one of those (me) from the statistics, avoiding the Texas Sharpshooter fallacy, but the associations with the other two have no causal connection with CML (these people were within 2 degree of separation before any of us were diagnosed). The expectation, I think, would be a few per cent chance that I'm within 2 degrees of one other CML sufferer, but less than 1% chance of two. So the question is, how unlikely is this situation? And, if it is, does that suggest the CML incidence rates are higher than expected?
Sorry this came out so lengthy, but statistical questions are rarely simple. Thoughts, anyone?
Dave