Oystein
Penultimate Amazing
- Joined
- Dec 9, 2009
- Messages
- 18,903
I am following a Facebook page and monitor the development of the number of "Likes" it has.
For a while now, I have written down the number of likes every day around the time of my second coffee in the morning (so sample frequency wasn't exactly constant). I found that the number increases by between 20 and 80 on most days.
However, this is a net gain: They get more than 20-80 new "likes" per day, but some users no doubt "unlike" them, or delete their account.
When I sample from hour to hour, I see that sometimes they have a net loss.
When I sample every 5 minutes, I get an even clearer picture.
What I want to figure out is: Is it possible, and how, to estimate from such sampling how many total new "likes" and how many "unlikes" there are per time intervall?
To give you an idea, here are the numbers for the last 3 hours, sampled at pretty constant 5-minute intervals:
minutes|likes|gain|loss
0|264067||
5|264067||
10|264067||
15|264067||
20|264067||
25|264067||
30|264067||
35|264066||-1
40|264063||-3
45|264064|1|
50|264063||-1
55|264062||-1
60|264062||
65|264064|2|
70|264063||-1
75|264063||
80|264064|1|
85|264064||
90|264064||
95|264064||
100|264063||-1
105|264063||
110|264063||
115|264063||
120|264063||
125|264063||
130|264063||
135|264063||
140|264063||
145|264064|1|
150|264064||
155|264064||
160|264064||
165|264064||
170|264065|1|
175|264065||
180|264065||
185|264065||
190|264065||
195|264065||
200|264065||
205|264066|1|
In total, only 1 "like" was lost, but I observed 7 gained and 8 lost in the meantime.
In many intervals, the number doesn't change. This will often mean there was neither a "like" gained nor lost. But could sometimes mean that 1 was lost, 1 gained. Or +2-2. Etc.
So the net loss -1 could be +7-8, or +8-9, or +9-10, etc.
Assuming that likes and unlikes arrive randomly (should be normally distributed), but realizing that rates may change during the day and from day to day, is there a way to use such sampling to figure out how many likes and unlikes there really are?
For a while now, I have written down the number of likes every day around the time of my second coffee in the morning (so sample frequency wasn't exactly constant). I found that the number increases by between 20 and 80 on most days.
However, this is a net gain: They get more than 20-80 new "likes" per day, but some users no doubt "unlike" them, or delete their account.
When I sample from hour to hour, I see that sometimes they have a net loss.
When I sample every 5 minutes, I get an even clearer picture.
What I want to figure out is: Is it possible, and how, to estimate from such sampling how many total new "likes" and how many "unlikes" there are per time intervall?
To give you an idea, here are the numbers for the last 3 hours, sampled at pretty constant 5-minute intervals:
0|264067||
5|264067||
10|264067||
15|264067||
20|264067||
25|264067||
30|264067||
35|264066||-1
40|264063||-3
45|264064|1|
50|264063||-1
55|264062||-1
60|264062||
65|264064|2|
70|264063||-1
75|264063||
80|264064|1|
85|264064||
90|264064||
95|264064||
100|264063||-1
105|264063||
110|264063||
115|264063||
120|264063||
125|264063||
130|264063||
135|264063||
140|264063||
145|264064|1|
150|264064||
155|264064||
160|264064||
165|264064||
170|264065|1|
175|264065||
180|264065||
185|264065||
190|264065||
195|264065||
200|264065||
205|264066|1|
In total, only 1 "like" was lost, but I observed 7 gained and 8 lost in the meantime.
In many intervals, the number doesn't change. This will often mean there was neither a "like" gained nor lost. But could sometimes mean that 1 was lost, 1 gained. Or +2-2. Etc.
So the net loss -1 could be +7-8, or +8-9, or +9-10, etc.
Assuming that likes and unlikes arrive randomly (should be normally distributed), but realizing that rates may change during the day and from day to day, is there a way to use such sampling to figure out how many likes and unlikes there really are?
)