Twitter recently announced that you can get close to the rank of any given Twitter user with high accuracy by entering your blind count in the following formula:
exp ($ a + $ b * log (follower_count))
where $ a = 21 and $ b = -1.1
This, obviously, is much more effective than sorting the entire list of users by the followers account for a given user.
If you have a similar dataset from another social site, how could you get the values for $ a and $ b to match this dataset? In principle, a list of frequencies whose distribution is assumed to be power-law.
source
share