As the best Google Analytics / BigQuery client, our question is: What data is more accurate?
I tend to tend to make BigQuery more accurate, because we can see the raw data, but we donβt know how Google Analytit uses the method to calculate its numbers.
I also think a lot of this has to do with SAMPLING.
When you calculate something as simple as Total Pageviews for a single page, Google Analytics numbers line up to BigQuery within .00001% :
sum (case when regexp_match (hits.page.pagepath, r '(? i: /contact.aspx)') and hits.type = "page" then 1 else 0 end) as total_pageviews
When you compute something more complex, like unique pages for a single page, Google Analytics numbers are 5% greater than BigQuery. Note that this is a sample for a maximum of 1 million:
count (distinct (case when regexp_match (hits.page.pagepath, r '(? i: /contact.aspx)') and hits.type = "page" then concat (fullvisitorid, string (visitid)) end), 1000000) as unique_pageviews
Iβd like to know what others think or what Google developers themselves can explain.
source share