Google Analytics reports 10x fewer sessions than the hits recorded in the server logs

Problem description

Say I have a page at mysite.com/mypage.

The Landing Pages report in GA for this URL, over a specified duration, shows a number of sessions, say 50.

For the same duration, I checked Apache's access.log with grep "GET /mypage" and got around 10x more hits, say 500.
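One thing worth ruling out is that a plain grep for "GET /mypage" is a prefix match, so it would also count URLs such as /mypage2 or /mypage/other. A minimal sketch of a stricter count follows; the function name, the regular expression, and the sample log lines are all made up for illustration, and the lines are assumed to follow Apache's common or combined log format.

```python
import re

# Matches the quoted request field of an access-log line,
# e.g. "GET /mypage HTTP/1.1" (assumes Apache common/combined format).
REQUEST_RE = re.compile(r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*"')


def count_page_hits(lines, page="/mypage"):
    """Count GET requests whose path, ignoring any query string, equals `page`."""
    hits = 0
    for line in lines:
        match = REQUEST_RE.search(line)
        if match is None or match.group("method") != "GET":
            continue
        # Strip the query string so /mypage?utm_source=x still counts.
        if match.group("path").split("?", 1)[0] == page:
            hits += 1
    return hits


# Hypothetical sample lines, not taken from the real log.
sample_hits = [
    '203.0.113.5 - - [01/Jan/2024:10:00:00 +0000] "GET /mypage HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
    '203.0.113.6 - - [01/Jan/2024:10:00:01 +0000] "GET /mypage?utm_source=facebook HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
    '203.0.113.7 - - [01/Jan/2024:10:00:02 +0000] "GET /mypage2 HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
    '203.0.113.8 - - [01/Jan/2024:10:00:03 +0000] "POST /mypage HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
]

print(count_page_hits(sample_hits))
```

If this stricter count is close to the grep count, prefix matching is not the source of the gap.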

How can there be a 10x discrepancy between GA and the server logs? Where did the hits go?

The discrepancy shows up for other durations too; I've compared several.

Before someone lists the standard reasons for this, let me point out that:

  1. A difference of 2x or 3x is understandable, but not 10x.
  2. No, this is not bot traffic. I extracted all the IPs from the logs, and 99% of them are unique, so the traffic is coming from many different IPs.
  3. I also analyzed the user agents, and they all look real (various phone models such as iPhone, Samsung, etc.).
  4. GA also says that this report is based on 100% of the data, so sampling is ruled out.
  5. As I pointed out, I'm only counting GET requests to /mypage. That is, I'm not counting asset downloads, favicon hits, etc.

I ran another test. I took all the IPs, deduplicated them, and counted how many hits came from each one. For 84% of the IPs there is no second request: they made exactly one.
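The per-IP test described above can be sketched roughly as follows. This is an illustration only: the function name and sample lines are hypothetical, and it assumes the client IP is the first space-separated field of each line, as in Apache's common and combined log formats.

```python
from collections import Counter


def single_request_share(lines):
    """Return (number of unique IPs, share of IPs that made exactly one request)."""
    # Assumption: the client IP is the first field of each log line.
    per_ip = Counter(line.split(" ", 1)[0] for line in lines if line.strip())
    if not per_ip:
        return 0, 0.0
    singles = sum(1 for count in per_ip.values() if count == 1)
    return len(per_ip), singles / len(per_ip)


# Hypothetical sample lines, not taken from the real log.
sample_ips = [
    '198.51.100.1 - - [01/Jan/2024:10:00:00 +0000] "GET /mypage HTTP/1.1" 200 512',
    '198.51.100.1 - - [01/Jan/2024:10:00:05 +0000] "GET /mypage HTTP/1.1" 200 512',
    '198.51.100.2 - - [01/Jan/2024:10:00:06 +0000] "GET /mypage HTTP/1.1" 200 512',
    '198.51.100.3 - - [01/Jan/2024:10:00:07 +0000] "GET /mypage HTTP/1.1" 200 512',
    '198.51.100.4 - - [01/Jan/2024:10:00:08 +0000] "GET /mypage HTTP/1.1" 200 512',
]

unique_ips, share = single_request_share(sample_ips)
print(unique_ips, share)
```

Running the same count over the full log (all paths, not only /mypage) would show whether those single-request IPs ever fetched CSS, JavaScript, or images, which a real browser rendering the page normally would.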

I've read "Anamoly between google analytics and server hits" and have taken care of everything listed in the accepted answer.

What could it be? Any clues on how to debug this? The traffic is coming from paid Facebook ads.

Recommended answer

Facebook has some sort of preload mechanism on mobile that fetches data for a lot of external objects, just in case the user actually wants to view them. If such a request only fetches the HTML and never runs the page's JavaScript, it appears in access.log but GA records nothing.

Apparently this is called "Facebook Liger". Check whether what is described here matches the requests you're seeing: http://inchoo.net/dev-talk/magento-website-hammering-facebook-liger/

You should be able to detect this via the User-Agent header, and maybe exclude those requests from your analytics.
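As a rough sketch of that User-Agent check, the snippet below splits log lines into requests that appear to come from the Facebook app and everything else. The tokens FBAN/, FBAV/ and FB_IAB/ are an assumption based on strings commonly seen in Facebook in-app browser user agents; verify them against your own log. They identify the Facebook app in general, not preloads specifically, so real in-app visits would be flagged too. The function names and sample lines are hypothetical, and the combined log format (user agent as the last quoted field) is assumed.

```python
import re

# Assumption: substrings commonly seen in Facebook in-app browser user agents.
FB_TOKENS = ("FBAN/", "FBAV/", "FB_IAB/")

# Assumption: combined log format, where the user agent is the last quoted field.
USER_AGENT_RE = re.compile(r'"([^"]*)"\s*$')


def is_facebook_app(line):
    """Return True if the line's user agent contains a Facebook app token."""
    match = USER_AGENT_RE.search(line)
    if match is None:
        return False
    user_agent = match.group(1)
    return any(token in user_agent for token in FB_TOKENS)


def split_hits(lines):
    """Split lines into (facebook_app_hits, other_hits)."""
    facebook, other = [], []
    for line in lines:
        (facebook if is_facebook_app(line) else other).append(line)
    return facebook, other


# Hypothetical sample lines, not taken from the real log.
sample_ua = [
    '192.0.2.10 - - [01/Jan/2024:10:00:00 +0000] "GET /mypage HTTP/1.1" 200 512 "-" '
    '"Mozilla/5.0 (iPhone; CPU iPhone OS 10_0 like Mac OS X) Mobile/14A346 [FBAN/FBIOS;FBAV/66.0]"',
    '192.0.2.11 - - [01/Jan/2024:10:00:01 +0000] "GET /mypage HTTP/1.1" 200 512 "-" '
    '"Mozilla/5.0 (iPhone; CPU iPhone OS 10_0 like Mac OS X) Version/10.0 Mobile/14A346 Safari/602.1"',
]

facebook_hits, other_hits = split_hits(sample_ua)
print(len(facebook_hits), len(other_hits))
```

Comparing the size of the non-Facebook group, plus the Facebook-app requests that went on to load page assets, against the GA session count would show how much of the gap this explains.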
