Posts Tagged ‘ compete ’

Which Traffic Stats and Auditing Tools Should you Use for your Web Site?

Saturday, May 8th, 2010

All webmasters need to know what kind of traffic their web sites are getting. The choices to keep tabs on site statistics are varied, but there’s three that are completely free and among the most widely used: Awstats, Sitemeter, and Google Analytics. Awstats is a program written in perl for all platforms, while both Sitemeter and Google Analytics are web-based tools requiring no installation.

Image by Christian Ferrari

If you don’t run your own server, and your hosting company doesn’t provide Awstats pre-installed, it isn’t going to be your solution and you’ll have to use one of the web-based services instead.  If you do have a shell account but are too intimidated to do anything that involves typing commands into a non-graphical interface, then once again stick to the web-based options. If you’re looking for the most comprehensive statistics, Awstats does  generate the most in-depth reports for a single site.  Awstats needs to be set-up as a daily recurring job (crontab in Linux/Unix servers), and uses your server’s log files. It supports Apache style logs as well as those produce by IIS (Windows) and even parses and produces statistics from mail logs. You run an interactive non-GUI configuration script to install Awstats on your server. If you want to get more sophisticated geographical statistics, you’ll need to install one of the geo plugins (perl modules, in reality) and the corresponding database. MaxMind offers its free geoip database and perl module. Because Awstats uses your web server’s logs, it won’t have any statistical holes such as can happen with web-based services. With a web-based service, the visitor’s browser could fail to download the javascript, either accidentally or by  filtering it out, that is needed to report the user’s activity back to the stats provider’s site. Awstats can also give you statistics on the many robots that visited your site, whereas a web-based tracker can’t provide such data since spiders don’t download javascript files, let alone interpret them.

Sitemeter has been around for many years and until Google Analytics came along, it was the favorite statistical tool used by hundreds of thousands of bloggers. All you have to do to get this free service is to sign-up on their site and then paste a bit of javascript in your site’s footer, so that accesses to all pages get metered. Each account you open can only cover one site, thus if you have ten sites, you’ll have to register ten times. To view your sites’ statistics, you have to log-in to their web site.  Once there you can view details on the most recent activity on your site. Statistics for previous days or months are limited to totals. A lot of the other statistics are limited to the last 100 visitors to your site. Thus, you can see that someone from Japan visited recently, but you can’t tell how many Japanese visited in the last hour, day or week. You can get enhanced statistics only by upgrading to a more advanced plan, which of course involves a monthly payment. Apart from these limitations, Sitemeter has  been frequently criticized for throwing a copious number of tracking cookies at your site’s visitors on behalf of an advertiser, SpecificClick.net.

Google is the most trusted entity on the Internet, and thus it’s no surprise that its Analytics service has quickly become the main site traffic  tracking  tool used by most sites. Analytics requires that you register for a Google mail account and paste a bit of javascript into your site’s footer, similar to Sitemeter. You view your accumulated statistics via their web site, also like Sitemeter. The major difference between Analytics and Sitemeter, is that you can access all your web sites’ statistics via the same interface, whereas Sitemeter requires a different log-in for each.  Having everything in one place is a great convenience.  Analytics gives you every feature up front and free. No need to buy an upgrade.  You can review detailed statistics for any date or date range, even well in the past, something that’s not possible with Sitemeter’s free plan. There is one drawback to Analytics, although it’s rather insignificant: statistics are compiled only once a day. Sitemeter allows you to see the latest activity, but the truth is, that’s not so useful except to determine if there’s any problem with your site.  Many big name sites use Google Analytics.

To resume, a statistical compiler such as Awstats is a lot more complicated to set-up, but does provide the most reliable and complete data. On the other hand, web-based statistical sites are extremely easy to set-up and use, but can’t provide stats on anything that isn’t a web page, such as images, flash, or Java applets, and also can’t keep track of accesses by web crawlers. The best option among the web-based stats services is Google Analytics. However, if you have a site with a lot of traffic and need hard core traffic reports, use Awstats. You can still use Google Analytics as a secondary source of statistics.

If you also need  to have a credible external audit of your site’s traffic, perhaps to show the public, the competition, or a potential buyer that you’re a serious contender,  you’ll have to use a reputable web-based auditing service for this. You can sign-up for a free account at Quancast, or pay hefty fees to services like Nielsen, comScore or Compete. Typically, you’ll have to paste some javascript into your site’s footer. Hopefully, such code won’t clash with your Analytics or Sitemeter javascript, but if you really need an an audit service, it’s worth switching to using Awstats for your internal traffic reports.

Related Articles:

Are Alexa, Quantcast and Compete Credible?

Monday, April 5th, 2010

It’s puzzling why so many otherwise intelligent people think the web site statistics purveyed by Alexa and Compete.com are reliable, when in reality they’re usually rough estimates which they derive from their own data sources, not actual metering from the web sites they list. If you’re the owner of one of the web sites whose performance is being low balled in this manner, you’re probably quite upset knowing the negative fallout this could produce.

Image by Dominik Gwarek

How rough are their numbers? I checked a web site I own on both these free services. I know from using Google Analytics that it gets about 35,000 unique visitors a month, 9,630 of which originate from the USA, yet the US unique visitor traffic figure was off by over 50 percent in Compete. Meanwhile in Alexa, which doesn’t give an estimate, the same web site ranked in the 770,000 range, while sites that have much less traffic than mine ranked in the 200,000 range. Testing Quantcast, which also only bothers with traffic originating in the USA, the uniques number was closer to the mark for the same web site (8,900) . A lucky guess?

What do they use to come up with these numbers? Alexa, for its part, has a help page that attempts to answer this question: “Alexa computes traffic rankings by analyzing the Web usage of millions of Alexa Toolbar users and data obtained from other, diverse traffic data sources.” Alexa toolbar users? How many of those millions actually use it? Furthermore, “diverse traffic data sources” is mightily vague. Perhaps they mean Alexa web site searches? Millions of Alexa toolbar users still can’t replace the accuracy of actual metering on the real web sites. Oddly, Alexa doesn’t offer any such device, which I’m sure millions of web site owners would be glad to add to their home page to set the record straight.

Alexa does utter this disclaimer on their site, to be fair (excerpted): “The traffic data are based on the set of toolbars that use Alexa data, which may not be a representative sample of the global Internet population. To the extent that our sample of users differs from the set of all Internet users, our traffic estimates may over- or under-estimate the actual traffic to any particular site…Sites with relatively low traffic will not be accurately ranked by Alexa…Generally, traffic rankings of 100,000 and above should be regarded as not reliable“. So, from the horse’s mouth: the stats for most sites in their database mean nothing! I suspect that the toolbar is a strong marketing tool for Alexa and they have less interest in accuracy.

Compete.com, on the other hand, takes the high road in its help page: “Compete has developed a unique methodology created by experts in the fields of mathematics, statistics and the data sciences to aggregate, transform, enhance and normalize data in order to estimate U.S. Internet traffic.” Then they go on to say that their numbers are arrived by monitoring the activity of two million members of the Compete community, which I believe to mean people who register on their web site, the vast majority of whom would have opted for the free plan, being that the next plan up costs 199 dollars a month! Looking over what services a free account provides, I fail to see how meaningful statistics can be extracted from their activities on the site.

As for tools to meter your own site’s traffic, that doesn’t appear to be an option with Compete until you buy the enterprise package, which entails having to call them for the cost (uh-oh!). Compete gives the appearance of being the luxury web site metrics provider. I conclude that unless you’re part of that exclusive club, their stats for your site are meaningless! I also found it enormously humorous that they describe their methodology as being superior to that of Google Analytics, going on and on about how deleted cookies and users with multiple computers can seriously alter unique visitor tracking. Yes, using actual data is sub-par as opposed to estimates!

Quantcast is the only one of the three to have estimates that have some semblance in reality, Their explanation for how they get their data: “We collect directly-measured data from the millions of web destinations controlled by our Quantified Publishers.” I don’t know what that means, but it sounds serious! The good thing about Quantcast is that they do give you free traffic monitoring for your site. You just paste a little bit of code in your site’s footer just as you would for Google Analytics.

Compete, Alexa, and Quantcast aren’t the only sources of free site metrics, but they’re the ones most frequently consulted. Of the three, Quantcast is the only one that gets a passing grade, for acting responsibly. Alexa and Compete, however, are knowingly peddling largely inaccurate data, and for this reason should be shunned. Note that Google has a free product: Google Trends. It currently doesn’t include smaller sites, but instead of inventing data for them, it just tells you they don’t have any. Google, the trustworthy giant of the web, will hopefully extend this service to you and I!

Related Articles: