BuzzLogic

Posted by joy


Oh, wait, just when you thought we were done here with research services for the Google impaired, there is yet another one. Buzzlogic has been sending out their crawler for the past few weeks to this blog and by happenstance currently has a private beta for companies.

What is different about BuzzLogic’s crawler though is that it’s revealing a referrer which really, honestly should not be seen in the Web logs. Also, their crawler does not have any identifying information in the User Agent field. Here’s an example.

The questionable referrer, which I am seeing via Sitemeter looks like this:

[file:///data/thumbnailer/work/home-2006-11-17-17:21:16.438/2006-11-19-07:37:13.838-in.html

If I had to guess, however BuzzLogic compiles the collected data into a static HTML file. I’ve seen that static HTML file change day by day, each with a different time/date stamp for each individual instance it hits my Web server.

This is what I see via my Web logs.
Host: 64.34.246.44 (I was only able to connect this to BuzzLogic through a traceroute of the IP address. The BuzzLogic Web server is hosted on what seems to be a completely different hosting provider.)
/wp-content/plugins/sociable/images/reddit.png (This crawler is hitting my image files for some reason.)
Http Code: 200 Date: Nov 19 10:37:14 Http Version: HTTP/1.1 Size in Bytes: 5943
Referer: -
Agent: Mozilla/5.0 (compatible; Konqueror/3.5; Linux) KHTML/3.5.4 (like Gecko)

[tags]bot, crawler, scraper, buzzlogic, brand monitoring services, search engine challenged PR firms [/tags]


One Response to “BuzzLogic”

  1. Bitch | Lab Says:

    Did you ever learn more about this? I started seeing this the same time you did, and it coincided with an attack my blog, which the hosting company shut down because we were burning up our processing time, knocking out service to other clients. so they say.

    what bugged me was that, when the site was offline, this crawler always happened to be at the site at the same time. Now, of course, it’s not offline and it’s shown up again, so there’s probably no connection.

    it looks like they’re hitting up images files, thumnailing them, and creating an alternative to the popular image search engines, google and yahoo. yes? dunno, so wondered if you had klew.

Leave a Reply