[support] Eliminating bot-noise

Tibor Liktor liktor at gmail.com
Thu Aug 30 13:34:43 UTC 2007


Hi Mike,

I was afraid I would get an answer like that, but I don't see any other solution either.

Thanks, all!

Best,
Tibor

On Thu, 30 Aug 2007 08:15:15 -0500
Michael MacKenna <mpmackenna at gmail.com> wrote:

> This may seem like a lot of work to solve your issue, but depending on how desperate you are
> it should work.  You could write a Python script that digs through the log files, skips all
> the bogus lines that match a certain pattern, and writes the lines that don't match to
> a new log file.  The end result would be a new log file that contains only the information you
> need.
> 
> Mike
> 
> 
> 
> Tibor Liktor wrote:
> On Thu, 30 Aug 2007 08:11:19 -0400
> Earnie Boyd <earnie at users.sourceforge.net> wrote:
> 
>   Quoting Tibor Liktor <liktor at gmail.com>:
> 
>     Hi,
> 
> 
> I've got a watchdog problem.
> 
> The watchdog log is essential for me to discover bugs and errors,
> monitor the site's performance, and blah blah blah, you know that.
> 
> But watchdog has become quite useless for me, because it is full of 404
> errors triggered by Google and other bots.
> 
> Now nearly 90% of the log is crap. It is impossible to dig any useful
> info out of it. (Not to mention the additional server load and database
> size issues.)
> 
> Is there any way to filter out the tons of messages caused by
> search bots?
> 
> Do you face similar issues? How do you handle them?
> 
>       There is a "filter by message type" list box that can help, but that is 
> too simple, so I must not be understanding you.  I do know that the 
> "page not found" errors tend to be ridiculous and the Drupal engine 
> itself causes several, i.e. the referrer is the site on which the 
> watchdog log is being reviewed.  It would be nice if I could filter to 
> "all messages except page not found messages"; is that what you mean?  
> You can't do it out of the box, but that doesn't mean you can't program 
> it.
> 
> Earnie -- http://for-my-kids.com/
> -- http://give-me-an-offer.com/
> 
>     
> Hi,
> 
> 
> No, I need the "normal" 404 messages, since they give me useful information about broken links
> on the site.
> 
> My problem is that Googlebot tries links that have been dead for ages, etc., and generates a
> massive number of unnecessary entries in my watchdog.
> 
> I would only like to filter by domain name, IP address, etc.
> 
> 
> Best,
> Tibor
>   
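
Mike's suggestion in the thread above (skip the lines that match a bot pattern, write the rest to a new file) could be sketched roughly as follows. This is only a minimal sketch under a few assumptions: the log is available as a plain-text file with one entry per line (for example a web server access log or an export of the watchdog entries), and the patterns in BOT_PATTERNS, the address prefix, and the function names are hypothetical placeholders to be replaced with whatever identifies the bots on your own site.

```python
import re
from typing import Iterable, Iterator

# Hypothetical patterns: replace them with the user agents, host names
# or address prefixes that actually show up in your own log.
BOT_PATTERNS = [
    re.compile(r"Googlebot", re.IGNORECASE),       # match on user agent text
    re.compile(r"\b66\.249\.\d{1,3}\.\d{1,3}\b"),  # example address prefix (an assumption)
]


def is_bot_line(line: str) -> bool:
    """Return True if the line matches any of the bot patterns."""
    return any(pattern.search(line) for pattern in BOT_PATTERNS)


def filter_lines(lines: Iterable[str]) -> Iterator[str]:
    """Yield only the lines that do not look like bot traffic."""
    for line in lines:
        if not is_bot_line(line):
            yield line


def filter_log(src_path: str, dst_path: str) -> int:
    """Copy the non-bot lines from src_path to dst_path; return how many were kept."""
    kept = 0
    with open(src_path, encoding="utf-8", errors="replace") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in filter_lines(src):
            dst.write(line)
            kept += 1
    return kept
```

Calling filter_log("access.log", "access.filtered.log") (both file names are made up) would leave the original file untouched and write a second file without the bot entries. Because only lines matching a bot pattern are dropped, the "normal" 404 entries from real visitors stay in the output.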

