store | blogs | forums | twitter | facebook | wiki | mailing lists | downloads | support portal
Atomic Secure Linux
It is currently Fri May 24, 2013 4:18 pm

» Feed - Atomicorp

All times are UTC - 5 hours [ DST ]




Post new topic Reply to topic Share/Bookmark  [ 3 posts ] 
Author Message
 Post subject: a problem with google crawl
Unread postPosted: Thu Jul 28, 2011 12:09 pm 
Offline
Forum User
Forum User

Joined: Sat Jan 17, 2009 2:19 pm
Posts: 99
Hi,
I am receiving hundreds of the following trigger when Google crawls one of my customers:

Quote:
--0624ad2a-B--
GET /index.php/component/content/article/34-portada/components/index.php?option=com_content&view=article&id=583:lugares_donde_festejan_las_fiest&catid=39:peru&Itemid=59 HTTP/1.1
Host: xxxxx
Connection: Keep-alive
Accept: */*
From: googlebot(at)googlebot.com
User-Agent: Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Accept-Encoding: gzip,deflate

--0624ad2a-F--
HTTP/1.1 403 Forbidden
Content-Length: 388
Keep-Alive: timeout=5, max=100
Connection: Keep-Alive
Content-Type: text/html; charset=iso-8859-1

--0624ad2a-H--
Message: Access denied with code 403 (phase 2). Match of "rx (^-?[0-9]+$|^-?[0-9]+\\:([a-z0-9- ]+|[0-9a-z- ]+)$|^$|^[-0-9:a-z \\.\\!]+$)" against "ARGS:id" required. [file "/usr/local/apache/conf/modsec_rules/99_asl_jitp.conf"] [line "4289"] [id "390605"] [rev "13"] [msg "Atomicorp.com WAF Rules - Virtual Patch: Joomla id ARG injection"] [severity "CRITICAL"]
Apache-Error: [file "core.c"] [line 3650] [level 3] File does not exist: /403.shtml
Action: Intercepted (phase 2)
Stopwatch: 1311868697142731 18507 (3192 17657 -)
Producer: ModSecurity for Apache/2.5.13 (http://www.modsecurity.org/); 201107271800.
Server: Apache


I know that the line:
GET /index.php/component/content/article/34-portada/components/index.php?option=com_content&view=article&id=583:lugares_donde_festejan_las_fiest&catid=39:peru&Itemid=59

contains two index.php, but how google got that address to crawl?

I have checked the customer account and didn't find anything wrong, any idea on how to fix on this?

Sergio


Top
 Profile  
 
 Post subject: Re: a problem with google crawl
Unread postPosted: Thu Jul 28, 2011 12:24 pm 
Offline
Atomicorp Staff - Site Admin
Atomicorp Staff - Site Admin
User avatar

Joined: Thu Feb 07, 2008 7:49 pm
Posts: 3248
Location: Chantilly, VA
I've definitely seen the google crawler send bogus and just mangled requests, so this could be one of those cases (I've seen google send completely bogus requests, no VERB, no valid character set, just raw 8 bit noise). If you are running Jommla/Mambo on that domain, then this isnt even a valid URL. In that case, I'd report it to google as a bug in their crawler( if you are using Joomla/Mambo with that domain).

If you are not using that CMS with that domain, then you would want to disable that rule for the vhost. Try that, test the URL, if its valid then that domain must not be using Joomla/Mambo. If it doesnt work, leave the rule on and report it to google as a bug.

_________________
Michael Shinn
Atomicorp - Security For Everyone

Co-Author of Troubleshooting Linux Firewalls.


Top
 Profile  
 
 Post subject: Re: a problem with google crawl
Unread postPosted: Thu Jul 28, 2011 11:52 pm 
Offline
Forum User
Forum User

Joined: Sat Jan 17, 2009 2:19 pm
Posts: 99
Thank you Mike.

The site is using Joomla, now I need to find where to report this to google :)

Sergio


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic Share/Bookmark  [ 3 posts ] 

» Feed - Atomicorp

All times are UTC - 5 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group