Jump to content

Strange Google Problem


Recommended Posts

Hello all,

I have a strange problem I have been trying to get rid of and can't seem to figure it out.

In google webmaster tools there is a crawl error in both my sitemap and in the not found section. This error refers to my main domain of www.leeschultzart.com. Google can crawl pretty much all the rest of the pages but it returns a 404 error whenever it tries to crawl the main page.

I have heard that it might be because google sees duplicate content because I have leeschultzart.com redirected to www.leeschultzart.com and that it could be dealing with that, I'm not sure. I do see this in my access log which I think is the error message from the google bot trying to reach my index:

"GET / HTTP/1.1" 404 12914 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"



I have checked with my host and they think it has something to do with my htaccess. They recommend removing it, but I would like to keep it as friendly urls are nice to help with the ranking of my pages. I will try to turn off friendly urls and remove the htaccess and let google crawl it again to see if I still get the errors, but I wanted to ask you guys and see if you have any idea what's going on here while I do that.

Here is my htaccess:

# URL rewriting module activation
RewriteEngine on

# URL rewriting rules
RewriteRule ^([a-zA-Z0-9-]*)/([0-9]+)\-([a-zA-Z0-9-]*)\.html(.*)$ product.php?id_product=$2$4 [L,E]
RewriteRule ^([0-9]+)\-([a-zA-Z0-9-]*)\.html(.*)$ product.php?id_product=$1$3 [L,E]
RewriteRule ^([0-9]+)\-([a-zA-Z0-9-]*)(.*)$ category.php?id_category=$1 [QSA,L,E]
RewriteRule ^content/([0-9]+)\-([a-zA-Z0-9-]*)(.*)$ cms.php?id_cms=$1 [QSA,L,E]
RewriteRule ^([0-9]+)__([a-zA-Z0-9-]*)(.*)$ supplier.php?id_supplier=$1$3 [QSA,L,E]
RewriteRule ^([0-9]+)_([a-zA-Z0-9-]*)(.*)$ manufacturer.php?id_manufacturer=$1$3 [QSA,L,E]

# Catch 404 errors
ErrorDocument 404 /404.php



I appreciate any help that you all can provide.

Link to comment
Share on other sites

That's what I figured. I'm checking again with my host on the domain redirect settings. Also, I did notice something else kind of funky. In google webmaster tools the error shows up in both the "not found" section and in the "sitemap" section. So, I thought I had a bogus sitemap. I tried regenerating it with the gsitemap module and it didn't fix it. I then tried to go to www.xml-sitemaps.com and have them create me one but it timesout and can't connect to the site.

So, I really don't know if it's due to the domain setup or something with prestashop. I'm using 1.1 and i've tried looking over the bug list but I don't see anything that is really relevant.

Any other ideas?

Link to comment
Share on other sites

Where would I check the settings for that? I did see in subdomains that there is an entry for www but I haven't changed any of that so it's default I think.

Also, another interesting note is that whenever I use a header checker at http://www.seoconsultants.com/tools/headers.asp it returns a 404 code in the header. I replace the header in index.php with a 200 code and then the header checker works as well as the xml sitemap generator. However the new generated xml sitemap from the generator doesn't correct the issue either.

Link to comment
Share on other sites

Support on a webhost ussually winds up being:

"Hello, My server seems to be down."
Support: "Well, it is up for us.. why don't you try again."
"Nope, its still down..."
Support: "Its probably your internet, try now."
"Okay, it works now... yea I guess my internet is messed up"
Support: "It happens have a good day."

Behind the scenes that support guy was hitting restart 10 times on the server haha :-)

Link to comment
Share on other sites

×
×
  • Create New...