Calladion Posted August 31, 2009 Share Posted August 31, 2009 Hello all,I have a strange problem I have been trying to get rid of and can't seem to figure it out.In google webmaster tools there is a crawl error in both my sitemap and in the not found section. This error refers to my main domain of www.leeschultzart.com. Google can crawl pretty much all the rest of the pages but it returns a 404 error whenever it tries to crawl the main page.I have heard that it might be because google sees duplicate content because I have leeschultzart.com redirected to www.leeschultzart.com and that it could be dealing with that, I'm not sure. I do see this in my access log which I think is the error message from the google bot trying to reach my index: "GET / HTTP/1.1" 404 12914 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" I have checked with my host and they think it has something to do with my htaccess. They recommend removing it, but I would like to keep it as friendly urls are nice to help with the ranking of my pages. I will try to turn off friendly urls and remove the htaccess and let google crawl it again to see if I still get the errors, but I wanted to ask you guys and see if you have any idea what's going on here while I do that.Here is my htaccess: # URL rewriting module activation RewriteEngine on # URL rewriting rules RewriteRule ^([a-zA-Z0-9-]*)/([0-9]+)\-([a-zA-Z0-9-]*)\.html(.*)$ product.php?id_product=$2$4 [L,E] RewriteRule ^([0-9]+)\-([a-zA-Z0-9-]*)\.html(.*)$ product.php?id_product=$1$3 [L,E] RewriteRule ^([0-9]+)\-([a-zA-Z0-9-]*)(.*)$ category.php?id_category=$1 [QSA,L,E] RewriteRule ^content/([0-9]+)\-([a-zA-Z0-9-]*)(.*)$ cms.php?id_cms=$1 [QSA,L,E] RewriteRule ^([0-9]+)__([a-zA-Z0-9-]*)(.*)$ supplier.php?id_supplier=$1$3 [QSA,L,E] RewriteRule ^([0-9]+)_([a-zA-Z0-9-]*)(.*)$ manufacturer.php?id_manufacturer=$1$3 [QSA,L,E] # Catch 404 errors ErrorDocument 404 /404.php I appreciate any help that you all can provide. Link to comment Share on other sites More sharing options...
CYTechnologies Posted August 31, 2009 Share Posted August 31, 2009 Well the current htaccess file you have here doesn't do any redirection or addition of the www. - So how are you doing this currently? Link to comment Share on other sites More sharing options...
Calladion Posted August 31, 2009 Author Share Posted August 31, 2009 Right now it's being handled by the host with the domain settings somehow. I was just kind of wondering if the redirect would be causing google to error out on the index. Link to comment Share on other sites More sharing options...
CYTechnologies Posted September 1, 2009 Share Posted September 1, 2009 Its possible, that htaccess file isn't doing anything that would hurt your index file. Link to comment Share on other sites More sharing options...
Calladion Posted September 1, 2009 Author Share Posted September 1, 2009 That's what I figured. I'm checking again with my host on the domain redirect settings. Also, I did notice something else kind of funky. In google webmaster tools the error shows up in both the "not found" section and in the "sitemap" section. So, I thought I had a bogus sitemap. I tried regenerating it with the gsitemap module and it didn't fix it. I then tried to go to www.xml-sitemaps.com and have them create me one but it timesout and can't connect to the site.So, I really don't know if it's due to the domain setup or something with prestashop. I'm using 1.1 and i've tried looking over the bug list but I don't see anything that is really relevant.Any other ideas? Link to comment Share on other sites More sharing options...
CYTechnologies Posted September 1, 2009 Share Posted September 1, 2009 Make sure your store settings are directing everyone to www - you could have a loop thats causin the timeout, your domain is directing to www. and prestashop is redirecting back to no www. Link to comment Share on other sites More sharing options...
Calladion Posted September 1, 2009 Author Share Posted September 1, 2009 Where would I check the settings for that? I did see in subdomains that there is an entry for www but I haven't changed any of that so it's default I think.Also, another interesting note is that whenever I use a header checker at http://www.seoconsultants.com/tools/headers.asp it returns a 404 code in the header. I replace the header in index.php with a 200 code and then the header checker works as well as the xml sitemap generator. However the new generated xml sitemap from the generator doesn't correct the issue either. Link to comment Share on other sites More sharing options...
Calladion Posted September 4, 2009 Author Share Posted September 4, 2009 I now have disable friendly urls and the htaccess and the errors remain. Link to comment Share on other sites More sharing options...
CYTechnologies Posted September 4, 2009 Share Posted September 4, 2009 Did your host say anything else about the issue? Link to comment Share on other sites More sharing options...
Calladion Posted September 4, 2009 Author Share Posted September 4, 2009 They are still blaming it on the htaccess. I just posted a reply to them and I'll see what they say. Support isn't the best. Link to comment Share on other sites More sharing options...
CYTechnologies Posted September 4, 2009 Share Posted September 4, 2009 Support on a webhost ussually winds up being:"Hello, My server seems to be down."Support: "Well, it is up for us.. why don't you try again.""Nope, its still down..."Support: "Its probably your internet, try now.""Okay, it works now... yea I guess my internet is messed up"Support: "It happens have a good day."Behind the scenes that support guy was hitting restart 10 times on the server haha :-) Link to comment Share on other sites More sharing options...
Recommended Posts