uvip Posted August 1, 2013 Share Posted August 1, 2013 I am using 1.4 and I would like to know how to no follow the cms pages? It seems google crawled all of my cms pages here and list all of them. Thx Link to comment Share on other sites More sharing options...
vekia Posted August 1, 2013 Share Posted August 1, 2013 in this case you have exclude them from index, for example in the robots.txt file Link to comment Share on other sites More sharing options...
uvip Posted August 2, 2013 Author Share Posted August 2, 2013 Thanks for your information. Could you please be more specific there? How to do this in the robots.txt? Thanks in this case you have exclude them from index, for example in the robots.txt file Link to comment Share on other sites More sharing options...
vekia Posted August 2, 2013 Share Posted August 2, 2013 here is the example of the robots.txt content with "blocked" pages # robots.txt for http://www.example.com/ User-agent: * Disallow: /cyberworld/map/ # This is an infinite virtual URL space Disallow: /tmp/ # these will soon disappear Disallow: /foo.html you have to use Disallow: /page-to-block.html Link to comment Share on other sites More sharing options...
uvip Posted August 4, 2013 Author Share Posted August 4, 2013 Thanks for your reply. But most of my cms pages are not end with .html ? Also I read this about this issue: From 1295531089: From 1290821506: Edit robots.txt in the root directory of PrestaShop and add: Disallow: /cms.php or if your shop is in a subdirectory called shop: Disallow: /shop/cms.php If you don't have a robots.txt file, create a blank one with chmod 666 permissions, then go to Tools > Generators and click "Generate robots.txt file". Hi, I followed your instructions but Google has still indexed at least one of the cms pages. Would it help if I disallowed the "content" folder which is where the cms pages appear to reside ? Or would this disallow valid content which I want indexed ? The problem is that robots.txt is for crawlers and can't be used if the pages are already in Google index. You should use no index for these pages. And it actually makes it worse to have them in robots.txt. Because the crawlers will get to robots.txt and see that it shouldn't visit these pages. So remove them from robots.txt and add noindex to them. So I am wondering if there is a solution to let google stop listing the included cms pages there since they have already been in google? How to no index them and no follow them simultaneously? Is there a good module designed for this function? Thx here is the example of the robots.txt content with "blocked" pages # robots.txt for http://www.example.com/ User-agent: * Disallow: /cyberworld/map/ # This is an infinite virtual URL space Disallow: /tmp/ # these will soon disappear Disallow: /foo.html you have to use Disallow: /page-to-block.html Link to comment Share on other sites More sharing options...
nipun Posted May 29, 2014 Share Posted May 29, 2014 hi ,how can i add no index no follow in particular CMS page Like {* /content/9-featured-products-blog *} and particular CATEGORY page Like {* /30-testing*} In PrestaShop V. 1.5.6Please Helpthank you Link to comment Share on other sites More sharing options...
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now