Hi everyone, I'm using PrestaShop 8.1.5 with the Faceted search module (version 4.0.0), and Google Search Console is reporting a lot of link issues. I noticed that crawlers follow every filter I apply, generating one link per filter. As a result I have thousands of useless links that keep getting crawled when they shouldn't be. Attached are a screenshot of some of the affected links in Google Search Console and my module settings.

I have around 5,000 products in my catalogue and more than 36,000 wrongly generated filter links.

I compared all the settings with my other website (PrestaShop 1.7.8.7), which doesn't have this issue, so it isn't a settings problem.

Do you think it's a bug in the module, or could it be something else?
Thanks

Nicola

link.jpg

faceted.jpg


Hi Mediacom87, thanks for your reply. I already tried adding what you suggested some time ago, but the crawler ignores robots.txt.

Months ago I added:
Disallow: /*?n=
Disallow: /*?q=
But I still have the same issue.

OK, I'll have a look at the module, but I'm wondering why the native Faceted search module works in PS 1.7 and not in 8.1.x. Could it also be a problem with the theme I'm using?


In reply to bosco91's post of 3/4/2025 at 7:33 AM:

The problem comes from the design of the module, which outputs a link for every filter. Google now also uses data from Google Chrome (and yes, Google collects data from all sites visited with this browser), so rules in robots.txt and rel="nofollow" on links are no longer respected.

You therefore need to tell Google explicitly, via a robots meta tag, that it must not index the page in question.
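As a rough illustration of that idea, here is a minimal standalone sketch in plain PHP. The function name `robotsDirective` is hypothetical and not part of PrestaShop or the Faceted search module, and it assumes the filter URLs are the ones carrying the `q` or `n` query parameters mentioned in this thread.

```php
<?php
// Hypothetical standalone helper, not part of PrestaShop or the Faceted search module.
// Returns the robots directive for a request, given its query parameters.
function robotsDirective(array $query): string
{
    // 'q' and 'n' are the parameters named in this thread; adjust if your URLs differ.
    foreach (['q', 'n'] as $param) {
        if (isset($query[$param]) && $query[$param] !== '') {
            return 'noindex,follow';
        }
    }

    return 'index,follow';
}

// Render the tag for the current request.
echo '<meta name="robots" content="' . robotsDirective($_GET) . '">' . PHP_EOL;
```

Note that a crawler can only see this tag on pages it is allowed to fetch, so it has no effect on URLs that robots.txt still blocks.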

To manage this easily, if you don't know how to code it, you can use this kind of module, which can handle this case.


Maybe adding this code would solve the problem:

if (Tools::getValue('q') || Tools::getValue('n')) {
    $this->context->controller->meta['robots'] = 'noindex,follow';
}

The other option is the module.
In the end the problem was not so much Google as other bots, especially ChatGPT.


In reply to bosco91's post of 3/4/2025 at 11:52 AM:

Hello,
in which file should I add the indicated code?
Thank you.


  • 2 weeks later...

I have the same issue with bots ignoring robots.txt; the MySQL server is being overloaded by constant queries from the Faceted search module.
I decided to override FrontController so that the default $page.meta.robots variable is used.

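<?php
// Standard override location: override/classes/controller/FrontController.php
class FrontController extends FrontControllerCore
{
    public function getTemplateVarPage()
    {
        $page = parent::getTemplateVarPage();
        // URLs carrying the 'q' or 'n' query parameter (the ones named in this thread):
        // ask crawlers not to index them, but still follow their links.
        if (Tools::getValue('q') || Tools::getValue('n')) {
            $page['meta']['robots'] = 'noindex,follow';
        }

        return $page;
    }
}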

 

Edited by WisQQ