Site Search

Find sitemap.xml, robots.txt, ads.txt, app-ads.txt and sellers.json - Real-Time Search

Not http https www subdomain

maurice-garcin.fr robots.txt

maurice-garcin.fr
Site: maurice-garcin.fr

The robots.txt file is a file at the root a domain indicates parts of the site should not be accessed by search engine crawlers.

(*) Thumbnail Screenshots by Thumbshots



User-agent: * 
Disallow: /catalog/admin
Disallow: /download
Disallow: /cache
Disallow: /segments
Allow: /templates/gnimmo/catalog/images/
Disallow: /tmp
Disallow: /partenaires
Disallow: /log
Disallow: /flex
Disallow: /test
Disallow: /catalog/catalogues/
Disallow: /*.pdf
Disallow: *.pdf
Disallow: /catalog/download/
Disallow: /catalog/font/
Disallow: /catalog/gtk_update/
Disallow: /catalog/img/
Disallow: /catalog/includes/
Disallow: /catalog/pub/
Disallow: /catalog/temp/
Disallow: /catalog/test/
Disallow: /catalog/video/
Disallow: /catalog/mentions.php
Disallow: /catalog/rss.php
Disallow: /catalog/products_print.php
Disallow: /catalog/diaporama.php
Disallow: /catalog/rss.php
Disallow: *action=update_search*
Disallow: *ajax.php*
Disallow: *jqueryajaxagent*
Disallow: *advanced_search_result*
Disallow: *sort=*
Disallow: *nego_id*
Disallow: *simul_credit.php*
Disallow: *cPath*
Disallow: *manufacturers*
Disallow: *products_print*
Disallow: *search_requests.php*



User-agent: DomainCrawler/3.0
disallow: /

User-agent: TurnitinBot
Disallow: /

User-agent: ConveraCrawler
Disallow: /

User-agent: QuepasaCreep
Disallow: /

User-agent: Jetbot
Disallow: /

# Deny Soso spider in the site
User-agent: Soso
Disallow: /

# Deny Yandex spider in the site
User-agent: Yandex
Disallow: /

User-agent: Qwantify
Disallow: /

Sitemap: https://www.maurice-garcin.fr/catalog/sitemaps.php
              
Saturday, 8 August 2020