Site Search

Find sitemap.xml, robots.txt and ads.txt - Real-Time Search

Not http https www subdomain

thefaceshop.com.vn robots.txt

thefaceshop.com.vn
Site: thefaceshop.com.vn

The robots.txt file is a file at the root a domain indicates parts of the site should not be accessed by search engine crawlers.

(*) Thumbnail Screenshots by ShrinkTheWeb



# we use Haravan as our ecommerce platform

User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /carts
Disallow: /orders
Disallow: /checkout
Disallow: /checkouts
Disallow: /account
Disallow: /collections/*+*
Disallow: /collections/*%2B*
Disallow: /collections/*%2b*
Disallow: /blogs/*+*
Disallow: /blogs/*%2B*
Disallow: /blogs/*%2b*
Disallow: /*facebook_store_view*
Disallow: /*5giay_store_view*
Disallow: /*webtretho_store_view*
Disallow: /discount/*
Disallow: /apple-app-site-association
Sitemap: https://thefaceshop.com.vn/sitemap.xml

# Google adsbot ignores robots.txt unless specifically named!
User-agent: adsbot-google
Disallow: /checkout
Disallow: /checkouts
Disallow: /carts
Disallow: /orders
Disallow: /discount/*
Disallow: /*facebook_store_view*
Disallow: /*5giay_store_view*
Disallow: /*webtretho_store_view*

User-agent: Nutch
Disallow: /

User-agent: MJ12bot
Crawl-delay: 10

User-agent: Pinterest
Crawl-delay: 1

              
Sunday, 24 March 2019