Site Search

Find sitemap.xml, robots.txt, ads.txt, app-ads.txt and sellers.json - Real-Time Search

Not http https www subdomain robots.txt

The robots.txt file is a file at the root a domain indicates parts of the site should not be accessed by search engine crawlers.

(*) Thumbnail Screenshots by Thumbshots


### Google ###
User-agent: Googlebot
User-agent: Googlebot-Mobile

Allow: /*?order=recommended
Allow: /*&orderDir=
Allow: /*&rooms=
Allow: /*&adults=
Allow: /*&page=

Disallow: /zh/*
Disallow: /hk/*
Disallow: /ko/*
Disallow: /ja/*
Disallow: /en/USD/*
Disallow: /fr/USD/*
Disallow: /es/USD/*
Disallow: /de/USD/*
Disallow: /it/USD/*
Disallow: /pt/USD/*

User-agent: Googlebot-Image
Allow: /

### Bing / Yahoo ###
User-agent: Bingbot
User-agent: BingPreview
User-agent: MSNbot
User-agent: msnbot-media
User-agent: Slurp
User-agent: Yahoo! Slurp
Allow: /*?order=recommended
Allow: /*&orderDir=
Allow: /*&rooms=
Allow: /*&adults=
Allow: /*&page=
Disallow: /*?*

### Yandex / ###
User-agent: Yandex
User-agent: YandexBot
User-agent: YandexMobileBot
User-agent: Mail.Ru
User-agent: Mail.RU_Bot 
Disallow: /*?
Disallow: /fr/
Disallow: /it/
Disallow: /de/
Disallow: /pt/
Disallow: /es/

### DuckDuckBot / Qwant / Orange ###
User-agent: DuckDuckBot
User-agent: DuckDuckBot/1.1
User-agent: DuckDuckGo-Favicons-Bot
User-agent: Qwantify
User-agent: OrangeBot
Allow: /*?order=recommended
Allow: /*&orderDir=
Allow: /*&rooms=
Allow: /*&adults=
Allow: /*&page=
Disallow: /*?*

### Baidu / Naver / / Soso ###
User-agent: Baiduspider
User-agent: Baiduspider-image
User-agent: naver
user-agent: Yeti
User-agent: HaoSouSpider
User-agent: Sosospider
Allow: /*?order=recommended
Allow: /*&orderDir=
Allow: /*&rooms=
Allow: /*&adults=
Allow: /*&page=
Disallow: /*?*

### Ask / Robozilla / Blekko / BlitzBOT ###
User-agent: Teoma
User-agent: Robozilla
User-agent: ScoutJet
User-agent: BlitzBOT
Allow: /*?order=recommended
Allow: /*&orderDir=
Allow: /*&rooms=
Allow: /*&adults=
Allow: /*&page=
Disallow: /*?*

### Facebook / Twitter & tools ###
User-agent: Facebot
User-agent: Twitterbot
User-agent: SEMrushBot
User-agent: MJ12bot
Allow: /

### Dirty & Impolite Bots ####

User-agent: 007ac9
User-agent: 008
User-agent: asterias
User-agent: BacklinkCrawler
User-agent: BackDoorBot/1.0
User-agent: bdbrandprotect
User-agent: BuiltBotTough
User-agent: Bullseye/1.0
User-agent: BlowFish/1.0
User-agent: BotALot
User-agent: Black Hole
User-agent: BunnySlippers
User-agent: BPImageWalker
User-agent: CheeseBot
User-agent: CherryPicker
User-agent: CherryPickerSE/1.0
User-agent: CherryPickerElite/1.0
User-agent: Cliqzbot
User-agent: CopyRightCheck
User-agent: Cegbfeieh
User-agent: Crescent
User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0
User-agent: cosmos
User-agent: DIIbot
User-agent: Download Ninja
User-agent: DittoSpyder
User-agent: DOC
User-agent: URL_Spider_Pro
User-agent: EmailCollector
User-agent: EmailSiphon
User-agent: EmailWolf
User-agent: ExtractorPro
User-agent: EroCrawler
User-agent: Fetch
User-agent: eCatch
User-agent: psbot
User-agent: Ezooms
User-agent: findlinks
User-agent: Foobot
User-agent: Gigabot
User-agent: grub-client
User-agent: Harvest/1.5
User-agent: HTTrack
User-agent: HTTrack 3.0
User-agent: httplib
User-agent: hloader
User-agent: humanlinks
User-agent: ia_archiver
User-agent: InfoNaviRobot
User-agent: Jetbot
User-agent: JennyBot
User-agent: Keyword Density/0.9
User-agent: k2spider
User-agent: Kenjin Spider
User-agent: LNSpiderguy
User-agent: LinkextractorPro
User-agent: lwp-trivial/1.34
User-agent: lwp-trivial
User-agent: libWeb/clsHTTP
User-agent: LinkWalker
User-agent: LinkScan/8.1a Unix
User-agent: LexiBot
User-agent: LinkWalker
User-agent: larbin
User-agent: libwww
User-agent: linko
User-agent: Mata Hari
User-agent: moget
User-agent: moget/2.1
User-agent: MSIECrawler
User-agent: MIIxpc/4.2
User-agent: MIIxpc
User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95)
User-agent: Microsoft URL Control - 5.01.4511
User-agent: Microsoft URL Control - 6.00.8169
User-agent: Mister PiX
User-agent: Microsoft.URL.Control
User-agent: NICErsPRO
User-agent: NetMechanic
User-agent: NPBot
User-agent: NetAnts
User-agent: Net Vampire
User-agent: Nutch
User-agent: Openfind data gathere
User-agent: Openfind
User-agent: Offline Explorer
User-agent: ProWebWalker
User-agent: ProPowerBot/2.14
User-agent: Python-urllib
User-agent: QuepasaCreep
User-agent: QueryN Metasearch
User-agent: RepoMonkey Bait & Tackle/v1.01
User-agent: RepoMonkey
User-agent: rogerbot
User-agent: Roverbot
User-agent: RMA
User-agent: SiteSnagger
User-agent: Sogou web spider
User-agent: Sosospider
User-agent: suzuran
User-agent: Szukacz/1.4
User-agent: spbot
User-agent: Updownerbot
User-agent: Scrapy
User-agent: SiteExplorer Findxbot GarlikCrawler
User-agent: spanner
User-agent: SpankBot
User-agent: Teleport
User-agent: TeleportPro
User-agent: turingos
User-agent: TurnitinBot
User-agent: Telesoft
User-agent: The Intraformant
User-agent: True_Robot/1.0
User-agent: True_Robot
User-agent: toCrawl/UrlDispatcher
User-agent: TheNomad
User-agent: Titan
User-agent: TightTwatBot
User-agent: UbiCrawler
User-agent: URLy Warning
User-agent: VCI WebViewer VCI WebViewer Win32
User-agent: VCI
User-agent: Web Image Collector
User-agent: Wget/1.6
User-agent: wget
User-agent: WebmasterWorldForumBot
User-agent: WebReaper
User-agent: Website Quester
User-agent: Webster Pro
User-agent: WebEnhancer
User-agent: WebAuto
User-agent: WebZip
User-agent: WebStripper
User-agent: WebZip/4.0
User-agent: WebBandit
User-agent: WebBandit/3.50
User-agent: WebSauger
User-agent: WebCopier
User-agent: WebZIP
User-agent: Wget
User-agent: Wget/1.5.3
User-agent: WWW-Collector-E
User-agent: YisouSpider
User-agent: yoozBot
User-agent: Zeus
User-agent: Zeus 32297 Webster Pro V2.9 Win32
User-agent: Zao
User-agent: Zealbot
User-agent: Zite
User-agent: Zookabot
User-agent: ZyBORG
Disallow: /

### Others ###
User-agent: *
Disallow: /

### Notice: if you would like to crawl you can contact me at "[email protected]"

Sunday, 20 June 2021
Stock Images by Depositphotos