时间:2010-04-01 点击: 次 来源:互联网 作者:佚名 - 小 + 大
| 本文记录了全世界比较出名的Robots.txt 列表需要设置的搜索蜘蛛。如何设置那个目录不想被搜索引擎收录的可参照下去设置。 当然也必须从Robots.txt 去设置 
 Google的蜘蛛: Googlebot 如需要参考的可以参照本文: 
 User-agent: Black Hole  User-agent: Titan  User-agent: WebStripper  
 User-agent: NetMechanic  User-agent: CherryPicker  User-agent: EmailCollector  
 User-agent: EmailSiphon  
 User-agent: WebBandit  
 User-agent: EmailWolf  
 User-agent: ExtractorPro  User-agent: CopyRightCheck  
 User-agent: Crescent  User-agent: NICErsPRO  
 User-agent: SiteSnagger  
 User-agent: ProWebWalker  User-agent: CheeseBot  User-agent: mozilla/4  User-agent: mozilla/5  
 
 User-agent: Mozilla/4.0 (compatible; MSIE 4.0; Windows 95)  
 User-agent: Mozilla/4.0 (compatible; MSIE 4.0; Windows 9  
 User-agent: ia_archiver  
 User-agent: ia_archiver/1.6  
 User-agent: Alexibot  User-agent: Teleport  
 User-agent: TeleportPro  User-agent: Wget  User-agent: MIIxpc  
 
 User-agent: WebZip  
 
 User-agent: WebZip/4.0  User-agent: WebStripper  User-agent: WebSauger  User-agent: WebCopier  User-agent: NetAnts  User-agent: Mister PiX  
 User-agent: WebAuto  User-agent: TheNomad  
 
 User-agent: RMA  
 User-agent: libWeb/clsHTTPDisallow: / User-agent: asterias  
 
 User-agent: spanner  
 User-agent: InfoNaviRobot  
 
 
 
 User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95)  User-agent: Crescent Internet ToolPak HTTPOLE Control v.1.0  
 User-agent: CherryPickerSE/1.0  User-agent: CherryPickerElite/1.0  User-agent: WebBandit/3.50  User-agent: NICErsPRO  
 
 User-agent: Foobot  User-agent: WebmasterWorldForumBot  User-agent: SpankBot  User-agent: BotALot  
 User-agent: lwp-trivial  
 
 User-agent: Microsoft URL Control - 6.00.8169  User-agent: URLy Warning  User-agent: Wget  
 User-agent: LinkWalker  User-agent: cosmos  User-agent: moget  User-agent: hloader  
 User-agent: humanlinks  User-agent: LinkextractorPro  
 User-agent: Offline Explorer  编辑:Windear 
 
 User-agent: LexiBot  
 User-agent: Offline Explorer  
 User-agent: The Intraformant  
 User-agent: True_Robot/1.0  User-agent: True_Robot  User-agent: BlowFish/1.0  
 User-agent: JennyBot  User-agent: MIIxpc/4.2  
 User-agent: BuiltBotTough  
 User-agent: BackDoorBot/1.0  
 User-agent: WebEnhancer  
 
 User-agent: suzuran  User-agent: VCI WebViewer VCI WebViewer Win32  User-agent: VCI  User-agent: Szukacz/1.4  User-agent: QueryN Metasearch  User-agent: Openfind data gathere  User-agent: Openfind  
 User-agent: Xenu's Link Sleuth 1.1c  
 User-agent: Xenu's  User-agent: Zeus  User-agent: RepoMonkey Bait & Tackle/v1.01  User-agent: RepoMonkey  User-agent: Zeus 32297 Webster Pro V2.9 Win32  User-agent: Webster Pro  User-agent: EroCrawler  User-agent: LinkScan/8.1a Unix Disallow: / 
 User-agent: Keyword Density/0.9  
 User-agent: Kenjin Spider  
 User-agent: Cegbfeieh  Different: User-agent: larbin User-agent: b2w/0.1 
 User-agent: Copernic 
 
 
 
 User-agent: URL_Spider_Pro User-agent: CherryPicker 编辑:Windear User-agent: EmailCollector User-agent: EmailSiphon 
 
 User-agent: ExtractorPro User-agent: CopyRightCheck 
 User-agent: Crescent 
 User-agent: SiteSnagger User-agent: ProWebWalker User-agent: CheeseBot 
 User-agent: LNSpiderguy 
 User-agent: mozilla User-agent: mozilla/3 User-agent: mozilla/4 
 
 
 User-agent: TheNomad 
 User-agent: WWW-Collector-E 
 
 User-agent: libWeb/clsHTTP User-agent: httplib User-agent: turingos User-agent: InfoNaviRobot User-agent: Harvest/1.5 
 User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0 
 
 User-agent: CherryPickerElite/1.0 
 
 User-agent: NICErsPRO User-agent: DittoSpyder User-agent: Foobot User-agent: BotALot 
 User-agent: lwp-trivial/1.34 User-agent: lwp-trivial 
 
 
 User-agent: LinkextractorPro User-agent: Offline Explorer 
 User-agent: Mata Hari User-agent: LexiBot User-agent: Web Image Collector 
 
 User-agent: True_Robot User-agent: BlowFish/1.0 
 User-agent: MIIxpc/4.2 
 
 User-agent: BackDoorBot/1.0 User-agent: toCrawl/UrlDispatcher 
 User-agent: WebEnhancer 
 
 User-agent: VCI WebViewer VCI WebViewer Win32 
 User-agent: Szukacz/1.4  
 User-agent: QueryN Metasearch User-agent: Openfind data gathere User-agent: Openfind  
 
 
 User-agent: Zeus User-agent: RepoMonkey Bait & Tackle/v1.01 
 
 User-agent: Openbot 
 
 User-agent: Zeus Link Scout User-agent: Zeus 32297 Webster Pro V2.9 Win32 User-agent: EroCrawler User-agent: LinkScan/8.1a Unix 
 User-agent: Kenjin Spider User-agent: Iron33/1.0.2 
 User-agent: GetRight/4.2 User-agent: FairAd Client 
 User-agent: Aqua_Products User-agent: Radiation Retriever 1.1 User-agent: WebmasterWorld Extractor 
 User-agent: Oracle Ultra Search User-agent: MSIECrawler User-agent: PerMan 
 User-agent: searchpreview User-agent: naver 
 User-agent: dumbot User-agent: Hatena Antenna User-agent: grub-client User-agent: grub 
 User-agent: b2w/0.1 
 
 User-agent: psbot 
 User-agent: Python-urllib 
 
 
 User-agent: Crescent User-agent: SiteSnagger User-agent: ProWebWalker 
 User-agent: CheeseBot 
 User-agent: Mister PiX User-agent: WebAuto User-agent: TheNomad User-agent: WWW-Collector-E User-agent: RMA User-agent: httplib 
 User-agent: InfoNaviRobot User-agent: Harvest/1.5 User-agent: Bullseye/1.0 User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95) 
 
 User-agent: CherryPickerElite/1.0 
 User-agent: URLy Warning 
 User-agent: humanlinks 
 User-agent: The Intraformant User-agent: True_Robot/1.0 
 User-agent: BlowFish/1.0 User-agent: JennyBot User-agent: MIIxpc/4.2 
 User-agent: BuiltBotTough User-agent: ProPowerBot/2.14 
 User-agent: BackDoorBot/1.0 
 User-agent: WebEnhancer 
 User-agent: VCI WebViewer VCI WebViewer Win32 
 
 User-agent: QueryN Metasearch User-agent: Openfind data gathere User-agent: Openfind  User-agent: Xenu's Link Sleuth 1.1c 
 User-agent: Zeus 
 User-agent: RepoMonkey Bait & Tackle/v1.01 
 User-agent: RepoMonkey User-agent: Microsoft URL Control User-agent: Openbot User-agent: URL Control 
 
 User-agent: Webster Pro 
 User-agent: EroCrawler User-agent: LinkScan/8.1a Unix User-agent: Keyword Density/0.9 
 
 User-agent: Bookmark search tool 
 User-agent: GetRight/4.2 User-agent: FairAd Client 
 User-agent: Aqua_Products 
 
 User-agent: WebmasterWorld Extractor 
 User-agent: Flaming AttackBot 
 
 User-agent: MSIECrawler User-agent: PerMan 
 User-agent: sootle User-agent: es 
 User-agent: Enterprise_Search/1.0 
 User-agent: Enterprise_Search | 
下一篇:百度排名下降的主要原因分析
 贵公网安备52010302003427号
贵公网安备52010302003427号