网站时光机つくえ

网站时光机つくえ
Wayback Machine
	截图 2021年ねん10月がつ的てき网站时光机つくえ首くび页
网站类型	存そん档
成立せいりつ	1996年ねん5月がつ10日とおか，28年ねん前まえ
持もち有ゆう者しゃ	互联网档案あん馆
网址	web.archive.org
注ちゅう册さつ	可か选
推出时间	2001年ねん10月がつ24日にち，22年ねん前まえ
现状	活かつ跃
編へん程ほど語ご言げん	Java、Python

网站时光机つくえ（英語えいご：Wayback Machine）是これ万まん维网的てき數かず碼档案馆，由よし位い于美国こく加か利り福ぶく尼あま亚州旧きゅう金山かなやま的てき非ひ營利えいり組織そしき互联网档案あん馆创建，亦また为该组织最さい重要じゅうよう的てき服ふく务之一いち。它允许用户“回かい到いた过去”，查看过去的てき网站的てき样子。其创始はじめ人じん布ぬの鲁斯特とく·卡利和わBruce Gilliat（英えい语：Bruce Gilliat）开发了りょう网站时光机つくえ，旨むね在ざい通どおり过保存ほぞん已やめ失效しっこう网页的てき存そん档副本ふくほん，以“普及ふきゅう所有しょゆう知ち识”（universal access to all knowledge）。自じ2001年ねん推出以来いらい，截至2024年ねん1月がつ3日にち，网站时光机つくえ已やめ存そん档超过 8600 亿个网页和わ超ちょう过 99 PB 的てき数すう据すえ。^[4]^[5]

历史

网站时光机つくえ由よし互联网档案あん馆的创始人じん布ぬの魯斯特とく·卡利和わBruce Gilliat（英えい语：Bruce Gilliat）于2001年ねん公開こうかい推出，以解决网站在维护或ある关闭时无法ほう查看内容ないよう的てき问题^[6]，此外还能查看网页的てき历史存そん档版本はんぽん，创始人じんKahle和わGilliat希望きぼう以此能のう为整个互联网“普及ふきゅう所有しょゆう知ち识”（universal access to all knowledge）^[7]。

Wayback Machine这个名称めいしょう源げん于动画が片へんThe Rocky and Bullwinkle Show（英えい语：The Rocky and Bullwinkle Show）中なか的てき“WABAC机つくえ器き（英えい语：WABAC machine）”（发音为Way-back），这是一いち个时间旅行りょこう装置そうち^[8]^[9]。在ざい动画片へん的てき皮かわ博はく迪すすむ的てき不可能ふかのう的てき历史一いち集中しゅうちゅう，角かく色しょく使用しよう这一机つくえ器き来らい见证、参与さんよ甚至改あらため变历史上しじょう的てき著名ちょめい事件じけん^[10]。

网站时光机つくえ于1996年ねん开始存そん档缓存网页，目もく标是在ざい五年后将服务公之于众^[11]。从1996年ねん到いた2001年ねん，这些信しん息いき保存ほぞん在ざい数字すうじ磁带上じょう，Kahle偶尔允まこと许研究けんきゅう人じん员和科学かがく家か使用しよう数すう据すえ库^[12]。2001年ねん，互联网档案あん馆成立せいりつ五ご周年しゅうねん时，加州かしゅう大学だいがく伯はく克利かつとし分校ぶんこう举行了りょう网站时光机つくえ的てき公布こうふ仪式^[13]。当とう网站时光机つくえ推出时，它已经存档了超ちょう过100亿个页面^[14]。

如今，数すう据すえ存そん储在互联网档案あん馆的大型おおがたLinux节点群集ぐんしゅう上じょう^[7]。有ゆう时会重おも新しん访问并存档网站的新しん版本はんぽん（参まいり见下文ぶん技わざ术细节）^[15]。如果网站允まこと许网络时光こう机つくえ“爬虫索引さくいん”网站并保存ほぞん数すう据すえ，则也可か以通过在搜索そうさく框かまち中ちゅう输入网站的てきURL手しゅ动捕获网站^[11]。

技わざ术细节

网络时光机つくえ已やめ经开发了软件用よう于“爬虫索引さくいん”并下载所有しょゆう可か公おおやけ开访问的万まん维网页面、Gopher层次结构、Usenet公告こうこく板いた系けい统和可か下か载软件けん^[16]。这些“爬虫”收集しゅうしゅう的てき信しん息いき并不能ふのう包括ほうかつ互联网上所有しょゆう可用かよう的てき信しん息いき，因いん为许多数たすう据すえ受发布ぬの者しゃ限げん制せい或ある存そん储在不可ふか访问的てき数かず据すえ库中なか。为了克服こくふく部分ぶぶん缓存网站的てき不一致ふいっち性せい，2005年ねん，互联网档案あん馆开发了Archive-It.org，使つかい得とく机つくえ构和内容ないよう创作者しゃ可か以自愿すなお收集しゅうしゅう和わ保存ほぞん数字すうじ内容ないよう，并创建けん数字すうじ档案馆^[17]。

爬虫索引さくいん来き自じ各かく种来源げん，其中一些是从第三方导入的，而另一些是由存档内部生成的^[15]。自じ2010年ねん以来いらい，“Worldwide Web Crawls”一いち直ちょく在ざい运行，并捕获全球だま网站^[15]^[18]。

快かい照あきら捕と获的频率因いん网站而异^[15]。“Worldwide Web Crawls”中ちゅう的てき网站包含ほうがん在ざい“爬网列れつ表ひょう”（crawl list）中ちゅう，每次まいじ爬网都会とかい将はた网站存そん档一いち次じ^[15]。爬网可能かのう需要じゅよう数すう月がつ甚至数すう年ねん才能さいのう完成かんせい，具体ぐたい取と决于其大小しょう^[15]。例れい如，"Wide Crawl Number 13"从2015年ねん1月がつ9日にち开始，于2016年ねん7月がつ11日にち完成かんせい^[19]。但ただし是ぜ，一次可能有多个爬网正在进行，并且一个站点可能包含在多个爬网列表中，因いん此，对站点てん进行爬网的てき频率有ゆう很大的てき不同ふどう。^[15]

存そん储容量的りょうてき增加ぞうか

随ずい着ぎ多た年来ねんらい技わざ术的发展，网站时光机つくえ的てき存そん储容量りょう不断ふだん增加ぞうか。2003年ねん，仅经过两年ねん的てき公おおやけ开访问，网站时光机つくえ便びん以每月まいつき12太字ふとじ节（TB）的てき速度そくど增ぞう长。数かず据すえ存そん储在由よし互联网档案あん馆的工作こうさく人じん员定制せい设计的てきPetaBox（英えい语：PetaBox）机つくえ架か系けい统上。第だい一いち个100太字ふとじ节（TB）的てき机つくえ架か于2004年ねん6月がつ全面ぜんめん投入とうにゅう使用しよう，不ふ过很快かい就发现，这些存そん储空间远远不够^[20]^[21]。

互联网档案あん馆在2009年ねん其定制せい的てき存そん储体系けい结构迁移到いたSun开放式しき储存（英えい语：Sun Open Storage），并在Sun系けい统的てき加か利り福ぶく尼あま亚园区的てきSun模も块化数すう据すえ中心ちゅうしん（英えい语：Sun Modular Datacenter）中ちゅう托たく管かん了りょう一いち个新的てき数かず据すえ中心ちゅうしん^[22]。截至2009年ねん^[update]，网站时光机つくえ包含ほうがん大だい约3拍はく字じ节（PB）的てき数すう据すえ，并以每月まいつき100太字ふとじ节（TB）的てき速度そくど增ぞう长^[23]。

2013年ねん1月がつ，该公司こうし宣布せんぷ了りょう2400亿个URL的てき突破とっぱ性せい里程りてい碑ひ^[24]。2013年ねん10月がつ，该公司こうし宣布せんぷ了りょう“保存ほぞん页面”(Save a Page)功こう能のう^[25]，允まこと许任何なん互联网用户存档URL的てき内容ないよう。这成为了托たく管かん恶意二に进制文ぶん件けん的てき服ふく务滥用よう威い胁^[26]^[27]。

截至2014年ねん12月 (2014-12)^[update]，网站时光机つくえ存そん有ゆう4350亿个网页，将はた近きん9拍はく字じ节（PB）的てき数すう据すえ，并且每ごと周しゅう增ぞう长约20太字ふとじ节（TB）^[14]^[28]^[29]。

据すえ报道，截至2016年ねん7月がつ (2016-07)^[update]，网站时光机つくえ存そん有ゆう约15拍はく字じ节（PB）的てき数すう据すえ^[30]。

截至2018年ねん9月がつ (2018-09)^[update]，网站时光机つくえ存そん有ゆう超ちょう过25拍はく字じ节（PB）的てき数すう据すえ^[31]^[32]。

成なり长

2013年ねん10月がつ至いたり2015年ねん3月がつ，该网站的全ぜん球たまAlexa排はい名めい从163^[33]变为208^[34]。2019年ねん3月がつ，该排名めい为244^[35].

网站时光机つくえ的てき成なり长 ^[36] ^[37]
年とし份	已やめ存そん档的页面数すう（单位：亿）
2005	400
2008	850
2012	1,500
2013	3,730
2014	4,000
2015	4,520
2016	4,590
2017	2,790
2018	3,100
2019	3,450
2020	4,050
2021	5,140
2022	6,400
2024	8,660

网站排除はいじょ方かた针

历年来らい，网站时光机つくえ一いち直ちょく尊重そんちょう机つくえ器き人じん排除はいじょ标准（robots.txt）以决定じょう一个网站是否会受爬网；或ある者もの如果已やめ经爬网了，它的存そん档是否いや可か以公开查看み。通つう过使用しようrobots.txt，网站所有しょゆう者しゃ可か以选择退出たいしゅつ网站时光机つくえ。如果站点阻止そし了りょう网页存そん档，则域中ちゅう以前いぜん存そん档的任にん何なん页面也将立りつ即そく显示为不可用かよう。此外，互联网档案あん馆表示ひょうじ，“有ゆう时网站所有しょゆう者しゃ会かい直接ちょくせつ联系我わが们，要求ようきゅう我わが们停止ていし对网站进行ぎょう爬网或ある存そん档。我わが们会遵守じゅんしゅ这些请求。”^[38]^[39]

2017年ねん4月がつ17日にち，有ゆう报道称たたえ，一些网站已经倒闭，成なり为暂停的てき域いき（英えい语：Domain parking）（Domain parking）。它们通どおり过使用しようrobots.txt把わ自己じこ排除はいじょ在ざい搜索そうさく引擎之の外そと，这使得とく时光机つくえ无意中ちゅう排除はいじょ了りょう這些网站^[40]。

网站时光机つくえ的てき网站排除はいじょ方かた针（Website exclusion policy）部分ぶぶん基もと于2002年ねん加か利り福ぶく尼あま亚大学がく伯はく克利かつとし分校ぶんこう信しん息いき管理かんり和わ系けい统学院いん发布的てき《管理かんり删除请求和わ维护档案完かん整せい性せい的てき建けん议》（英語えいご：Recommendations for Managing Removal Requests and Preserving Archival Integrity），此建议赋予よ网站所有しょゆう者しゃ阻止そし访问网站存そん档的权利^[41]。网站时光机つくえ遵守じゅんしゅ了りょう这一政策せいさく，以避免めん昂のぼる贵的诉讼^[42]。

网站排除はいじょ方かた针于2017年ねん开始放ひ宽，当とう时它停止ていし遵循robots.txt，并对美国びくに政府せいふ和わ军方的てき网站进行爬网和わ显示网页。截至2017年ねん4月がつ，网站时光机つくえ更さら广泛地ち忽ゆるがせ略りゃく了りょうrobots.txt，而不仅对于美国こく政府せいふ网站^[43]^[44]^[45]^[46]。

用途ようと

自じ2001年ねん网站时光机つくえ公こう开发布ぬの以来いらい，学者がくしゃ们一直在研究它的存储和收集数据的方式，以及其存档中实际包含ほうがん的てき页面。截至2013年ねん，学者がくしゃ们已经在网站时光机上きじょう撰せん写うつし了りょう大だい约350篇へん文章ぶんしょう，其中大だい部分ぶぶん来き自じ信しん息いき技わざ术、图书馆学和わ社会しゃかい科学かがく领域。社会しゃかい科学かがく学者がくしゃ们使用しよう网站时光机つくえ分析ぶんせき了りょう从90年代ねんだい中期ちゅうき至いたり今こん网站的てき发展对公司こうし的てき成なり长的影かげ响^[14]。

当とう网站时光机つくえ存そん档一个页面めん时，它通常会じょうかい包含ほうがん大だい多数たすう超ちょう链接，以使这些链接遭互联网的てき不ふ稳定性せい轻易破やぶ坏时，能のう够仍然しか保持ほじ活かつ动状态。印度いんど的てき研究けんきゅう人じん员研究けんきゅう了りょう网站时光机つくえ保存ほぞん在ざい线学术出版しゅっぱん物ぶつ中ちゅう的てき超ちょう链接的てき能力のうりょく的てき有效ゆうこう性せい，发现它保存ほぞん了りょう略りゃく多た于一半いっぱん的てき超ちょう链接。^[47]

有ゆう记者使用しよう网站时光机つくえ查看失效しっこう的てき网站、过时的てき新しん闻报道どう以及被ひ更改こうかい的てき网站内容ないよう。其内容ないよう已やめ用よう于追究ついきゅう政治せいじ家か的てき责任，揭穿争そう论场合あい上じょう的てき谎言^[48]。2014年ねん，乌克兰东部ぶ分裂ぶんれつ地区ちく叛军顿涅茨いばら克かつ人民じんみん军领导人じん伊い戈ほこ尔·斯特列れつ尔科夫おっと的てき社交しゃこう媒体ばいたい的てき存そん档页面めん显示，他た吹嘘自己じこ的てき部ぶ队击落了一架疑似乌克兰军用飞机，后きさき来らい才知さいち道どう这架飞机实际上じょう是ぜ一架马航民航客机（马来西にし亚航空こうくう17号ごう班はん机つくえ），之これ后きさき，他た删除了りょう发布的てき这篇文章ぶんしょう，并指责乌克かつ兰军方かた击落了りょう这架飞机^[48]^[49]。2017年ねん，在ざい社交しゃこう网站Reddit的てき讨论中ちゅう，有人ゆうじん表示ひょうじ访问过archive.org 并发现白宫网站删除じょ了りょう所有しょゆう提ひさげ及气候こう变化的てき内容ないよう，对此，一いち位い用よう户评论道：“科学かがく家か有ゆう必要ひつよう在ざい华盛顿举行ぎょう一いち次じ游ゆう行こう”，此事成なり为了为科学かがく游ゆう行こう（March for Science）举行的てき原因げんいん^[50]^[51]^[52]。

存在そんざい局限きょくげん

2014年ねん，从抓取と网站到它可以在网站时光机上きじょう查看之の间存在そんざい6个月的てき延のべ迟时间^[53]。目前もくぜん，该延迟时间为3-10小しょう时^[54]。网站时光机つくえ仅提供ていきょう有限ゆうげん的てき搜索そうさく功こう能のう，它的“站点搜索そうさく”（Site Search）功こう能のう允まこと许用户根据すえ描述站点的てき词汇来らい查找站点，而非网页本身ほんみ的てき词汇^[55]。

由よし于网络爬虫ちゅう的てき限きり制せい，网站时光机つくえ无法完全かんぜん存そん档互动式しき网页，例れい如Flash平台ひらだい和わ使用しようJavaScript和わ渐进式しき网络应用程ほど序じょ编写的てき表ひょう单，因いん为这些功能のう需要じゅよう与あずか宿主しゅくしゅ网站交互こうご。网站时光机つくえ的てき网络爬虫很难提ひっさげ取ど任にん何なん未み使用しようHTML或ある其变形がた编码的てき内容ないよう，这通常会じょうかい导致超ちょう链接损坏和わ图像丢失。因よし此，网络爬虫无法存そん档不包含ほうがん指向しこう其他页面的てき链接的てき“孤立こりつ页面”（Orphan page）^[55]^[54]。由よし于其爬虫程ほど序じょ仅能根ね据すえ其预设的深度しんど限げん制せい追つい踪有限げん数量すうりょう的てき超ちょう链接，因いん此它无法存そん档每个页面めん中ちゅう的てき每まい个超链接^[18]。

法律ほうりつ证据

民事みんじ诉讼

Netbula LLC v. Chordiant Software Inc.

在ざい2009年ねん的てき“Netbula, LLC v. Chordiant Software Inc.”一案いちあん中ちゅう，被告ひこくChordiant提出ていしゅつ动议，要求ようきゅうNetbula禁きん用よう其网站上的てきrobots.txt文ぶん件けん，因いん为该文ぶん件けん导致网站时光机つくえ追おい溯さかのぼ性せい地ち撤销了りょう对Netbula网站先さき前ぜん版本はんぽん的てき存そん档的访问权限，Chordiant相しょう信しん这些页面中ちゅう存在そんざい有利ゆうり于诉讼的材料ざいりょう^[56]。

Netbula反はん对该动议，理由りゆう是ぜ被告ひこく要求ようきゅう更改こうかいNetbula的てき网站，他た们应该直接ちょくせつ为这些页面めん直接ちょくせつ传唤互联网档案あん馆^[57]。然しか而，互联网档案あん馆的一名雇员发表了宣誓声明，支持しじChordiant的てき动议，表示ひょうじ在ざい“不ふ对其运营造成ぞうせい大量たいりょう负担，费用和わ干ひ扰”的てき情じょう况下，无法通どおり过任何なん其他方式ほうしき访问网页^[56]。

美国びくに加か利り福ぶく尼あま亚北区く联邦地区ちく法ほう院いん圣何塞ふさが分部わけべ的てき地方ちほう法官ほうかん霍华德とく·劳埃德とく（Howard Lloyd）驳回了りょうNetbula的てき论点，并命令めいれい他た们暂时禁用ようrobots.txt阻止そし程ほど序じょ，以使Chordiant可か以检索さく他た们想要よう的てき存そん档页面めん^[56]。

波なみ兰电视台

在ざい2004年ねん10月がつ的てき“ Telewizja Polska USA, Inc. v. Echostar Satellite”No. 02 C 3293, 65 Fed. R. Evid. Serv. 673 (N.D. Ill. October 15, 2004)一案いちあん中ちゅう，一いち名めい诉讼当事とうじ人じん试图使用しよう网站时光机つくえ的てき档案作さく为有效ゆうこう证据的てき来らい源げん，此举可能かのう属ぞく于首次じ。波なみ兰电视台是ぜTVP Polonia（英えい语：TVP Polonia）的てき供きょう应商，EchoStar（英えい语：EchoStar）运营Dish Network。在ざい审判程ほど序じょ之の前まえ，EchoStar表示ひょうじ，它打算ださん提供ていきょう网站时光机つくえ快かい照あきら，作さく为波兰电视台网站过去内容ないよう的てき证据。

参まいり閲

网络存そん档网站列表ひょう（英えい语：List_of_Web_archiving_initiatives）
公共こうきょう領域りょういき音樂おんがく（英えい语：Public domain music）
網あみ頁ぺーじ存そん檔（英えい语：Web archiving）
數すう位い圖書館としょかん

类似的てき项目

Archive.is
網あみ際ぎわ網もう路ろ記憶きおく基金ききん會かい（英えい语：Internet Memory Foundation）
LibriVox
國家こっか數すう位い資し訊基礎きそ設しつらえ施ほどこせ和わ保護ほご計けい劃（英えい语：National Digital Information Infrastructure and Preservation Program） (NDIIPP)
國家こっか數すう位い圖書館としょかん計けい劃（英えい语：National Digital Library Program） (NDLP)
古こ腾堡计划
英國えいこく國家こっか檔案館かん的てき英國えいこく政府せいふ網もう頁ぺーじ存そん檔（英えい语：UK Government Web Archive）
英國えいこく網もう頁ぺーじ存そん檔聯盟れんめい（英えい语：UK Web Archiving Consortium）
WebCite
Google圖書としょ
ウェブ魚拓ぎょたく（日にち语：ウェブ魚拓ぎょたく）

其他

外部がいぶ链接

官かん方かた网站
互联网档案あん馆的使用しよう条じょう款，隐私政策せいさく和わ版ばん权政策せいさく. archive.org. 2014-12-31 [2020年ねん6月がつ20日はつか]. （原始げんし内容ないよう存そん档于2020年ねん6月がつ6日にち）.
搜索そうさく或ある保存ほぞん网页的てき基本きほん用よう户操作そうさ指南しなん. WikiHow.com. [2020-06-20]. （原始げんし内容ないよう存そん档于2020-03-15）（英えい语、德とく语、西にし班はん牙きば语、法ほう语及意い大利おおとし语）.
Internet history is fragile. This archive is making sure it doesn't disappear [互联网历史し是ぜ脆弱ぜいじゃく的てき。这个档案正ただし在ざい确保它不会かい消失しょうしつ]. San Francisco: PBS Newshour. [2020-06-20]. （原始げんし内容ないよう存そん档于2021-04-08）.

镜像网站

网站时光机つくえ的てき官かん方かた镜像网站. 新しん亚历山大やまだい图书馆. [2020-06-20]. （原始げんし内容ないよう存そん档于2012-11-28）. 1996-2007年ねん（截至2019年ねん^[update]）.

实用程ほど序じょ

Wayback. SourceForge.net. [2020-06-20]. （原始げんし内容ないよう存そん档于2011-09-16）.
从网站时光こう机つくえ检索备份的てき工具こうぐ. github.com. [2018-05-03]. （原始げんし内容ないよう存そん档于2021-05-03）.
网站时光机つくえ在ざい线下载器. [2018-03-20]. （原始げんし内容ないよう存そん档于2018-03-21）（英えい语及波は兰语）.

参考さんこう文献ぶんけん

^ WayBackMachine.org WHOIS, DNS, & Domain Info – DomainTools. WHOIS. [2016-03-13]. （原始げんし内容ないよう存そん档于2020-05-14）.
^ InternetArchive.org WHOIS, DNS, & Domain Info – DomainTools. WHOIS. [2016-03-13]. （原始げんし内容ないよう存そん档于2020-05-12）.
^ archive.org Competitive Analysis, Marketing Mix and Traffic - Alexa. alexa.com. [2020-06-06]. （原始げんし内容ないよう存そん档于2020-05-18）.
^ Internet Archive: Wayback Machine. web.archive.org. （原始げんし内容ないよう存そん档于2023-03-13）. The current number of archived pages can be seen at the archive's home page.
^ Kahle, Brewster. A Message from Internet Archive Founder, Brewster Kahle. Internet Archive. [10 January 2024].
^ Notess, Greg R. The Wayback Machine: The Web's Archive. Online. March–April 2002, 26: 59–61.
^ ^7.0 ^7.1 20,000 Hard Drives on a Mission | Internet Archive Blogs. blog.archive.org. [2018-10-15]. （原始げんし内容ないよう存そん档于2018-10-20）（美国びくに英えい语）.
^ Green, Heather. A Library as Big as the World. BusinessWeek. 2002-02-28. （原始げんし内容ないよう存そん档于2011-12-20）.
^ Tong, Judy. Responsible Party – Brewster Kahle; A Library Of the Web, On the Web. New York Times. 2002-09-08 [2011-08-15]. （原始げんし内容ないよう存そん档于2011-02-20）.
^ Can the Internet Be Archived?. The New Yorker. 2015-01-26 [2019-01-23]. （原始げんし内容ないよう存そん档于2015-01-25）.
^ ^11.0 ^11.1 Internet Archive: Wayback Machine. archive.org. [2018-10-15]. （原始げんし内容ないよう存そん档于2014-01-03）（英えい语）.
^ Cook, John. Web site takes you way back in Internet history. Seattle Post-Intelligencer. 2001-11-01 [2011-08-15]. （原始げんし内容ないよう存そん档于2014-08-12）.
^ Wayback Goes Way Back on Web. Wired. 2001-10-28 [2017-10-16]. （原始げんし内容ないよう存そん档于2017-10-16）.
^ ^14.0 ^14.1 ^14.2 Arora, Sanjay K.; Li, Yin; Youtie, Jan; Shapira, Philip. Using the wayback machine to mine websites in the social sciences: A methodological resource. Journal of the Association for Information Science and Technology. 2015-05-05, 67 (8): 1904–1915. ISSN 2330-1635. doi:10.1002/asi.23503 （英えい语）.
^ ^15.0 ^15.1 ^15.2 ^15.3 ^15.4 ^15.5 ^15.6 Kalev Leetaru. The Internet Archive Turns 20: A Behind the Scenes Look at Archiving the Web. Forbes. 2016-01-28 [2017-10-16]. （原始げんし内容ないよう存そん档于2017-10-16）.
^ Kahle, Brewster. Archiving the Internet. Scientific American – March 1997 Issue. [2020-04-25]. （原始げんし内容ないよう存そん档于2012-08-03）（英えい语）.
^ Kaplan, Jeff. Archive-It: Crawling the Web Together. Internet Archive Blogs. 2014-11-27 [2020-04-24]. （原始げんし内容ないよう存そん档于2017-10-12）（英えい语）.
^ ^18.0 ^18.1 Worldwide Web Crawls. Internet Archive. [2020-06-25]. （原始げんし内容ないよう存そん档于2017-10-19）.
^ Wide Crawl Number 13. Internet Archive. [2020-06-07]. （原始げんし内容ないよう存そん档于2017-10-19）（英えい语）.
^ Internet Archive: Petabox. archive.org. 2020-06-07 [2020-06-07]. （原始げんし内容ないよう存そん档于2020-06-03）（英えい语）.
^ Kanellos, Michael. Big storage on the cheap. CNET News.com. 2005-07-29 [2020-06-07]. （原始げんし内容ないよう存そん档于2007-04-03）.
^ Internet Archive and Sun Microsystems Create Living History of the Internet [互联网档案あん馆和Sun系けい统创造づくり了りょう互联网的鲜活历史]. Sun Microsystems. 2009-03-25 [2020-06-07]. （原始げんし内容ないよう存そん档于2009-03-26）（英えい语）.
^ Mearian, Lucas. Internet Archive to unveil massive Wayback Machine data center [互联网档案あん馆推出で大だい规模网站时光机つくえ数すう据すえ中心ちゅうしん]. Computerworld.com. 2009-03-19 [2020-09-07]. （原始げんし内容ないよう存そん档于2009-03-23）（英えい语）.
^ Kahle, Brewster. Wayback Machine: Now with 240,000,000,000 URLs [网站时光机つくえ：现有240,000,000,000个URL]. blog.archive.org. Internet Archive Blogs. 2013-01-09 [2020-06-07]. （原始げんし内容ないよう存そん档于2014-04-14）（英えい语）.
^ Rossi, Alexis. Fixing Broken Links on the Internet. archive.org. San Francisco, CA, US: Collections Team, the Internet Archive. 2013-10-25 [2020-06-11]. （原始げんし内容ないよう存そん档于2014-11-07）. We have added the ability to archive a page instantly and get back a permanent URL for that page in the Wayback Machine. This service allows anyone – wikipedia editors, scholars, legal professionals, students, or home cooks like me – to create a stable URL to cite, share or bookmark any information they want to still have access to in the future.
^ The VirusTotal Team. 207.241.226.190 IP address information. virustotal.com. Dublin 2, Ireland: VirusTotal. 2015-03-25 [2020-06-11]. （原始げんし内容ないよう存そん档于2014-07-14）. 2015-03-25: Latest URLs hosted in this IP address detected by at least one URL scanner or malicious URL dataset. ... 2/62 2015-03-25 16:14:12 [complete URL redacted]/Renegotiating_TLS.pdf ... 1/62 2015-03-25 04:46:34 [complete URL redacted]/CBLightSetup.exe
^ Advisory provided by Google. Safe Browsing Diagnostic page for archive.org. google.com/safebrowsing. Mountain View, CA, US. 2015-03-25 [2020-06-11]. （原始げんし内容ないよう存そん档于2015-04-06）. 2015-03-25: Part of this site was listed for suspicious activity 138 time(s) over the past 90 days. ... What happened when Google visited this site? ... Of the 42410 pages we tested on the site over the past 90 days, 450 page(s) resulted in malicious software being downloaded and installed without user consent. The last time Google visited this site was on 2015-03-25, and the last time suspicious content was found on this site was on 2015-03-25. ... Malicious software includes 169 trojan(s), 126 virus, 43 backdoor(s).
^ Internet Archive Frequently Asked Questions. [2020-06-11]. （原始げんし内容ないよう存そん档于2009-02-21）.
^ Internet Archive Frequently Asked Questions. 2014-12-18 [2020-06-11]. （原始げんし内容ないよう存そん档于2014年ねん12月18日にち）.
^ Can the manipulation of big data change the way the world thinks? [操みさお纵大数すう据すえ能のう改あらため变世界かい的てき思おもえ维方式しき吗？]. The National. 2017-01-05 [2020-06-07]. （原始げんし内容ないよう存そん档于2017-01-12）（英えい语）.
^ Crockett, Zachary. Inside Wayback Machine, the internet's time capsule. The Hustle. 2018-09-28 [2020-06-07]. （原始げんし内容ないよう存そん档于2018-10-02）（英えい语）.
^ Heffernan, Virginia. Things Break and Decay on the Internet—That's a Good Thing. WIRED. 2018-09-18 [2018-10-26]. （原始げんし内容ないよう存そん档于2018-09-25）（英えい语）.
^ Archive.org Site Info. Alexa Internet. [2020-06-11]. （原始げんし内容ないよう存そん档于2013年ねん10月がつ28日にち）.
^ Archive.org Site Overview. Alexa Internet. [2020-06-11]. （原始げんし内容ないよう存そん档于2015-04-09）.
^ Archive.org Traffic, Demographics and Competitors - Alexa. 2019-03-23 [2020-06-11]. （原始げんし内容ないよう存そん档于2019-03-23）.
^ michelle. Wayback Machine Hits 400,000,000,000!. Internet Archive. 2014-05-09 [2020-06-11]. （原始げんし内容ないよう存そん档于2014-08-26）.
^ Internet Archive Wayback Machine. 互联网档案あん馆. [2020-06-01]. （原始げんし内容ないよう存そん档于2015-02-13）.
^ Some sites are not available because of Robots.txt or other exclusions. What does that mean?. 网站时光机つくえ. [2020-06-13]. （原始げんし内容ないよう存そん档于2011-04-15）（英えい语）. ......All of this information is contained in a file called robots.txt. While robots.txt has been adopted as the universal standard for robot exclusion, compliance with robots.txt is strictly voluntary...... Alexa, the company that crawls the web for the Internet Archive, does respect robots.txt instructions, and even does so retroactively. If a web site owner ever decides he/she prefers not to have a web crawler visiting his / her files and sets up robots.txt on the site, the Alexa crawlers will stop visiting those files and mark all files previously gathered as unavailable......sometimes a web site owner will contact us directly and ask us to stop crawling or archiving a site. We comply with these requests.
^ Cox, Joseph. The Wayback Machine Is Deleting Evidence of Malware Sold to Stalkers. 2018-05-22 [2020-06-13]. （原始げんし内容ないよう存そん档于2018年ねん5月がつ22日にち）.
^ Robots.txt meant for search engines don't work well for web archives. Internet Archive. 2017-04-17 [2020-06-13]. （原始げんし内容ないよう存そん档于2018-12-04）（英えい语）.
^ Recommendations for Managing Removal Requests And Preserving Archival Integrity. 加か利り福ぶく尼あま亚大学がく. 2002-12-14 [2020-06-13]. （原始げんし内容ないよう存そん档于2017-09-18）（英えい语）.
^ Retroactive robots.txt removal of past crawls AKA Oakland Archive Policy. 互联网档案あん馆. 2014-07-07 [2020-06-13]. （原始げんし内容ないよう存そん档于2017年ねん10月がつ10日とおか）（英えい语）.
^ Mark Graham. Robots.txt meant for search engines don't work well for web archives [用よう于搜索引さくいん擎的robots.txt不ふ适用于网络存档]. Internet Archive Blogs. 2017-04-17 [2020-06-18]. （原始げんし内容ないよう存そん档于2017-04-17）（英えい语）.
^ Archivierung des Internets: Internet Archive ignoriert künftig robots.txt [互联网档案あん馆：互联网存档馆将はた忽ゆるがせ略りゃくrobots.txt文ぶん件けん]. heise online. [2020-06-18]. （原始げんし内容ないよう存そん档于2017-04-27）（德とく语）.
^ Suchmaschinen: Internet Archive will künftig Robots.txt-Einträge ignorieren. Golem.de. [2020-06-18]. （原始げんし内容ないよう存そん档于2017-06-19）（德とく语）.
^ Internet Archive will ignore robots.txt files to keep historical record accurate [互联网档案あん馆将忽ゆるがせ略りゃくrobots.txt文ぶん件けん以保持ほじ历史文ぶん件けん的てき准じゅん确性]. Digital Trends. 2017-04-24 [2020-06-18]. （原始げんし内容ないよう存そん档于2017-05-16）（英えい语）.
^ Sampath Kumar, B.T.; Prithviraj, K.R. Bringing life to dead: Role of Wayback Machine in retrieving vanished URLs. Journal of Information Science. 2014-11-21, 41 (1): 71–81. ISSN 0165-5515. doi:10.1177/0165551514552752 （英えい语）.
^ ^48.0 ^48.1 Nelson, Steven. Wayback Machine Won't Censor Archive for Taste, Director Says After Olympics Article Scrubbed. US News. 2016-08-17 [2020-06-20]. （原始げんし内容ないよう存そん档于2017-01-06）. The Wayback Machine's unique search function frequently is used as a tool for journalists to review now-dead websites or to comb through dated news reports. The archived content has been used to embarrass politicians and expose battlefield lies.
^ Lepore, Jill. What the Web Said Yesterday. The New Yorker. 2015-01-19 [2020-06-20]. （原始げんし内容ないよう存そん档于2015-01-25）.
^ The March for Science began with this person's 'throwaway line' on Reddit [为科学かがく游ゆう行こう始はじめ于此人じん在ざいReddit上じょう“一带而过的话”]. Washington Post. [2017-04-23]. （原始げんし内容ないよう存そん档于2017-04-23）（英えい语）.
^ Are scientists going to march on Washington? [科学かがく家か要よう去さ华盛顿游行ぎょう吗？]. The Washington Post. 2017-01-24 [2020-06-20]. （原始げんし内容ないよう存そん档于2017-01-31）（英えい语）.
^ Foley, Katherine Ellen. The global March for Science started with a single Reddit thread. Quartz. [2020-06-20]. （原始げんし内容ないよう存そん档于2017-04-24）（英えい语）.
^ Internet Archive Frequently Asked Questions. 互联网档案あん馆. 2014-04-02 [2020-06-25]. （原始げんし内容ないよう存そん档于2014-04-02）.
^ ^54.0 ^54.1 Using The Wayback Machine. help.archive.org. 互联网档案あん馆. [2020-06-25]. （原始げんし内容ないよう存そん档于2020-07-06）.
^ ^55.0 ^55.1 Bates, Mary Ellen. The Wayback Machine. Online. 2002, 26: 80 –通どおり过EBSCOhost.
^ ^56.0 ^56.1 ^56.2 Lloyd, Howard. Order to Disable Robots.txt (PDF). American-Justice.org. 2009-10-15 [2020-06-26]. （原始げんし内容ないよう (PDF)存そん档于2019-08-08）.
^ Cortes, Antonio L. Motion Opposing Removal of Robots.txt. American-Justice.org. 2009-09-29 [2020-06-26]. （原始げんし内容ないよう存そん档于2011-05-13）.

[1] WayBackMachine.org WHOIS, DNS, & Domain Info – DomainTools. WHOIS. [2016-03-13]. （原始げんし内容ないよう存そん档于2020-05-14）.

[2] InternetArchive.org WHOIS, DNS, & Domain Info – DomainTools. WHOIS. [2016-03-13]. （原始げんし内容ないよう存そん档于2020-05-12）.

[alexa-3] rchive.org Competitive Analysis, Marketing Mix and Traffic - Alexa. alexa.com. [2020-06-06]. （原始げんし内容ないよう存そん档于2020-05-18）.

[4] Internet Archive: Wayback Machine. web.archive.org. （原始げんし内容ないよう存そん档于2023-03-13）. The current number of archived pages can be seen at the archive's home page.

[5] Kahle, Brewster. A Message from Internet Archive Founder, Brewster Kahle. Internet Archive. [10 January 2024].

[6] Notess, Greg R. The Wayback Machine: The Web's Archive. Online. March–April 2002, 26: 59–61.

[:0-7] 7.0 ^7.1 20,000 Hard Drives on a Mission | Internet Archive Blogs. blog.archive.org. [2018-10-15]. （原始げんし内容ないよう存そん档于2018-10-20）（美国びくに英えい语）.

[8] Green, Heather. A Library as Big as the World. BusinessWeek. 2002-02-28. （原始げんし内容ないよう存そん档于2011-12-20）.

[9] Tong, Judy. Responsible Party – Brewster Kahle; A Library Of the Web, On the Web. New York Times. 2002-09-08 [2011-08-15]. （原始げんし内容ないよう存そん档于2011-02-20）.

[10] Can the Internet Be Archived?. The New Yorker. 2015-01-26 [2019-01-23]. （原始げんし内容ないよう存そん档于2015-01-25）.

[IA:_Wayback-11] 11.0 ^11.1 Internet Archive: Wayback Machine. archive.org. [2018-10-15]. （原始げんし内容ないよう存そん档于2014-01-03）（英えい语）.

[12] Cook, John. Web site takes you way back in Internet history. Seattle Post-Intelligencer. 2001-11-01 [2011-08-15]. （原始げんし内容ないよう存そん档于2014-08-12）.

[13] Wayback Goes Way Back on Web. Wired. 2001-10-28 [2017-10-16]. （原始げんし内容ないよう存そん档于2017-10-16）.

[Arora_(2015)-14] 14.0 ^14.1 ^14.2 Arora, Sanjay K.; Li, Yin; Youtie, Jan; Shapira, Philip. Using the wayback machine to mine websites in the social sciences: A methodological resource. Journal of the Association for Information Science and Technology. 2015-05-05, 67 (8): 1904–1915. ISSN 2330-1635. doi:10.1002/asi.23503 （英えい语）.

[leetaru-15] 15.0 ^15.1 ^15.2 ^15.3 ^15.4 ^15.5 ^15.6 Kalev Leetaru. The Internet Archive Turns 20: A Behind the Scenes Look at Archiving the Web. Forbes. 2016-01-28 [2017-10-16]. （原始げんし内容ないよう存そん档于2017-10-16）.

[ArchivingInternet-16] Kahle, Brewster. Archiving the Internet. Scientific American – March 1997 Issue. [2020-04-25]. （原始げんし内容ないよう存そん档于2012-08-03）（英えい语）.

[17] Kaplan, Jeff. Archive-It: Crawling the Web Together. Internet Archive Blogs. 2014-11-27 [2020-04-24]. （原始げんし内容ないよう存そん档于2017-10-12）（英えい语）.

[:3-18] 18.0 ^18.1 Worldwide Web Crawls. Internet Archive. [2020-06-25]. （原始げんし内容ないよう存そん档于2017-10-19）.

[19] Wide Crawl Number 13. Internet Archive. [2020-06-07]. （原始げんし内容ないよう存そん档于2017-10-19）（英えい语）.

[20] Internet Archive: Petabox. archive.org. 2020-06-07 [2020-06-07]. （原始げんし内容ないよう存そん档于2020-06-03）（英えい语）.

[21] Kanellos, Michael. Big storage on the cheap. CNET News.com. 2005-07-29 [2020-06-07]. （原始げんし内容ないよう存そん档于2007-04-03）.

[22] Internet Archive and Sun Microsystems Create Living History of the Internet [互联网档案あん馆和Sun系けい统创造づくり了りょう互联网的鲜活历史]. Sun Microsystems. 2009-03-25 [2020-06-07]. （原始げんし内容ないよう存そん档于2009-03-26）（英えい语）.

[23] Mearian, Lucas. Internet Archive to unveil massive Wayback Machine data center [互联网档案あん馆推出で大だい规模网站时光机つくえ数すう据すえ中心ちゅうしん]. Computerworld.com. 2009-03-19 [2020-09-07]. （原始げんし内容ないよう存そん档于2009-03-23）（英えい语）.

[24] Kahle, Brewster. Wayback Machine: Now with 240,000,000,000 URLs [网站时光机つくえ：现有240,000,000,000个URL]. blog.archive.org. Internet Archive Blogs. 2013-01-09 [2020-06-07]. （原始げんし内容ないよう存そん档于2014-04-14）（英えい语）.

[ia-2013-10-25] Rossi, Alexis. Fixing Broken Links on the Internet. archive.org. San Francisco, CA, US: Collections Team, the Internet Archive. 2013-10-25 [2020-06-11]. （原始げんし内容ないよう存そん档于2014-11-07）. We have added the ability to archive a page instantly and get back a permanent URL for that page in the Wayback Machine. This service allows anyone – wikipedia editors, scholars, legal professionals, students, or home cooks like me – to create a stable URL to cite, share or bookmark any information they want to still have access to in the future.

[vt-207-241-26] The VirusTotal Team. 207.241.226.190 IP address information. virustotal.com. Dublin 2, Ireland: VirusTotal. 2015-03-25 [2020-06-11]. （原始げんし内容ないよう存そん档于2014-07-14）. 2015-03-25: Latest URLs hosted in this IP address detected by at least one URL scanner or malicious URL dataset. ... 2/62 2015-03-25 16:14:12 [complete URL redacted]/Renegotiating_TLS.pdf ... 1/62 2015-03-25 04:46:34 [complete URL redacted]/CBLightSetup.exe

[goog-sb-ia1-27] Advisory provided by Google. Safe Browsing Diagnostic page for archive.org. google.com/safebrowsing. Mountain View, CA, US. 2015-03-25 [2020-06-11]. （原始げんし内容ないよう存そん档于2015-04-06）. 2015-03-25: Part of this site was listed for suspicious activity 138 time(s) over the past 90 days. ... What happened when Google visited this site? ... Of the 42410 pages we tested on the site over the past 90 days, 450 page(s) resulted in malicious software being downloaded and installed without user consent. The last time Google visited this site was on 2015-03-25, and the last time suspicious content was found on this site was on 2015-03-25. ... Malicious software includes 169 trojan(s), 126 virus, 43 backdoor(s).

[28] Internet Archive Frequently Asked Questions. [2020-06-11]. （原始げんし内容ないよう存そん档于2009-02-21）.

[29] Internet Archive Frequently Asked Questions. 2014-12-18 [2020-06-11]. （原始げんし内容ないよう存そん档于2014年ねん12月18日にち）.

[30] Can the manipulation of big data change the way the world thinks? [操みさお纵大数すう据すえ能のう改あらため变世界かい的てき思おもえ维方式しき吗？]. The National. 2017-01-05 [2020-06-07]. （原始げんし内容ないよう存そん档于2017-01-12）（英えい语）.

[31] Crockett, Zachary. Inside Wayback Machine, the internet's time capsule. The Hustle. 2018-09-28 [2020-06-07]. （原始げんし内容ないよう存そん档于2018-10-02）（英えい语）.

[32] Heffernan, Virginia. Things Break and Decay on the Internet—That's a Good Thing. WIRED. 2018-09-18 [2018-10-26]. （原始げんし内容ないよう存そん档于2018-09-25）（英えい语）.

[alexa-2013-10-33] Archive.org Site Info. Alexa Internet. [2020-06-11]. （原始げんし内容ないよう存そん档于2013年ねん10月がつ28日にち）.

[alexa-2015-03-34] Archive.org Site Overview. Alexa Internet. [2020-06-11]. （原始げんし内容ないよう存そん档于2015-04-09）.

[35] Archive.org Traffic, Demographics and Competitors - Alexa. 2019-03-23 [2020-06-11]. （原始げんし内容ないよう存そん档于2019-03-23）.

[36] . Wayback Machine Hits 400,000,000,000!. Internet Archive. 2014-05-09 [2020-06-11]. （原始げんし内容ないよう存そん档于2014-08-26）.

[37] Internet Archive Wayback Machine. 互联网档案あん馆. [2020-06-01]. （原始げんし内容ないよう存そん档于2015-02-13）.

[38] Some sites are not available because of Robots.txt or other exclusions. What does that mean?. 网站时光机つくえ. [2020-06-13]. （原始げんし内容ないよう存そん档于2011-04-15）（英えい语）. ......All of this information is contained in a file called robots.txt. While robots.txt has been adopted as the universal standard for robot exclusion, compliance with robots.txt is strictly voluntary...... Alexa, the company that crawls the web for the Internet Archive, does respect robots.txt instructions, and even does so retroactively. If a web site owner ever decides he/she prefers not to have a web crawler visiting his / her files and sets up robots.txt on the site, the Alexa crawlers will stop visiting those files and mark all files previously gathered as unavailable......sometimes a web site owner will contact us directly and ask us to stop crawling or archiving a site. We comply with these requests.

[39] Cox, Joseph. The Wayback Machine Is Deleting Evidence of Malware Sold to Stalkers. 2018-05-22 [2020-06-13]. （原始げんし内容ないよう存そん档于2018年ねん5月がつ22日にち）.

[40] Robots.txt meant for search engines don't work well for web archives. Internet Archive. 2017-04-17 [2020-06-13]. （原始げんし内容ないよう存そん档于2018-12-04）（英えい语）.

[41] Recommendations for Managing Removal Requests And Preserving Archival Integrity. 加か利り福ぶく尼あま亚大学がく. 2002-12-14 [2020-06-13]. （原始げんし内容ないよう存そん档于2017-09-18）（英えい语）.

[42] Retroactive robots.txt removal of past crawls AKA Oakland Archive Policy. 互联网档案あん馆. 2014-07-07 [2020-06-13]. （原始げんし内容ないよう存そん档于2017年ねん10月がつ10日とおか）（英えい语）.

[43] Mark Graham. Robots.txt meant for search engines don't work well for web archives [用よう于搜索引さくいん擎的robots.txt不ふ适用于网络存档]. Internet Archive Blogs. 2017-04-17 [2020-06-18]. （原始げんし内容ないよう存そん档于2017-04-17）（英えい语）.

[44] Archivierung des Internets: Internet Archive ignoriert künftig robots.txt [互联网档案あん馆：互联网存档馆将はた忽ゆるがせ略りゃくrobots.txt文ぶん件けん]. heise online. [2020-06-18]. （原始げんし内容ないよう存そん档于2017-04-27）（德とく语）.

[45] Suchmaschinen: Internet Archive will künftig Robots.txt-Einträge ignorieren. Golem.de. [2020-06-18]. （原始げんし内容ないよう存そん档于2017-06-19）（德とく语）.

[46] Internet Archive will ignore robots.txt files to keep historical record accurate [互联网档案あん馆将忽ゆるがせ略りゃくrobots.txt文ぶん件けん以保持ほじ历史文ぶん件けん的てき准じゅん确性]. Digital Trends. 2017-04-24 [2020-06-18]. （原始げんし内容ないよう存そん档于2017-05-16）（英えい语）.

[47] Sampath Kumar, B.T.; Prithviraj, K.R. Bringing life to dead: Role of Wayback Machine in retrieving vanished URLs. Journal of Information Science. 2014-11-21, 41 (1): 71–81. ISSN 0165-5515. doi:10.1177/0165551514552752 （英えい语）.

[usn1-48] 48.0 ^48.1 Nelson, Steven. Wayback Machine Won't Censor Archive for Taste, Director Says After Olympics Article Scrubbed. US News. 2016-08-17 [2020-06-20]. （原始げんし内容ないよう存そん档于2017-01-06）. The Wayback Machine's unique search function frequently is used as a tool for journalists to review now-dead websites or to comb through dated news reports. The archived content has been used to embarrass politicians and expose battlefield lies.

[NewYorker-2015-01-26-49] Lepore, Jill. What the Web Said Yesterday. The New Yorker. 2015-01-19 [2020-06-20]. （原始げんし内容ないよう存そん档于2015-01-25）.

[50] The March for Science began with this person's 'throwaway line' on Reddit [为科学かがく游ゆう行こう始はじめ于此人じん在ざいReddit上じょう“一带而过的话”]. Washington Post. [2017-04-23]. （原始げんし内容ないよう存そん档于2017-04-23）（英えい语）.

[:1-51] Are scientists going to march on Washington? [科学かがく家か要よう去さ华盛顿游行ぎょう吗？]. The Washington Post. 2017-01-24 [2020-06-20]. （原始げんし内容ないよう存そん档于2017-01-31）（英えい语）.

[52] Foley, Katherine Ellen. The global March for Science started with a single Reddit thread. Quartz. [2020-06-20]. （原始げんし内容ないよう存そん档于2017-04-24）（英えい语）.

[53] Internet Archive Frequently Asked Questions. 互联网档案あん馆. 2014-04-02 [2020-06-25]. （原始げんし内容ないよう存そん档于2014-04-02）.

[:2-54] 54.0 ^54.1 Using The Wayback Machine. help.archive.org. 互联网档案あん馆. [2020-06-25]. （原始げんし内容ないよう存そん档于2020-07-06）.

[:4-55] 55.0 ^55.1 Bates, Mary Ellen. The Wayback Machine. Online. 2002, 26: 80 –通どおり过EBSCOhost.

[howard_lloyd-56] 56.0 ^56.1 ^56.2 Lloyd, Howard. Order to Disable Robots.txt (PDF). American-Justice.org. 2009-10-15 [2020-06-26]. （原始げんし内容ないよう (PDF)存そん档于2019-08-08）.

[antonio_cortes-57] Cortes, Antonio L. Motion Opposing Removal of Robots.txt. American-Justice.org. 2009-09-29 [2020-06-26]. （原始げんし内容ないよう存そん档于2011-05-13）.

[4]

[5]

[1]

[2]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]


截图 2021年ねん10月がつ的てき网站时光机つくえ首くび页
网站类型	存そん档
成立せいりつ	1996年ねん5月がつ10日とおか，28年ねん前まえ（1996-05-10）
持もち有ゆう者しゃ	互联网档案あん馆
网址	web.archive.org
注ちゅう册さつ	可か选
推出时间	2001年ねん10月がつ24日にち，22年ねん前まえ（2001-10-24）^[1]^[2]
现状	活かつ跃
編へん程ほど語ご言げん	Java、Python