- Mar 16, 2001
- 484
- 0
- 0
You can't beat free backups!
I was able to restore a bunch of data from my website from many months ago. I have backup software now (ARCserve), but this sure came through in a pinch! I only wish it could have saved my off-line data, too.
If you have ever been in that unfortunate scenario, you know this is one of the hottest deals of them all! ...And you have to admit, it's fun to browse some of your old haunts "the way they were".
Check out the Wayback Machine
FAQs
Which sites are available in the Wayback Machine?
The Internet Archive is attempting to archive the entire publicly available web. Some sites may not be included because the automated crawlers were unaware of their existence at the time of the crawl. It's also possible that some sites were not archived because they were password protected, detected webmaster instructions to not be crawled, or were otherwise inaccessible to the Internet Archive's automated systems.
Why are some sites harder to archive than others?
If you look at the collection of archived sites, you will find some broken pages, missing graphics, and some sites that aren't archived at all. The Internet Archive has tried to create a complete archive, but has had difficulties with some sites, because the link structure was not straightforward to crawl.
Can I link to old pages on the Wayback Machine?
Yes! The Wayback Machine is built so that it can be used and referenced by anybody and everybody. If you find an archived page that you would like to reference on your web page or in an article, you can copy the URL and share it with others.
How was the Wayback Machine made?
Over 100 terabytes of data is stored on a couple hundred modified servers situated in the basement of a former military building in the Presidio of San Francisco. Alexa Internet, in cooperation with the Internet Archive, has designed an index that allows browsing of web documents over multiple time periods, and turned this unique feature into the Wayback Machine.
What type of machinery is used in the Wayback Machine?
The Internet Archive is stored on dozens of slightly modified Hewlett Packard and uslab.com servers. The computers run on the FreeBSD and Linux operating systems. Each computer has about 512Mb of memory and generally holds just over 300 gigabytes of data on IDE disks.
How can I get my site included in the Wayback Machine?
Alexa Internet has been crawling the web since 1996, which has resulted in a massive archive. If you have a web site, and you would like to ensure that it is saved for posterity in the Alexa Archive, chances are that it's already there. We make every effort to crawl the entire publicly available web. However, if you wish to take extra measures to ensure that we archive your site, you can visit the Alexa "Archive Your Site" page at http://www.alexa.com/help/webmasters/request_bot.html.
How can I remove my site from the Wayback Machine?
To get your site removed, update your robots.txt file to disallow ia_archiver. Alexa's crawlers will get your new robots.txt, which will make its way into the Wayback Machine and mark all previously archived pages inaccessible
EDIT: Ha! Be sure to check out one of the early Anand pages: Jan 9th, 1998
---
Definitely interesting, but definitely Off Topic material, not a Hot Deal.
AT Mod
I was able to restore a bunch of data from my website from many months ago. I have backup software now (ARCserve), but this sure came through in a pinch! I only wish it could have saved my off-line data, too.
If you have ever been in that unfortunate scenario, you know this is one of the hottest deals of them all! ...And you have to admit, it's fun to browse some of your old haunts "the way they were".
Check out the Wayback Machine
FAQs
Which sites are available in the Wayback Machine?
The Internet Archive is attempting to archive the entire publicly available web. Some sites may not be included because the automated crawlers were unaware of their existence at the time of the crawl. It's also possible that some sites were not archived because they were password protected, detected webmaster instructions to not be crawled, or were otherwise inaccessible to the Internet Archive's automated systems.
Why are some sites harder to archive than others?
If you look at the collection of archived sites, you will find some broken pages, missing graphics, and some sites that aren't archived at all. The Internet Archive has tried to create a complete archive, but has had difficulties with some sites, because the link structure was not straightforward to crawl.
Can I link to old pages on the Wayback Machine?
Yes! The Wayback Machine is built so that it can be used and referenced by anybody and everybody. If you find an archived page that you would like to reference on your web page or in an article, you can copy the URL and share it with others.
How was the Wayback Machine made?
Over 100 terabytes of data is stored on a couple hundred modified servers situated in the basement of a former military building in the Presidio of San Francisco. Alexa Internet, in cooperation with the Internet Archive, has designed an index that allows browsing of web documents over multiple time periods, and turned this unique feature into the Wayback Machine.
What type of machinery is used in the Wayback Machine?
The Internet Archive is stored on dozens of slightly modified Hewlett Packard and uslab.com servers. The computers run on the FreeBSD and Linux operating systems. Each computer has about 512Mb of memory and generally holds just over 300 gigabytes of data on IDE disks.
How can I get my site included in the Wayback Machine?
Alexa Internet has been crawling the web since 1996, which has resulted in a massive archive. If you have a web site, and you would like to ensure that it is saved for posterity in the Alexa Archive, chances are that it's already there. We make every effort to crawl the entire publicly available web. However, if you wish to take extra measures to ensure that we archive your site, you can visit the Alexa "Archive Your Site" page at http://www.alexa.com/help/webmasters/request_bot.html.
How can I remove my site from the Wayback Machine?
To get your site removed, update your robots.txt file to disallow ia_archiver. Alexa's crawlers will get your new robots.txt, which will make its way into the Wayback Machine and mark all previously archived pages inaccessible
EDIT: Ha! Be sure to check out one of the early Anand pages: Jan 9th, 1998
---
Definitely interesting, but definitely Off Topic material, not a Hot Deal.
AT Mod