Finally, an archive.org URL at the top instead of archive.ph. Nice one.
The later has a "bot protection" page that looks like Cloudflare's but someone suggested it is not. Makes sense because archive.today and Cloudflare were in a spat some time ago. Archive.today wanted to allow monitoring of users' locations, e.g., via EDNS Client Subnet, but Cloudflare did not send ECS.
Unlike Archive.today, Internet Archive does not try to force users to enable Javascript or make them solve CAPTCHAs. Nor does Common Crawl.
It is interesting to contrast the Internet Archive (IA) with Archive.today. The later is vague about how it is funded and admits it could sell out to advertisers in the future.^1 There is obviously no small amount of data it could collect about user interests and behaviours that could be used to support advertising. For example, what usage data does it store, if any. What are the Terms of Use for that data. There are no public statements about any restrictions on what the website operator can do. The operator invites users to "Ask me anything" but AFAICT the website has no Privacy Policy. The operator admits it sends the client's IP in a X-Forwarded-For HTTP header. This is not something one would experience with IA. The server hosting the page being archived receives an IA IP address as the client IP address, not an IA user's IP address. IA has a Privacy Policy, last updated in 2001.^2 Unlike Archive.today, I feel reasonably confident IA wil not sell out to commercial interests but who knows.
1. From Archive.today's FAQ:
How is the archive funded?
It is privately funded; there are no complex finances behind it. It may look more or less reliable compared to startup-style funding or a university project, depending on which risks are taken into account.
Will advertising appear on the archive one day ?
I cannot make a promise that it will not. With the current growth rate I am able to keep the archive free of ads. Well, I can promise it will have no ads at least till the end of 2014.
2. YMMV, but IME but the fewer "updates" to a Privacy Policy over time the better.
Correction: FAQ states client IP address (instead of Archive.today IP address) is sent but not in a X-Forwarded-For header.
Archive.today FAQ:
Do you preserve archivers' privacy? E.g. not disclose the source IP address?
Yes.
But take in mind that when you archive a page, your IP is being sent to the the website you archive as though you are using a proxy (in X-Forwarded-For header). This feature allows websites (e.g shops or the sites with weather forecast) target your region, not mine.
The later has a "bot protection" page that looks like Cloudflare's but someone suggested it is not. Makes sense because archive.today and Cloudflare were in a spat some time ago. Archive.today wanted to allow monitoring of users' locations, e.g., via EDNS Client Subnet, but Cloudflare did not send ECS.
Unlike Archive.today, Internet Archive does not try to force users to enable Javascript or make them solve CAPTCHAs. Nor does Common Crawl.
It is interesting to contrast the Internet Archive (IA) with Archive.today. The later is vague about how it is funded and admits it could sell out to advertisers in the future.^1 There is obviously no small amount of data it could collect about user interests and behaviours that could be used to support advertising. For example, what usage data does it store, if any. What are the Terms of Use for that data. There are no public statements about any restrictions on what the website operator can do. The operator invites users to "Ask me anything" but AFAICT the website has no Privacy Policy. The operator admits it sends the client's IP in a X-Forwarded-For HTTP header. This is not something one would experience with IA. The server hosting the page being archived receives an IA IP address as the client IP address, not an IA user's IP address. IA has a Privacy Policy, last updated in 2001.^2 Unlike Archive.today, I feel reasonably confident IA wil not sell out to commercial interests but who knows.
1. From Archive.today's FAQ:
How is the archive funded?
It is privately funded; there are no complex finances behind it. It may look more or less reliable compared to startup-style funding or a university project, depending on which risks are taken into account.
Will advertising appear on the archive one day ?
I cannot make a promise that it will not. With the current growth rate I am able to keep the archive free of ads. Well, I can promise it will have no ads at least till the end of 2014.
2. YMMV, but IME but the fewer "updates" to a Privacy Policy over time the better.