InformationWeek has a good article, Behind The Storage Cloud. This article gives something of the plumbing behind the available storage in the cloud. Something they didn’t talk about in that article are the limitations I have been running into using the cloud.
For infrastructure providers like Google or Force.com, who are offering a PaaS (Platform as a Service), the storage is built into the application. If you chose them to develop your application, that works out fine. However, if you are looking for archiving or storage scaling (grow storage as you need it), it’s not so good.
Amazon offers a different kind of storage. S3 is a web based storage system. It’s like a bucket for data. It’s the biggest bucket you’ll probably ever see, but it’s just a big bucket. When you create a new directory, you’re creating a smaller bucket in the big bucket. The namespace for the bucket is global. That means that your smaller bucket can’t have the same name as someone else’s bucket. That’s a huge limitation.
Another issue with S3 is that it is a web service and not a block device. That means you can’t directly attach it and use it as a file system. You need to make API calls. Even from within EC2 (Amazon’s Cloud Computing Environment), S3 is only accessible through the API using PUT and GET style commands. Amazon is working on allowing EC2 to attach directly and there are other projects working on the same thing, PersistentFS being one of them.
I haven’t found a good, cloud based attachable storage yet. I think Amazon, when they make S3 attachable will be the first (although that might only be attachable from within EC2). What is currently available is a plethora of archival solutions. For home usage, I don’t think anything beats EMC Mozy. For $4.95/month, you get unlimited storage for one PC. It’s slow to add new files (at least for me it is), but overall, I don’t think any of its competitors really compete. I tried two others previously and decided to go with mozy for its price/feature ratio.
For business archival, I don’t know that I would recommend Mozy. It’s not that I would recommend against them, I just think there are better options. Off site, tape backups are still cheap and reliable. For the SMB market, burning a DVD once a week might even be enough. Just depends on your workflow and volume of data.
As a side note, it would be fairly easy to write a custom application to automatically backup changed files to S3. At less than 20 cents per month per GB, that might be a fairly reasonable solution, especially if you frequently need to access the archived data. I might even write a free giveaway to do just that. Just a POC kind of thing.