Sync It. Sync It Good
You've either had, or will have, a data loss disaster. Mine came in 1999, when we moved from Phoenix back to Boston. Booting my primary PC into Windows Setup, I chose what I thought was the correct partition to which to install the OS, tapped Enter on the keyboard and ... Gone. I wiped out years of email and some other important documents, and though I tried to recover it, whatever primitive forensic tools I had at the time didn't do the job.
Even the most skeptical among us tends to find religion when disaster strikes. I certainly did. My data loss episode triggered a never-ending, compulsive need to back everything up regularly, and the piles of backup media I still have resembles a computer history walkthrough. There's tape, ZIP disk, CD, DVD, DVD-DL, and various internal and external hard drive types, including a crazy set of 1 TB Firewire 800-based drives that cost $999 each at the time and were swapped out on my Windows Server box on a weekly basis. (And give me some credit for having a pre-cloud off-site backup solution. My parents live in the same town.)
But, time moves on and as cloud services become more accessible, affordable, and capable, the mind turns naturally to a hosted solution. After all, why should storage and backup be any different than hosted productivity servers (email, calendaring, communications), management solutions (System Center, InTune), or any of the other offerings Microsoft and its competitors are now racing to put up in the cloud?
Actually, as it turns out, storage/backup is quite a bit different. Sure, companies like Amazon with its S3 service and a host of smaller, consumer-oriented solutions (Carbonite, Mozy, and so on) provide companies and individuals with a variety of options for online backup. But as I examine my own usage patterns and consider the needs of business users, it occurs to me that there may be a better way. That is, most of us don't necessarily need a file store in the cloud. What we need is for our data to be wherever we are. We need sync.
A recent development in Microsoft's consumer-oriented cloud services triggered this change in perspective. For years, I've used Microsoft Live Mesh, an always-in-beta incubation project, to synchronize my important data--documents related to work, favorite photos, and so on--between my various PCs and a cloud-based Mesh "web desktop." This web desktop provided up to 5 GB of storage and acted as off-site backup, essentially, and since it was available via any web browser I could theoretically get at these important files from anywhere in the world if I had to.
Live Mesh was fast, reliable, and efficient, and I used it over the course of my previous three books, including "Windows 7 Secrets," during which time I shared the book-related documents, over Mesh, between myself and my co-author. It worked wonderfully.
This month, Microsoft is beginning the transition from Mesh to Live Sync, the new version of which is based on Live Mesh, but with some important differences. The company is connecting the service to Windows Live SkyDrive, which provides an ample 25 GB of storage space, but it is paradoxically lowering the amount of content you can sync to the cloud from 5 GB to 2 GB, and not raising the limit as one might logically have expected.
My initial reaction to this was, of course, negative. But as I moved my content from Mesh to Live Sync and thought about how I used this service, it suddenly occurred to me that I had never once even used the web desktop. And since Live Sync can sync a virtually unlimited amount of content between PCs in peer to peer mode, the 2 GB limit isn't really an issue. It applies only to what gets synced to the cloud.
Playing devil's advocate, it's pretty clear that Microsoft isn't yet interested in hosting cloud-based backup. And in a blog post describing this transition from Mesh to Live Sync, the company noted that the costs associated with such a service would be daunting--indeed, untenable--when scaled to meet the needs of the hundreds of millions of customers it has. Furthermore, Microsoft found that almost no customers were even using an appreciable part of that 5 GB of Mesh cloud-based storage anyway. So this isn't really a problem for most of its existing customers, just as it wasn't for me personally.
The value of peer of peer document sync cannot be overstated. Many times, with either Mesh or, now, Live Sync, I've started working on a document on a PC in one room, gotten up, walked into another room, sat down at a second PC, opened the same document, and gotten right back to work. All of the edits and changes I just made at the first PC are present, and no work is lost. Peer to peer sync provides a way for me to work on documents on the road and have the changes synced back to my desktop PC at home, in real time. It is, in other words, what many people are really talking about when they think of backup. It's really about replication, about storing important documents and files in two or more physical places so that if one machine is lost or stolen, or otherwise inaccessible, the data lives on.
Now, sync doesn't replace traditional backup. There will always be needs, including regulatory needs, for data to be backed up and protected in more traditional ways. If you think about how Microsoft's latest version of Data Protection Manager (DPM) works, it's hard not to imagine a more pervasive cloud piece to this tiered backup solution (beyond the already-present single vendor solution through Iron Mountain). Some data needs to be backed up locally and easily accessible. Some needs to be semi-permanently archived to tape. Some needs to be in the cloud or elsewhere offsite but accessible if needed. And a working set of data needs to be synced, something that can occur side-by-side with whatever backup solution you're using.
In the cloud era, there will be concerns around security and data breeches, and with performance. But as with any cloud concerns, these issues will be addressed over time as the economics of not maintaining terabytes of data locally will become more and more of a selling point. Just as IT departments will need to wrestle with shifting job duties as more services are hosted off-site, we will need to formalize how and where our important data is hosted and made available.
Of course, looking around the Microsoft ecosystem, you see many examples of replication. There's the Drive Duplication technology in Windows Home Server that I suspect will make its way to future Windows Server and Windows client versions. There's Active Directory replication between domain controllers. Folder replication in managed environments. And so on. These and other similar technologies all point the way, I think, to the way we can and should manage our data going forward, and that's true for both individuals and companies of all sizes.
How we get there will depend largely on the technologies we adopt. Generally speaking, I think that cloud backup won't be as important as people may now believe. More critical is that our data be made available when and where we want it. And from what I can see, that's not backup. That's sync.
An edited version of this commentary appeared in the June 15, 2010 issue of Windows IT Pro UPDATE. --Paul