9
bpulv wrote:
My Backup System
Computer security experts say you should always maintain a minimum of three (3) forms of backup for your photographs. At least one should be off site to protect against loss in the event of fire or natural disaster. The Cloud is not the best choice because your photos will be under the control of a second party that may have technical problems, financial failure or bankruptcy, sale to an unreliable party or may hold your data hostage to rising costs. Furthermore, upload speeds for large RAW files is way too slow and over time causes undue ware to hard drives. I have a system that will store hundreds of thousands of RAW, TIFF and JEPG photographs. Although you do not need anywhere near the 31Tb storage system I have, You will be able to glean some ideas by looking at how I protect my photographs. My system provides both on and off site redundant backup that is completely under my own control.
My system consists of the following:
1) All storage of photographs is external to the computer’s (iMac) internal drive.
2) Downloads from the camera, editing, final product and primary storage is on a 5Tb Western Digital My Book drive. Additional drives can be added as needed.
a) Each shoot is stored in a folder identified by job date and name. The date code is in a format that allows any software to automatically sort my files into date order for easy retrieval. The order is YYYYMMDD_Descriptive Title. For example, 20200521_Jones Family Reunion means that the photos were taken on May 5, 2020 of the Jones Family Reunion.
b) Within each folder are four sub-folders labeled “RAW”, "PSD",“Edit” and “Final”. All uploads are made to the RAW folder and only copies of the raw files are copied to the Edit folder for editing. The final output that may include JEPG, TIFF, etc. files are transferred to the Final folder.
3) The complete primary storage drive is backed up to a Drobo 5-drive RAID (Redundant Array of Independent Disks). This allows multiple backup copies of the data along with offsite backup. In addition, the Drobo’s firmware automatically corrects any data transfer errors and insures that all backup drives have matched data. The RAID consists of five Western Digital Red Drives, which are designed for this type of service and have an excellent reputation for reliability. Furthermore they are “hot swappable” which means they can be safely inserted and removed from the Drobo without removing power from the system. If one or even two hard drives fail, the Drobo will use the remaining three drives to write back the correct data to two replacement drives.
4) The Drobo is only powered on when backups are being made. Otherwise, it is powered off. This limits the exposure to possible hacking and greatly prolongs the life of the backup drives.
5) I have two additional external USB drives that I alternately exchange with my son who lives about twenty miles away. Those drive provide off site storage so in case the house burns down or there is a natural desaster, I will not loose my photographs.
My Backup System br br Computer security experts ... (
show quote)
With all due respect, there are a number of inaccuracies in this post, and it would be a shame to let those influence others that might benefit from good data protection strategies.
First, let’s address what cloud storage actually is. A cloud data center consists of redundant storage, servers, networking and power housed in hardened, secure data center managed by professionals and more importantly, if you choose a major provider such as Amazon, Google, Microsoft Apple, IBM, etc., there will be 3-5 copies of your data at geographically separate data centers to protect against a disaster at any one site. You cannot possibly even come close to this level of professional management, enterprise hardware, redundancy, power conditioning, fire suppression, security or multiple geographically separate copies.
And let’s put a couple of other pieces misinformation concerning the cloud to rest. There has NEVER been a failure, bankruptcy, or “data held hostage” at ANY Major cloud provider. In fact, only ONE second tier provider (Nirvanix) has gone under, and they provided their customers months of notice to move their data. Now if you choose some unknown third tier “cloud” storage such as “Supersynch”, you’re on your own. Can your data be compromised (hacked) in the cloud? Sure, IF you use poor security practice on your end such as single authentication, using the same password at multiple sites, sloppy computing practices such as opening strange emails or attachments, visiting insecure sites, etc - anything that can allow your credentials to be stolen or malware to corrupt your data which is then mirrored to your backups. Are hackers likely to target your data in the cloud? No. Hackers want nice neat databases with social security or credit card data, not wasting time going through ten thousand photos looking for a single nugget of useful information. Finally, many of those that decry the cloud conveniently forget that their really sensitive data and that of at least 2/3 of businesses as well as the data from the nation’s intelligence services (NSA, CIA...) is already in the cloud. Your SSN, credit card, medical records, stock/retirement accounts, any military records banking and tax information is, guess where? In the cloud.
Now let’s address the speed limitation of the internet. If you have a dog slow internet connection AND a huge amount of data, then cloud storage may not be workable for you, but there is a workaround. A number of cloud providers will allow you to provide a disk with your information 5o upload and will provide you with a disk with your information if you need to download the entire contents. Note also that once you have “seeded” the cloud, future uploads consist only of new and changed files and that can be done automatically after hours so it has no impact on you. And the idea that the cloud will wear out your HD uploading is a red herring. Unless you choose not to have ANY backup, then the data has to be read from your HD at some point, regardless as to whether it’s the cloud or local backup.
In short, don’t let this inaccurate fear mongering keep you from keeping a DR copy of your data. Keeping a drive off-site (in a lock box for example) has a number of downsides including the bank (or your relative’s house) being destroyed by fire or flood, the drive not starting (a large percentage of drive failures occur on start up after being down for long periods), the data that is lost between those trips to the bank and the necessity of actually going through this time wasting routine every week or every month. That’s all your data in a single drive that may or may not start. And don’t forget the convenience of being able to access your data from anywhere with a computer and an internet connection. In the week after your home (and both your primary data storage and backup) are destroyed by fire or flood and you need that insurance information and inventory/pictures of your property while staying in a hotel (perhaps without your car), that data is at your fingertips. Finally, the idea of archiving your data on USB drives (as mentioned) or on DVDs is fraught with disaster - neither was ever intended as long term storage and can fail over time.
I also want to address the limitations of RAID storage. A good RAID array can be either faster or more reliable (or both) than a single drive, but they do have limitations. First, there is this idea that each drive in a RAiD array contains all the data or that you can remove one drive and store it as a complete backup for all your data. Neither is true. Unless we’re discussing simple mirroring (RAID 1 or 0+1), with all other RAID levels (0,3,4 or 5 or so called “6”/double parity schemes), each drive contains only a PIECE of your data and /or parity information. If you want to back up a RAID array, you need as much storage as the total of all the drives in the array. Now if you lose a drive AND you have a spare that you insert or comes online automatically, then the drive can be rebuilt as a background task from the other drives. You can continue to access your data, but the performance is typically degraded by ~50%. During that degraded period while a drive is being rebuilt (which can take days with large drives), a second drive failure during that time results in data loss. Because of this, many RAID arrays allow dual parity drives which can handle dual drive failure, BUT then you need two spare drives to rebuild, and double drive failures DO occur, usually as the result of power line issues, lightning strikes or controller failures. Finally, there is no RAID “standard” except mirroring although the RAID levels are somewhat defined. What that means is if your controller board dies, you’ll need another array of the same type/mfg. to get to your data - you can’t simply move the drives to a different array and expect then to be read.
Whew! Sorry for the long post, but after 25+ years specializing in data storage, i’ve seen too much lost data by too many causes, so I’m preaching 3 copies of your data - primary, backup and off-site/DR, and don’t let fear of not “having control of your data’ scare you away from the cloud, which for most users, is by far the safest method of off-site storage.