Enterprise Storage Data Recovery: ZFS, Storage Spaces, SAN and iSCSI

Faulted ZFS pool, detached Storage Spaces virtual disk, accidentally initialized SAN LUN. In enterprise storage recovery one wrong command turns a recoverable situation into permanent loss. Uberblock and metadata, parity and slabs, LUN mapping, the danger of rebuild/resilver, and the image-first methodology.

When enterprise storage systems fail, the problem is far larger than a single disk failure. When a ZFS pool drops into a faulted state, when a Windows Storage Spaces virtual disk is marked detached, or when a LUN on a SAN is accidentally reinitialized, what is lost is not just blocks. The company accounting database, the virtualization datastores, the mail server mailbox files and years of accumulated documents all become inaccessible at once. This is why enterprise storage data recovery is a fundamentally different discipline from single disk recovery. The margin for error is very small here, and one wrong command can turn a recoverable situation into permanent data loss.

At DSET we have served enterprise customers from Ankara Hacettepe Teknokent since 2003. We have worked on ZFS pools, Storage Spaces, SAN arrays connected over fiber channel and iSCSI, NAS units and virtualization infrastructures. The most devastating scenarios we have seen over the years almost always follow the same pattern: a disk fails, the system administrator panics, starts a quick repair, rebuild or resilver, and that operation kills the other already tired disks and collapses the entire array. The purpose of this article is to explain these traps with technical depth and to set out why enterprise arrays must always be imaged first.

In the sections that follow we will cover why ZFS pool architecture is so sensitive, what can go wrong in the parity and mirror layouts of Storage Spaces, SAN LUN corruption and reinitialization accidents, iSCSI target problems, and most importantly the image-first recovery methodology. The goal is to help whoever performs the first response in an enterprise storage incident make the right decision.

Quick Answer

When an enterprise storage array (ZFS, Storage Spaces, SAN or iSCSI) fails, the first and most important thing to do is to stop all write operations and start no repair, rebuild or resilver. Put the array into a read-only state, label the member disks in their physical order, and wait for a professional recovery that takes a one-to-one sector image of each disk first. In ZFS pools, commands like zpool clear, zpool scrub or a forced import run against corrupt metadata can permanently destroy recoverable data. In Storage Spaces, forcing a detached virtual disk back online can damage parity information. On the SAN side, attempting recovery on top of an accidentally initialized LUN produces new writes. In all these cases the correct approach is the same: stop, image, then work on image copies. Never touch the original disks.

ZFS Pool Architecture and Why It Looks So Fragile

ZFS is a copy-on-write file system and volume manager. It writes data onto building blocks called vdevs (virtual devices), and these vdevs combine to form the pool. A vdev can be a mirror, raidz1, raidz2 or raidz3. The integrity of the pool depends on the health of all of its vdevs: if a single vdev is permanently lost, the entire pool drops into a faulted state because that vdev alone has no redundancy at the pool level. This is the most misunderstood point about ZFS. People know that raidz2 can survive two disk failures, but that tolerance applies within each vdev, not across vdevs.

ZFS protects every block with a checksum and keeps multiple copies of metadata. The state of the pool is governed by a structure called the uberblock. On each disk, several historical copies of the uberblock are stored in a circular array. The system normally uses the most recent consistent uberblock. When disks are abruptly cut off or a power loss leaves a half-finished transaction group (TXG), the latest uberblock may be inconsistent. In a healthy scenario ZFS rolls back to a previous consistent TXG. However, when the upper levels of the metadata tree are damaged, the pool becomes impossible to import and you encounter messages like cannot import: I/O error or one or more devices is currently unavailable.

The distinction between degraded and faulted is critical. A degraded pool is still online and the data is accessible, only one or more vdevs are running with reduced redundancy. A faulted pool, on the other hand, is inaccessible. The most dangerous thing a system administrator can do is to try to bring a faulted pool back immediately. A forced import with zpool import -F makes ZFS rewind the last few TXGs to return to a consistent point; sometimes it works, but if the underlying disk integrity is already compromised this operation produces new writes and worsens the situation. Attempting a forced import without imaging first is gambling.

The correct way to recover data on ZFS pools is to image all the physical disks that make up the pool, then attach those images as read-only to a recovery environment and resolve the pool either in read-only mode (zpool import -o readonly=on) or by reading from the image copies directly. If a disk is missing, the missing data can be reconstructed using raidz redundancy math, but that calculation is performed against the images, not the original disks. For more technical detail on other file systems on the Linux side, see our Linux server data recovery guide.

Missing Devices, Corrupt Metadata and Uberblock Issues in ZFS

When a ZFS pool cannot be imported, the first question is whether the problem is physical or logical. In physical problems one or more disks are entirely lost or returning read errors. The zpool status output shows devices in UNAVAIL or REMOVED state. The most common mistake here is to reinsert the missing disk and let the system resynchronize it. If the other disks are also in marginal condition, the resilver that starts tires them as well and the pool collapses completely.

Logical problems are more insidious. A half-finished TXG, a corrupted space map or a damaged MOS (Meta Object Set) can make a pool impossible to open. ZFS's ability to roll back uberblocks is powerful but not always sufficient. In some cases even when the correct uberblock is selected, the metadata blocks it points to may have been damaged for another reason. At this point the structure of the pool must be examined read-only with zdb (ZFS Debugger). Commands like zdb -e -bcsvL perform an integrity scan without mounting the pool and show which objects are readable.

The critical point we must underline here is this: aggressive repairs are dangerous. The zpool clear command resets error counters and sometimes forces ZFS to use problematic devices again. If a device is actually returning corrupt data, after a clear ZFS may accept that data as valid and rewrite it under checksum protection; this means writing on top of silent data corruption. Similarly, starting a scrub on a faulted pool sends still-weak disks into a long read marathon, and during that process a second disk failure can make the pool unrecoverable.

The correct methodology is to work without ever writing to the original disks. After each disk is imaged, the recovery specialist can try different uberblocks on the image copies, roll back to different TXG points, and safely test which scenario yields the most consistent data. If one image attempt fails, another copy is taken and the process restarts. As long as the original disk is preserved, there is an unlimited number of attempts. This is the fundamental philosophy of ZFS recovery.

Windows Storage Spaces: Parity, Mirror and Detached Virtual Disks

Windows Storage Spaces is an abstraction layer that gathers physical disks into a storage pool and creates virtual disks (storage spaces) on top of it. A storage space can be configured with one of three resiliency types: simple (no redundancy, performance focused), mirror (two or three way copy) and parity (RAID 5 like, space efficient but with low write performance). Storage Spaces distributes data across physical disks in blocks called slabs, and this mapping is kept in the pool metadata.

The most frequent Storage Spaces problem we encounter in enterprise environments is the virtual disk dropping into a detached state. This usually happens when enough physical disks in the pool go offline. For example, in a two way mirror space, if the disks holding both copies are lost, or in a parity space if disks fail beyond the tolerance limit, Windows marks the virtual disk as detached and the data becomes inaccessible. A more subtle scenario is when the disks are actually healthy but temporarily invisible due to a controller, cable or backplane problem. In this case, even when the disks return, the virtual disk may still remain detached because the pool metadata has frozen in an inconsistent state.

The situation is more complex in parity spaces because Storage Spaces parity works differently from classic hardware RAID 5. Slabs and parity information are distributed across disks in a Windows specific layout, and when a disk is missing the reconstruction of the missing slabs is performed according to Windows's own algorithm. Replacing a missing disk and forcing the pool to repair itself (repair-virtualdisk) is dangerous if the other disks are in marginal condition. The repair operation produces intense reads and writes across the pool, and if a second disk fails under this load the parity space becomes unrecoverable.

Attempting to force a detached virtual disk online with PowerShell (Connect-VirtualDisk or a read-write attach) makes new writes to the pool structure if the metadata is already inconsistent and can worsen the situation. The correct approach is again the same: image all the physical disks belonging to the pool, resolve the Storage Spaces metadata structure on the images, reconstruct the slab mapping and extract the data from the image copies. Because the slab based structure of Storage Spaces is complex, this analysis requires expertise. For general server recovery scenarios, our server data recovery page explains how the process works.

SAN LUN Corruption and Accidental Reinitialization

SAN (Storage Area Network) arrays are the heart of enterprise infrastructure. Multiple physical disks are gathered into RAID groups (or storage pools in modern arrays) by the array controller, and logical units called LUNs (Logical Unit Numbers) are carved out of these pools and presented to servers over fiber channel or iSCSI. On the server side a LUN appears like an ordinary block device, and a file system such as NTFS, VMFS, ext4 or another is written onto it. This multi-layered structure makes SAN recovery especially difficult because the problem can be at the RAID level of the array, at the LUN mapping level or at the file system level.

One of the most devastating SAN accidents is an accidental LUN initialize or reinitialize. When an administrator selects the wrong LUN in the array management interface and reinitializes or formats it, the array resets the mapping table for that LUN. The interesting thing is that in most cases the underlying data blocks still remain on the physical disks; only the metadata describing where the LUN starts and how it is mapped has been erased. This creates a window of hope for recovery, but only as long as no new data is written to that LUN. If the array starts using that LUN for a new task after reinitialization, the old data blocks are overwritten and the chance of recovery decreases with every passing minute.

LUN corruption can also occur without a physical disk failure. An array controller firmware bug, an uncontrolled power loss, or a failed failover in dual controller arrays can leave the LUN metadata inconsistent. In this case the LUN may still be presented to the server but the file system on it becomes unreadable, or the server sees the LUN as raw (RAW). A novice reflex here is to try to repair the file system on the server side (with chkdsk or fsck). This is very dangerous because if the underlying block layout is corrupted at the LUN level, the file system repair makes changes on top of inconsistent metadata and scrambles the real data structure even further.

The golden rule of SAN recovery is to address the problem from the array side, not the server side. The first step is to label all the physical disks belonging to the array in their physical order, remove them, and take an image of each. Because the RAID layout and LUN mapping can be proprietary in modern arrays, the recovery specialist must reverse-engineer this vendor specific layout on the images. As long as no writes are made to the disks, even an accidentally initialized LUN is often highly recoverable. The critical thing is to stop the array as soon as the accident is noticed and allow no rebuild or new task assignment on any disk. For more information on collapse processes and costs at the RAID level, see our article on the RAID 5 collapse recovery process.

iSCSI Target Problems and the Risks Introduced by the Network Layer

iSCSI is a protocol that carries SCSI commands over TCP/IP and provides SAN-like block storage access without fiber channel hardware. An iSCSI target is defined on a storage array or a software defined server, and clients called initiators connect to this target to use the remote block device as if it were a local disk. The flexibility of this architecture makes it very common in enterprise virtualization and database environments, but the network layer introduces new risk vectors.

The most common iSCSI related data loss scenario is the corruption of the storage structure the target relies on (a ZFS zvol, an LVM volume, a file based image or a hardware array LUN). iSCSI here is only the carrier; the actual data sits on the block device behind it. For this reason, the first thing to do in an iSCSI incident is to determine exactly what the target is built on. For example, if the target is a ZFS zvol, the problem is actually a ZFS pool problem and the ZFS methodology described above applies.

The network layer brings its own dangers. When iSCSI connections drop and reconnect over an unstable network, the file system on the initiator side may see half-finished writes and fall into an inconsistent state. Worse still is the case where two initiators mount the same iSCSI target in a shared manner (without a cluster-aware file system). This configuration mistake causes both servers to make independent writes to the same block device and the file system to corrupt rapidly. Such split-brain corruptions are among the hardest scenarios to recover because they contain two different sets of writes interleaved at the block level.

The correct approach in iSCSI target recovery is to image the block device the target relies on without performing any repair on the initiator side. If the target is on a hardware array, the array side rules apply. If it is on a software defined server, the disks on that server must be imaged. Forcing the connection to reestablish and the file system to mount can produce new journal writes on top of inconsistent metadata. For environments where iSCSI and NAS protocols overlap, our NAS data recovery service covers these mixed scenarios as well.

The Danger of Rebuild and Resilver: Loading Failing Disks

The pattern we see over and over again in enterprise storage recovery, and the one that causes the most data loss, is starting a rebuild or resilver on a collapsing array. To understand these operations, we need to explain technically why they are dangerous. A RAID group, a ZFS vdev or a Storage Spaces parity space runs with redundancy: when a disk fails, the system can recalculate that disk's data from parity or mirror copies on the other disks. When a new disk is inserted, a rebuild (hardware RAID), resilver (ZFS) or repair (Storage Spaces) operation begins, and this operation requires reading almost all of the other disks in the array.

The problem is this: if a disk in an array has failed, the remaining disks usually come from the same batch, of the same age and from the same operating environment. So if one disk has died, the others may also be near the end of their lives. The rebuild operation puts these tired disks under an uninterrupted read load lasting hours or even days. If a second or even third disk fails under this intense load, the redundancy limit is exceeded and the array becomes unrecoverable. In the industry this is called multiple disk failure during rebuild, and the risk has grown as the rebuild times of high capacity disks have lengthened.

Latent sector errors enlarge this risk further. Disks that are never read during normal operation may have silently accumulated unreadable sectors. These sectors only surface during a rebuild when that data is first attempted to be read. At exactly the critical moment, while the system is already trying to compensate for a failed disk, encountering an unreadable sector on a second disk can halt the rebuild and leave the array in a half-repaired, inconsistent state. Because ZFS resilver performs checksum verification it catches silent corruption, but this also slows the operation and keeps weak disks under load for longer.

This is why the first response of recovery specialists is to stop a rebuild or resilver that has started, if possible, and to start no new one. The correct order is this: first stop the array, then image each disk, then run the rebuild logic virtually on the image copies. When a disk gives an unreadable sector on the image, it does not kill the recovery process; that sector can be skipped, completed from another copy or calculated from parity. On the original hardware there is no such tolerance. This is exactly the most concrete benefit of the image-first approach. Even in non-RAID systems, for example flexible arrays like Synology SHR, the same principle applies; our Synology NAS collapse recovery article addresses this special case.

Why Enterprise Arrays Require Image-First Recovery

There is a single principle repeated in every section so far: stop the writes first, then image the member disks. We need to explain holistically why this principle is indisputable in enterprise arrays. An enterprise array is a single logical whole made up of many interrelated physical disks. The correct order of the data can only be resolved when a consistent snapshot of all disks is taken. If some disks of the array continue to change during recovery, the ground of the solution keeps shifting and a consistent reconstruction becomes impossible.

The first step of image-first recovery is to completely cut off write traffic. This means shutting down servers, removing the host connections of the array, or putting the array into a read-only state. Then each physical disk is labeled in its physical slot order. This labeling is vital because RAID and ZFS reconstruction depend on disk order; if the disks get mixed up the layout cannot be resolved. Then each disk is imaged with a one-to-one sector copy method through a hardware write-protected bridge. For disks that return read errors, special imaging hardware retries weak sectors repeatedly to pull as much data as possible.

After the images are taken, all analysis and reconstruction are performed only on these copies. The original disks wait in a secure vault and are never altered again. The greatest advantage this approach provides is the right to unlimited attempts. The recovery specialist can try different uberblocks in ZFS, different RAID parameters in SAN or different slab mappings in Storage Spaces. The failure of each attempt simply means restoring the image copy and starting over. No attempt puts the original data at risk.

The image-first approach also removes time pressure. While working on live hardware, every second carries the risk of a weak disk dying completely. Once the images are taken this pressure is lifted and the specialist can patiently resolve even the most difficult logical corruptions. In an enterprise data loss, haste is the greatest enemy; imaging turns haste into a methodical process. You can get detailed information about the scope of our services and the process flow from our services page.

Enterprise Data and Confidentiality: Protecting Corporate Data

An enterprise storage recovery job is far more than a technical task; it is also a relationship of trust. ZFS pools, SAN arrays and Storage Spaces spaces usually hold a company's most sensitive assets: financial records, customer databases, employee personal data, contracts, source code and intellectual property. Because this data passes into the hands of a third party during the recovery process, protecting confidentiality is as important as technical success.

The professional recovery process requires the disk images taken to be stored in access-controlled environments. The images are securely destroyed according to an agreed policy after the recovery is complete and the customer has received their data. Documenting this lifecycle is important for companies subject to data protection regulations such as KVKK; a data recovery process must itself ensure the security of the personal data it processes. Confidentiality agreements and data processing responsibilities are an inseparable part of this process.

Continuity planning frameworks such as NIST SP 800-34 recommend that recovery operations be treated as part of a disaster recovery and business continuity plan. This means that recovery is not just about getting the data back but also about understanding how the incident happened and preventing its recurrence in the future. When working with an enterprise customer, we not only extract the lost data but also report the root cause (insufficient redundancy, unmonitored disk health, a missing backup strategy) and offer recommendations for future resilience.

When an enterprise storage incident happens, the first contact is very important. The right first response determines the chance of recovery. So if a ZFS pool, Storage Spaces space, SAN LUN or iSCSI target is having problems, we recommend that you contact us before attempting any repair.

FAQ

My ZFS pool is faulted and cannot be imported. Should I try zpool import -F? No, you should not try it first. zpool import -F forces ZFS to rewind the last few TXGs to return to a consistent point and in some cases it works, but if the underlying disk integrity is already compromised it produces new writes and can worsen the situation. The correct approach is to image all the pool disks first and to perform attempts like a forced import only on the image copies. No command that writes to the original disks should be run during the first response.

My Storage Spaces virtual disk became detached. Can I force it back online with PowerShell? The detached state usually results from the pool metadata remaining inconsistent or from enough physical disks being offline. Forcing the virtual disk online or running repair-virtualdisk makes new writes to the pool structure if the metadata is already inconsistent and can damage parity information. If the other disks are in marginal condition, a second disk failure during repair can make the space unrecoverable. You must image all the pool disks first and resolve the slab structure on the images.

We initialized the wrong LUN on the SAN. Is the data completely gone? In most cases no, but you need to act fast. A LUN initialize operation usually only resets the mapping metadata; the underlying data blocks still remain on the physical disks. However, if the array starts using that LUN for a new task and writes over it, the old data is lost with every passing minute. Stop writing to the array immediately, allow no rebuild or new task assignment on any disk, and get professional support to have the disks imaged.

A disk failed and I inserted a new disk and started a resilver, now the pool is in a worse state. What happened? A resilver requires reading almost all of the disks in the array. Because the other disks are of the same age and from the same environment they may be tired, and a second disk may have failed under this intense read load. Latent sector errors also surface for the first time during a resilver. The right thing to do in this situation is to stop the resilver, start no new operation, and image the disks and perform the reconstruction on the image copies. Unreadable sectors on the images do not kill the recovery.

How is the confidentiality of our company data protected during recovery? The disk images taken are stored in access-controlled environments and only the specialists assigned to the recovery process can access them. After the recovery is complete and the data is delivered to the customer, the images are securely destroyed according to an agreed policy. Documenting this lifecycle is important for enterprise customers subject to regulations such as KVKK. Confidentiality agreements are an inseparable part of the process and data processing responsibilities are clarified from the very beginning.

Conclusion

Enterprise storage data recovery is a fundamentally different discipline from single disk recovery. ZFS pools, Windows Storage Spaces spaces, SAN arrays and iSCSI targets are complex logical wholes made up of many interrelated physical disks, and so a mistake made during the first response can turn a recoverable situation into permanent loss. The principle we have emphasized over and over in this article is clear: stop the writes, start no repair, rebuild or resilver, and image the member disks first. All analysis and reconstruction must be performed only on the image copies, and the original disks must never be altered.

Avoiding aggressive force import and clear commands in ZFS, not forcing detached disks online in Storage Spaces, not writing new data to accidentally initialized LUNs on a SAN, and not loading resilver work onto failing disks; all of these rest on the same fundamental logic. Enterprise data is also the most sensitive data, and protecting its confidentiality is as important as technical success.

At DSET we have performed enterprise storage recovery from Ankara Hacettepe Teknokent since 2003. If a ZFS pool, Storage Spaces space, SAN LUN or iSCSI target is having problems, contact us before any attempt. The right first response determines whether your data comes back.