I have a two-node SOFS cluster with direct-attached storage that hosts HA storage for an RDS setup. It has been in production for at least two years. Over the last few days I noticed a weird trend in disk space usage, checked into it, and found that the Dedup jobs are failing.
If I run the job, I get spammed with warning messages (790 events on my last run):
Data Deduplication was unable to access a file or volume. Details:
CDedupFileFactory::Create(\\?\Volume{72912f5d-c574-4860-b956-74ac0ddcd0cb}\System Volume Information\Dedup\ChunkStore\{39A9D76E-376C-4AAD-9B28-1BB62BFBA566}.ddp\DedupFileList.1.new, ...)
The volume may be inaccessible for I/O operations or marked read-only. In case of a cluster volume, this may be a transient failure during failover.
Operation:
Updating a chunk store file list.
Recovering a chunk store file list from redundancy.
Checking chunk store consistency.
Initializing chunk store.
Running the deduplication job.
Context:
Chunk store: \\?\Volume{72912f5d-c574-4860-b956-74ac0ddcd0cb}\System Volume Information\Dedup\ChunkStore\{39A9D76E-376C-4AAD-9B28-1BB62BFBA566}.ddp
Volume name: \\?\Volume{72912f5d-c574-4860-b956-74ac0ddcd0cb}\
Volume name: C:\ClusterStorage\Volume1 (\\?\Volume{72912f5d-c574-4860-b956-74ac0ddcd0cb}\)
Error-specific details:
Error: CDedupFileFactory::Create(\\?\Volume{72912f5d-c574-4860-b956-74ac0ddcd0cb}\System Volume Information\Dedup\ChunkStore\{39A9D76E-376C-4AAD-9B28-1BB62BFBA566}.ddp\DedupFileList.1.new, ...), 0x80070005, Access is denied.
Then, after a few minutes, I receive an information event:
Optimization job has completed.
Volume: C:\ClusterStorage\Volume1 (\\?\Volume{72912f5d-c574-4860-b956-74ac0ddcd0cb}\)
Error code: 0x80565313
Error message: Data deduplication failed to access the volume. It may be offline.
Savings rate: 73
Saved space: 5826018537547
Volume used space: 2125270257664
Volume free space: 1093804580864
Optimized file count: 104
In-policy file count: 105
Job processed space (bytes): 0
Job elapsed time (seconds): 389
Job throughput (MB/second): 0
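For reference, the two error codes in the events above can be split into the standard HRESULT fields (failure bit, facility, code). This is only a rough sketch in Python; the helper name `decode_hresult` is made up for illustration and is not part of any Windows tool. 0x80070005 comes out as facility 7 (Win32) with code 5, the ordinary "Access is denied" error. 0x80565313 carries a different facility value, which I am assuming is specific to Data Deduplication.

```python
def decode_hresult(hresult: int) -> dict:
    """Split a 32-bit HRESULT into its standard fields (illustrative helper)."""
    return {
        "failure": bool((hresult >> 31) & 1),   # top bit: 1 means failure
        "facility": (hresult >> 16) & 0x7FF,    # 11-bit facility code
        "code": hresult & 0xFFFF,               # low 16 bits: error code
    }


# The two codes copied from the warning and information events.
for value in (0x80070005, 0x80565313):
    fields = decode_hresult(value)
    print(
        f"0x{value:08X}: failure={fields['failure']}, "
        f"facility=0x{fields['facility']:X}, code=0x{fields['code']:X}"
    )
```

So the warnings are a plain Win32 access-denied on the chunk store file list, and the job-level code looks like a separate result reported by the dedup job itself.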
Besides this, everything appears to be working just fine.
I am looking for ideas here.
Thanks