[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Database backup and deduplication question



On 1/8/12 6:05 PM, Colin Percival wrote:
> Hi all,
>
> Sorry about the long silence... between Christmas and New Years and
> FreeBSD 9.0-RELEASE, I've fallen a bit behind on email lately. :-(
>
> On 12/29/11 20:02, Greg Larkin wrote:
>>> I ran those tests as shown, and each one worked as expected with a 2:1
>>> ratio.  Then I ran a few more sequences, in case they help:
>>>
>>> [...]
>>>
>>> The first sequence looks ok (file1 file1 part1), but after that, the
>>> "New data" number increases more than expected.  I have another Mac, and
>>> I'll try the same tests to see if there's any machine-specific issue here.
>> I just ran the same tests with the same files on a different Mac running
>> 10.5.8 instead of 10.6.8.  This time, the results are as expected:
>>
>> [...]
>>
>> If you have any ideas about debug messages or other instrumentation that I
>> can put in the source code, let me know.  I'll compare the shell
>> environment between the two machines to see if that reveals anything
>> interesting.
> Very interesting!  Were these two machines using the same keys?  If not, can
> you have them swap keys and try again to see if it's the machine or the keys
> which determine the behaviour?
>
> Once I know that I should be able to provide you with a debugging patch which
> will help track down the issue.
>
Hi Colin,

All machines have been using unique keys up until now.  Following your
instructions, I swapped keys between the Leopard (Mac Book Pro) and Snow
Leopard (iMac) machines, and for good measure, I installed tarsnap on a
Lion machine (Mac mini) and tried both sets of keys there.  At this
point, it appears to be a machine-dependent problem:


# iMac using Mac Book Pro keys (tarsnap 1.0.31):

sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 2>&1 | grep "^New data"
New data                                105043884        105559008
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 part2 2>&1 | grep "^New data"
New data                                157504658        158272905
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 part2 part3 2>&1 | grep "^New data"
New data                                183736112        184636668


# Mac Book Pro using iMac keys (tarsnap 1.0.28):

sh-3.2# tarsnap --keyfile imac-keys/tarsnap.key --dry-run -cf
testarchive file1 file1 part1 2>&1 | grep "^New data"
New data                                105083384        105589862
sh-3.2# tarsnap --keyfile imac-keys/tarsnap.key --dry-run -cf
testarchive file1 file1 part1 part2 2>&1 | grep "^New data"
New data                                105191397        105696541
sh-3.2# tarsnap --keyfile imac-keys/tarsnap.key --dry-run -cf
testarchive file1 file1 part1 part2 part 3 2>&1 | grep "^New data"
New data                                105191404        105696548


# Mac mini using both sets of keys (tarsnap 1.0.28):

## Using MBP keys
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 2>&1 | grep "^New data"
New data                                105098054        105606433
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 part2 2>&1 | grep "^New data"
New data                                105244753        105751780
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 part2 part3 2>&1 | grep "^New data"
New data                                105331315        105837309

## Using iMac keys
sh-3.2# tarsnap --keyfile imac-keys/tarsnap.key --dry-run -cf
testarchive file1 file1 part1 2>&1 | grep "^New data"
New data                                105083384        105589862
sh-3.2# tarsnap --keyfile imac-keys/tarsnap.key --dry-run -cf
testarchive file1 file1 part1 part2 2>&1 | grep "^New data"
New data                                105191397        105696541
sh-3.2# tarsnap --keyfile imac-keys/tarsnap.key --dry-run -cf
testarchive file1 file1 part1 part2 part3 2>&1 | grep "^New data"
New data                                105279204        105783302


You might think that based on the results above, tarsnap 1.0.31 is the
cause of the problem, since it's the obvious difference between the
three machines.  In order to remove that variable, I built 1.0.28
(current version from MacPorts) on the iMac and retested.  That forced a
rebuild of OpenSSL to bring it up to version 1.0.0f.  Previously, I had
1.0.0e installed on the iMac.  The results are interesting:

sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 2>&1 | grep "^New data"
New data                                105098054        105606442
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 part2 2>&1 | grep "^New data"
New data                                105244753        105751786
sh-3.2# tarsnap --keyfile mbp-keys/tarsnap.key --dry-run -cf testarchive
file1 file1 part1 part2 part3 2>&1 | grep "^New data"
New data                                105331315        105837316

These numbers are what I expect from tarsnap, so is it possible that
some bug in OpenSSL triggered the original deduplication problem that I
noticed? 

Thank you,
Greg