[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Listing the deltas



On 06/06/12 21:21, Ted Unangst wrote:
> On Wed, Jun 06, 2012 at 20:24, Colin Percival wrote:
>> On 06/06/12 05:19, Tom Limoncelli wrote:
>>> On a machine where very little changes each day, there always seems to
>>> be 20 MB of "new data".  Is there a way to list which files changed
>>> and were backuped?
>>
>> No.  The way the tarsnap code works internally, it generates a complete
>> archive and then figures out which blocks duplicate data previously
>> uploaded; the deduplication code doesn't have any way of knowing which
>> files data originally belonged to (and in fact there will be blocks
>> with data aggregated from multiple files).
> 
> If I remember correctly, tarsnap is only looking at aligned blocks,
> too.

You do not remember correctly.  Tarsnap uses blocks varying between
16kB and 256 kB (average size 64 kB), with no particular alignment.

> And is the block size 8k?  (As I recall, prepending 4k of new
> data to an existing 8k file will result in 12k of new data, since the
> already uploaded data does not match the new file's block alignment.)

For that case, yes.  But prepending 4k to a 1 MB file will usually only
result in one new block.

-- 
Colin Percival
Security Officer Emeritus, FreeBSD | The power to serve
Founder, Tarsnap | www.tarsnap.com | Online backups for the truly paranoid