PAX Extract file fix Pull#2 #19

ovidiul · 2017-03-13T13:41:07Z

No description provided.

fixing directory size to match zero on TAR archives FileInfo

FileInfo code display fix

When a file's size changes while it is being added, the archive becomes corrupt because the size recorded in the header differs from the size of the data actually written. An additional check should therefore be done, and if the sizes differ, the header should be updated. A quick test is to run a loop that keeps appending 1 byte to a test.txt file while that file is being added to the archive, and then to extract the archive.
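The check described above could be sketched roughly as follows. This is a minimal illustration, not the code from this pull request: `copyFileData()` is a hypothetical helper that copies the data, pads it to the 512-byte TAR block size, and reports how many bytes were really written, so the caller can compare that number with the size it put in the header.

```php
<?php
/**
 * Hypothetical helper (not part of the library): copy all data from $in to
 * $out, pad the result with NUL bytes to a 512-byte boundary, and return the
 * number of data bytes actually copied (padding excluded).
 */
function copyFileData($in, $out): int
{
    $written = 0;
    while (!feof($in)) {
        $chunk = fread($in, 8192);
        if ($chunk === false || $chunk === '') {
            break;
        }
        fwrite($out, $chunk);
        $written += strlen($chunk);
    }

    // TAR data is stored in 512-byte blocks, so fill up the last block.
    $padding = (512 - $written % 512) % 512;
    if ($padding > 0) {
        fwrite($out, str_repeat("\0", $padding));
    }

    return $written;
}
```

If the returned count differs from the size taken from `stat()` before the copy started, the header written earlier is stale and would need to be rewritten with the real size (or the entry rejected).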

splitbrain

I'm not comfortable merging this until I fully understand what the x header is doing.

splitbrain · 2017-03-19T09:18:49Z

tests/tar.test.php

@@ -186,7 +186,38 @@ public function test_dogfood()
            unlink($archive);
        }
    }
+
+	/**


Indentation seems wrong again. Please make sure you use 4 spaces for indentation, not tabs!

splitbrain · 2017-03-19T09:21:35Z

src/Tar.php

+            $filename = trim($this->readbytes(ceil($return['size'] / 512) * 512));
+            // next block is the real header
+            $block  = $this->readbytes(512);
+            $return = $this->parseHeader($block);


I don't understand. You are not using the data read from x header. $filename is never used. You basically just read and ignore the x header.

Since your tests work, I assume the original header already contains the proper UTF-8 filename? Would a very long UTF-8 filename have an x and an L header?

splitbrain · 2017-03-19T09:37:30Z

Looking at the IBM document you provided in #18, an x header contains key-value pairs. I dumped the header from your files and that confirms it:

28 path=./4слайд.jpg
20 ctime=1489408528
20 atime=1489408285
23 SCHILY.dev=16777220
23 SCHILY.ino=53836974
18 SCHILY.nlink=1

So for implementing it correctly, the path key in this header should overwrite the filename from the standard header.
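A minimal sketch of how such records could be parsed, assuming the usual pax record layout `<length> <key>=<value>\n`, where the decimal length counts the whole record including the length digits, the space, and the trailing newline. `parsePaxRecords()` is a hypothetical name, not a function in this pull request, and charset conversion is left out.

```php
<?php
/**
 * Hypothetical helper: parse the data block of an 'x' header into an
 * associative array of key => value pairs.
 */
function parsePaxRecords(string $data): array
{
    $records = [];
    $offset  = 0;
    $total   = strlen($data);

    while ($offset < $total) {
        $space = strpos($data, ' ', $offset);
        if ($space === false) {
            break; // only NUL padding (or nothing) is left
        }
        $lenField = substr($data, $offset, $space - $offset);
        if (!ctype_digit($lenField)) {
            break; // not a record, stop instead of guessing
        }
        $length = (int) $lenField;
        $prefix = $space - $offset + 1; // length digits plus the space
        if ($length <= $prefix || $offset + $length > $total) {
            break; // malformed or truncated record
        }

        // Record body without the length prefix and the trailing newline.
        $record = substr($data, $space + 1, $length - $prefix - 1);
        $eq     = strpos($record, '=');
        if ($eq !== false) {
            $records[substr($record, 0, $eq)] = substr($record, $eq + 1);
        }
        $offset += $length;
    }

    return $records;
}

// Usage sketch: $header stands in for the parsed standard header.
$header  = ['filename' => 'truncated-name'];
$records = parsePaxRecords("20 ctime=1489408528\n18 SCHILY.nlink=1\n");
if (isset($records['path'])) {
    $header['filename'] = $records['path'];
}
```

Because each record carries its own byte length, values may contain spaces, `=` signs, or multi-byte UTF-8 sequences without confusing the parser.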

The header may also contain a charset key which would give all other data (including path) in that charset. We would probably need to call iconv or mbstring on that then.

Ideally we should also create such a header when adding UTF-8 filenames.
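Creating such a header means writing records whose length prefix includes its own digits, which is the one slightly tricky part. A minimal sketch, assuming the `<length> <key>=<value>\n` record layout; `buildPaxRecord()` is a hypothetical name and not part of the library:

```php
<?php
/**
 * Hypothetical helper: build a single pax extended header record.
 * The leading decimal number is the byte length of the whole record,
 * including the number itself, the space, and the trailing newline.
 */
function buildPaxRecord(string $key, string $value): string
{
    $body   = ' ' . $key . '=' . $value . "\n";
    $length = strlen($body);

    // Adding the digits can itself add another digit (e.g. 9 -> 11),
    // so repeat until the total is consistent with its own width.
    $total = $length + strlen((string) $length);
    while ($length + strlen((string) $total) !== $total) {
        $total = $length + strlen((string) $total);
    }

    return $total . $body;
}

echo buildPaxRecord('ctime', '1489408528'); // prints "20 ctime=1489408528" plus a newline
```

Since `strlen()` counts bytes, a UTF-8 `path` value gets the correct byte length without extra handling. The records would still have to be wrapped in a header block with typeflag `x` and padded to 512 bytes.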

ovidiul added 15 commits March 1, 2017 10:53

Update FileInfo.php

67c78c3

fixing directory size to match zero on TAR archives FileInfo

Update FileInfo.php

317d5ae

FileInfo code display fix

Update FileInfo.php

127a2b7

fixed broken indentation

f9ef5b4

php unit test fix

1c725f5

php unit test fix

04b7449

reverse original pull

e306d3d

adding PAX extract support

1e8bd79

pax extract test addon

46afe61

adding pax extraction tests

38f7093

adding pax extraction tests

226ef21

adding pax extraction tests

6d89f03

adding pax extraction test archives

205707c

ovidiul mentioned this pull request Mar 13, 2017

PAX typeFlag 'x' #18

Open

code update

0423d08

splitbrain reviewed Mar 19, 2017


milux mentioned this pull request Aug 31, 2022

Long filenames are truncated dennis-eisen/CT_AutoUpdater#8

Closed
