M HYPE SPLASH
// news

How to combine / merge zip files?

By Emma Terry

For the last several months I have copied several data folders to zip files at weekly intervals. Now I'd like to combine those zip files into one zip file, because most of the contents of the existing zip files are just different versions of the same data files.

So if a file appears in more than one of the existing zip files, I'd like the newest version to be in the new zip file being created. Of course if a file appears in only one existing zip file, then I want it in the final zip file also.

I'm trying to avoid having to unzip them one by one to a working folder, overwriting data from older zip files with data from newer zip files, and then rezipping everything into a new zip file.

From what I understand pkzip would combine the zip files themselves, but is there a dependable and fast free method anyone can tell me about?

1

8 Answers

you won't like it but: unzipping everything into a working folder in the right order, then zipping the result is the most effective way.

otherwise, you will end up with a lot of wasted CPU cycles:

  • assume your result goes to 'first.zip'
  • every file from '2.zip', '3.zip' etc has to be unzipped and then zipped again into 'first.zip'
  • in '2.zip' exists a file 'foobar.txt' and in '3.zip' exists another file 'foobar.txt'. merging it the way you want to merge it leads to 'compress it X times'
  • the toc of a .zip is at the end of the file: you add more content (to the middle of the
    .zip by updating a file in the middle) and the whole file has to be rewritten

so, imho just use 'unzip' wiseley:

% mkdir all
% for x in *.zip ; do unzip -d all -o -u $x ; done
% zip -r all.zip all

the order of the unzipping is important, I don't know the pattern of your zip names, but I would extract the newest zip file first, the '-u' option of unzip overwrites only files if they are newer or creates files if not already there. as a result, you will unzip only the newest files and zip the result only once.

6

Just use the -g option of ZIP, where you can append any number of ZIP files into one (without extracting the old ones). This will save you significant time.

Also have a look at zipmerge

1

It may not be what you're looking for, but the free Ant build tool does include the ability to merge Zipfiles.

0

:

zipmerge merges the source zip archives source-zip into the target zip archive target-zip. By default, files in the source zip archives overwrite existing files of the same name in the target zip archive.

I was thinking you could script the files being extracted into a temp directory.

There is problem with this command line. I couldn't find a way to order the unzipping of archives, so an older archive may overwrite a newer archive. This problem may be overcome by using an unzipper the has a command line switch to only overwrite if newer. I mainly use 7-Zip which doesn't have such a command line option.

Also, this command needs on all the zip files being in the same directory. Not a problem if all the zips have unique names. That said, the command can be changed to fit your situation.

for /f %f in ('dir /b *.zip') do "c:\program files\7-zip\7z" x %f -oc:\testdir -r -aoa

To change this to use another unzipping program just replace "c:\program files\7-zip\7z" x %f -oc:\testdir -r -aoa with whatever command you would execute on each file. Use %f as a place holder for the name of the file you want to unzip.

I tried looking for a polished app, free or otherwise and didn't really find one.

Hopefully this will give you a good start and WinZip or something similar can take care of the overwrite problem.

Good luck.

If I remember correctly, pkzip was a command-line program.

There's still a command-line version of ZIP which claims to be compatible with pkzip.

It's called Info-ZIP and there should be a version for your OS.

3

Look for winzip command line on the net. Winzip has several versions of command line tools to fit whatever version of winzip you may may have installed. The command line tool WZZIP has a -f "freshen" option that will zip newer files only of those that match the name of a file in the summation output zip file.

Use WZunzip wraped in a FOR statement as shown above to unzip one file to a directory then WZzip -f to add those files to an output summation zip file. Then the FOR loop repeats to work on the next input file to output to the one and only summation output file. The order of the input files does not matter since WZzip -f will only add to the output file if the input data is newer than what is already in the output file. All files that do not exist in the output file will also be added. Then you may unzip the result to a folder and then zip it up again to obtain an efficiently packed result file. You can even do this automatically after the FOR loop at the end of the batch file.

You can use zipmerge.

zipmerge ./out.zip ./file1.zip ./file2.zip

It can be built from the source:

git clone --depth 1
cd libzip
cmake -S ./ -B ./build -DCMAKE_BUILD_TYPE=Release -DBUILD_TOOLS=ON -DBUILD_DOC=OFF -DBUILD_EXAMPLES=OFF -DBUILD_REGRESS=OFF -DENABLE_GNUTLS=OFF
cmake --build ./build --config Release

The zipmerge executable will appear under build/Release

Your Answer

Sign up or log in

Sign up using Google Sign up using Facebook Sign up using Email and Password

Post as a guest

By clicking “Post Your Answer”, you agree to our terms of service, privacy policy and cookie policy