Archiving and compressing files: Difference between revisions

From Alliance Doc
Jump to navigation Jump to search
(splitting page into two titles)
Tag: Removed redirect
(Marked this version for translation)
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
<languages />


<translate>
<!--T:9-->
''Parent page: [[Storage and file management]]''
''Parent page: [[Storage and file management]]''


<!--T:10-->
[https://en.wikipedia.org/wiki/Archive_file Archiving] means creating one file that contains a number of smaller files within it. Reducing the number of files by creating an archive can improve the efficiency of file storage and help you stay within [[Storage_and_file_management#Filesystem_quotas_and_policies|quota limits]].  Archiving can also improve the efficiency of [[General directives for migration|file transfers]]. It is faster for the secure copy protocol ([https://en.wikipedia.org/wiki/Secure_copy scp]), for example, to transfer one archive file of a reasonable size than thousands of small files of equal total size.
[https://en.wikipedia.org/wiki/Archive_file Archiving] means creating one file that contains a number of smaller files within it. Reducing the number of files by creating an archive can improve the efficiency of file storage and help you stay within [[Storage_and_file_management#Filesystem_quotas_and_policies|quota limits]].  Archiving can also improve the efficiency of [[General directives for migration|file transfers]]. It is faster for the secure copy protocol ([https://en.wikipedia.org/wiki/Secure_copy scp]), for example, to transfer one archive file of a reasonable size than thousands of small files of equal total size.


<!--T:11-->
[https://en.wikipedia.org/wiki/Data_compression Compressing] means encoding a file such that the same information is contained in fewer bytes of storage. The advantage for long-term data storage should be obvious. For [[General directives for migration|data transfers]], the time spent for compressing data must be balanced against the time saved moving fewer bytes as described in this discussion of [https://bluewaters.ncsa.illinois.edu/data-transfer-doc data compression and transfer] from the US National Center for Supercomputing Applications.
[https://en.wikipedia.org/wiki/Data_compression Compressing] means encoding a file such that the same information is contained in fewer bytes of storage. The advantage for long-term data storage should be obvious. For [[General directives for migration|data transfers]], the time spent for compressing data must be balanced against the time saved moving fewer bytes as described in this discussion of [https://bluewaters.ncsa.illinois.edu/data-transfer-doc data compression and transfer] from the US National Center for Supercomputing Applications.


<!--T:12-->
* The best-known tool for archiving files in the Linux community is <tt>tar</tt>.  Here is [[a tutorial on 'tar']].
* The best-known tool for archiving files in the Linux community is <tt>tar</tt>.  Here is [[a tutorial on 'tar']].
* A replacement for <tt>tar</tt> called <tt>dar</tt> offers some advantages in functionality.  Here is [[a tutorial on 'dar']].  Both <tt>tar</tt> and <tt>dar</tt> can compress files as well as archive.  
* A replacement for <tt>tar</tt> called <tt>dar</tt> offers some advantages in functionality.  Here is [[Dar|a tutorial on 'dar']].  Both <tt>tar</tt> and <tt>dar</tt> can compress files as well as archive.  
* The <tt>zip</tt> utility, more commonly used in the Windows community but available on our systems, also provides both archiving and compression.
* The <tt>zip</tt> utility, more commonly used in the Windows community but available on our clusters, also provides both archiving and compression.
* Compression tools <tt>gzip, bzip2</tt> and <tt>xz</tt> can be used in conjunction with <tt>tar</tt>, or by themselves.
* Compression tools <tt>gzip, bzip2</tt> and <tt>xz</tt> can be used in conjunction with <tt>tar</tt>, or by themselves.
</translate>

Latest revision as of 19:39, 18 July 2019

Other languages:


Parent page: Storage and file management

Archiving means creating one file that contains a number of smaller files within it. Reducing the number of files by creating an archive can improve the efficiency of file storage and help you stay within quota limits. Archiving can also improve the efficiency of file transfers. It is faster for the secure copy protocol (scp), for example, to transfer one archive file of a reasonable size than thousands of small files of equal total size.

Compressing means encoding a file such that the same information is contained in fewer bytes of storage. The advantage for long-term data storage should be obvious. For data transfers, the time spent for compressing data must be balanced against the time saved moving fewer bytes as described in this discussion of data compression and transfer from the US National Center for Supercomputing Applications.

  • The best-known tool for archiving files in the Linux community is tar. Here is a tutorial on 'tar'.
  • A replacement for tar called dar offers some advantages in functionality. Here is a tutorial on 'dar'. Both tar and dar can compress files as well as archive.
  • The zip utility, more commonly used in the Windows community but available on our clusters, also provides both archiving and compression.
  • Compression tools gzip, bzip2 and xz can be used in conjunction with tar, or by themselves.