Handling large collections of files: Difference between revisions

Handling large collections of files (view source)

Revision as of 17:51, 18 July 2019

10 bytes removed , 5 years ago

remove Draft tag

Rdickson

Bureaucrats, cc_docs_admin, cc_staff

2,879

edits

Revision as of 17:50, 18 July 2019 (view source) Rdickson (talk \| contribs) (moved squashfs and ratarmount sections to Talk page) ← Older edit		Revision as of 17:51, 18 July 2019 (view source) Rdickson (talk \| contribs) (remove Draft tag) Newer edit →
Line 1:		Line 1:
	~~{{Draft}}~~

	In certain domains, notably [[AI and Machine Learning]], it is common to have to manage very large collections of files, meaning hundreds of thousands or more. The individual files may be fairly small, e.g. less than a few hundred kilobytes. In these cases, a problem arises due to [[Storage_and_file_management#Filesystem_quotas_and_policies\|filesystem quotas]] on Compute Canada clusters that limit the number of filesystem objects. So how can a user or group of users store these necessary data sets on the cluster? In this page we will present a variety of different solutions, each with its own pros and cons, so you may judge for yourself which is an appropriate one for you.		In certain domains, notably [[AI and Machine Learning]], it is common to have to manage very large collections of files, meaning hundreds of thousands or more. The individual files may be fairly small, e.g. less than a few hundred kilobytes. In these cases, a problem arises due to [[Storage_and_file_management#Filesystem_quotas_and_policies\|filesystem quotas]] on Compute Canada clusters that limit the number of filesystem objects. So how can a user or group of users store these necessary data sets on the cluster? In this page we will present a variety of different solutions, each with its own pros and cons, so you may judge for yourself which is an appropriate one for you.

Handling large collections of files: Difference between revisions

Handling large collections of files (view source)

Revision as of 17:51, 18 July 2019

Navigation menu

Search