Globus
Globus is a fast, reliable, high-performance service for secure data movement. Designed specifically for researchers, Globus has an easy-to-use interface with background monitoring features that automate the management of file transfers between any two resources, whether they are at Compute Canada, another supercomputing facility, a campus cluster, lab server, desktop or laptop.
Globus leverages GridFTP for its transfer protocol but shields the end user from complex and time consuming tasks related to GridFTP and other aspects of data movement. It significantly improves transfer performance by auto tuning the transfer settings, thereby reducing the time spent managing transfers. It also provides other features such as automatic restarts for interrupted transfers and file integrity checking.
Globus can be accessed via the main Globus website or via the Compute Canada Globus portal at https://globus.computecanada.ca.
Accessing Globus[edit]
Globus is a third-party service used by Compute Canada. To start using Globus, you must first sign up for an account on the Globus Sign Up page.
TODO: IMAGE HERE
Once the account is set up, it can be used to transfer information to/from Compute Canada systems. Note that Globus is hosted in the Amazon Cloud and managed by the University of Chicago, so your login information, including your account name, email address, and real name, will be stored in the United States.
Using Globus[edit]
Globus can be used for moving data between any two resources, whether it is a small number of very large (even terabyte-sized) files or a very large number of small files.
To Start a Transfer[edit]
(1) Log in to http://globus.computecanada.ca, if you are not already. Select the "Transfer Files" option from the "Manage Data" menu.
(2) Select the source endpoint in the "Endpoint" field. You may browse the drop-down list or type the name into the Endpoint field to find the desired endpoint. All Compute Canada resources are prefixed with computecanada#
.
You will be prompted to activate the endpoint. This will involve entering your username and password for the system the endpoint refers to. For example, if you are activating the endpoint for Silo, which is in WestGrid, you would use your WestGrid username and password.
IMAGE HERE
Once the endpoint is activated you may browse and select the files or directories you wish to transfer. The exact method of authentication varies, depending on which site is hosting an endpoint. If you are unsure of how to proceed, check the site-specific documentation (links below):
- WestGrid Globus documentation
- Calcul Québec Globus documentation
- Globus documentation
- SciNet: Select
computecanada#gpc
and you will be redirected to a page requesting your SciNet username and password to activate the endpoint.
(3) Select the files or directories that you want to transfer, then click the blue arrow to initiate the transfer. The task will be queued, and will begin shortly. You will receive an email (at the email address you registered with your Globus account) when the transfer is complete. You can also monitor in-progress transfers and view details of completed transfers from the activity tab on the Globus site.
IMAGE HERE
Advanced Transfer Options[edit]
Globus also provides some more advanced features, including encryption and file integrity checking. To access these features, access the “more options” tab at the bottom of the transfer page. Here, you can enable advanced features including encryption, file synchronization between two hosts, and preservation of file modification times. Note that enabling encryption significantly reduces transfer performance, so it should only be used for sensitive data.
IMAGE HERE
Transferring from a Personal Computer[edit]
Globus provides a desktop client, called Globus Connect Personal, to make it easy to transfer files to and from a personal computer running Windows, MacOS X, or Linux.
To install Globus Connect Personal:
(1) Go to the Compute Canada Globus portal and log in if you have not already done so.
(2) Once you are logged in, go to the Manage Endpoints page...
IMAGE HERE
...then select “Add Globus Connect Personal”.
IMAGE HERE
(3) Enter an endpoint name, which you will use to access the computer you will be installing Globus Connect Personal on. Example: MacLaptop or WorkPC.
IMAGE HERE
(4) Click the “Generate Setup Key” button. Copy the key to your computer’s clipboard, then click the download link for your OS.
(5) Install the program. Once it is installed, run the Globus Connect Personal program.
(6) The first time you run the program, enter the site key in the box that pops up.
(7) You should now be able to access the endpoint through Globus. The full endpoint name is [your username]#[name from step 3] Example: smith#WorkPC
Further Reading[edit]
Globus sharing makes collaboration between users easy. Sharing enables people to access files stored on your account on a Compute Canada system even if the other user does not have an account on that system. Files can be shared with any user, anywhere in the world, who has a Globus account. See Sharing Files With Globus, and https://support.globus.org/entries/23602336-Sharing-Data-using-Globus.
Globus groups provide an easy way to manage permissions for sharing with multiple users. When you create a group, you can use it from the sharing interface to quickly control access to a share for multiple users simultaneously. See Globus Groups and https://support.globus.org/entries/23586412-Managing-Globus-Online-Groups-for-Data-Sharing
Support and More Information[edit]
If you would like more information on Compute Canada’s use of Globus, or require support in using this service, please send an email to globus@computecanada.ca and provide the following information:
- Name
- Compute Canada Role Identifier (CCRI)
- Institution
- Inquiry or issue. Be sure to indicate which sites you want to transfer to and from.