Sample of GridFTP logs we used in the paper "Cross-geography Scientific Data Transfer Trends and User Behavior Patterns"
Data fields
| Name | Type | Unit | Description |
|---|---|---|---|
| appname | String | N/A | GridFTP client application name |
| block_size | Integer | byte | disk blocksize used for transfer |
| buffer_size | Integer | byte | tcp buffer size used for transfer |
| end_time | Float | second | end time of transfer (POSIX time) |
| ftp_return_code | Integer | N/A | ftp result code (226 = success, 5xx = fail) |
| gftp_version | String | N/A | version string of GridFTP server |
| server_id | Integer | N/A | A unique ID based on server IP address |
| num_bytes | Integer | byte | number of bytes transferred |
| num_streams | Integer | N/A | number of parallel streams used |
| num_stripes | Integer | N/A | number of stripes used |
| start_time | Float | second | start time of transfer |
| trans_type | String | N/A | transfer command: RETR, STOR, LIST, etc |
All logs in 2017 are expected to be online around mid-February, 2018. Contact me if you cannot wait.
Since the dataset is big, we are working to make it available online via other sharing service, e.g., Data Sharing With Globus.