Large amount of data that is often stored in many thousands of files is created as part of today’s geographically distributed scientific computation and collaboration environments. Managing and transferring large volumes of data sets present a significant challenge and are often a bottleneck in the scientific computing community. In this paper, we introduce an architecture to manage data distributions in a collaborative fashion through a GridTorrent Framework (GTF) whose data transfer mechanism inspired by Bittorrent. We present performance experiment data that compares our framework to parallel TCP (PTCP) and Bittorrent. Experimental results conducted suggest that using GridTorrent for large data set has significant advantages over parallel TCP in LAN and WAN type of computer networks.
Date of creation, presentation, or exhibit
Department, Program, or Center
Computer Science (GCCIS)
GridTorrent Framework: A High-performance Data Transfer and Data Sharing Framework for Scientific Computing, Proceedings of the Grid Computing Environments (GCE) workshop. Held at the Reno Convention Center: Reno, Nevada: 11-12 November 2007.
RIT – Main Campus