Search This Site Search This Site
 
About Solutions Products Partners How To Buy Benchmarks Support Home
     
  Products  
     
InfiniBand and 10-Gigabit Ethernet for I/O in Cluster Computing

July 26-28 Cluster Symposium 2005

Neal Bierbaum, Helen Chen, Jeffrey Decker, Erik Van De Vreugde

Sandia National Laboratories

 

Outline

  • Parallel I/O in cluster computing
  • TerraGRID – the Parallel Filesystem
  • 10 GigE TOE and IB SDP
  • Testbed configuration
  • Benchmark methodology
  • Results and analysis
  • Conclusion and future work

Parallel I/O Requirements

  • Cluster computing architecture
    • Multiple nodes run single
      application in parallel
    • Global data structure
      distributed in memory of
      multiple nodes
    • Filesystem with parallel I/O
      paths and global name
      space can eliminate the
      serial I/O bottleneck
FLOP to Byte/s ratio around 500:1

TerraGRID

  • Is iSCSI-based block-level scalable I/O platform
    • Uses Shared Access Scheduling Scheme to enable Linux file system to act as a massively parallel file system
      Each initiator uses SW RAID to issue requests to all targets in parallel
    • Each target presents a file or a raw device as block container
    • All initiators share a global name space

Fully Harnesses Linux File System and Utilities

Fully Harnesses Linux File System and Utilities

 

InfiniBand

  • InfiniBand (IB)
    • Transport protocol implemented in silicon
    • High speed (2.5 to 30 Gbps) low latency (100 ns) interconnect• InfiniBand (IB)
    • Transport protocol implemented in silicon
    • High speed (2.5 to 30 Gbps) low latency (100 ns) interconnect
  • Socket Direct Protocol
    • New AF_INET protocol family that supports reliable stream sockets
    • Allows sockets applications transparent access of the hardware IB protocol stack

 

TCP Offload Engine (TOE)

  • Adapters that deliver hardware-offloaded TCP/IP protocol stacks
    • Implemented over 1 and 10 Gigabit Ethernet
    • Cooperative TCP Offload
    • Provide support for existing sockets-based applications

 

The Big Picture

The Big Picture

 

Hardware Setup

oneSIS Head Node

 

Key Software

 

Key Hardware

 

Benchmark Methodology

  • A custom Sandia test system integrates the definition, execution, and the organization of results and related information
    • XML definition files define the test environment, the test program parameters, and the scheduling of simultaneous runs across multiple hosts
    • Results of each run are reported in a series of XML, HTML and serialized compressed data files to allow easy reviewing and consistent, unambiguous searching and processing of results from a large number of test runs
    • Test programs include IOZONE, NETPERF, and a custom file system operations test
    • Remote test control processes also record system resource usage on participating hosts during each test run
  • Post processing tools convert data specific to a test type into a spreadsheet for further analysis

 

Technology Baseline – Back-to-back Netperf Throughput and Latency

 

TerraGRID Socket Connection Profile

TerraGRID Initiators Netperf Clients

 

Infrastructure Baseline – Netperf Throughput

Netperf Throughput and Client CPU

 

Infrastructure Baseline – Netperf Latency

Infrastructure Baseline – Netperf Latency

 

IOzone Aggregate Throughput

IOzone Aggregate Throughput

 

IOzone CPU Overhead

IOzone CPU Overhead

 

Summary IOzone Aggregate Throughput by Technology

Summary IOzone Aggregate Throughput by Technology

 

Summary IOzone CPU Load by Technology

Summary IOzone CPU Load by Technology

 

IOzone Work Efficiency

 

Conclusion

  • 10 GbE and TOE out performed IB and SDP for socket applications in our test environment
  • Protocol offload, TOE and SDP, offered significant performance improvement
  • Further improvement possible with RDMA and zero-copy

 

Future Plans

  • Evaluate RDMA performance through DAPL or VAPI
  • Evaluate 10 Gigabit Ethernet as a shared I/O infrastructure between large platforms
    • Distance advantage (LAN, WAN)
    • Existing technology leverage
  • Infrastructure
  • Knowledge base
  • etc.
    • Commodity pricing?!

 

10 Gigabit Ethernet Market Trend

10 Gigabit Ethernet Market Trend

 

Click here to download a pdf  version of this page
 

About | Solutions | Products | Partners | How To Buy | Support | Contact | Careers | Legal | Privacy Policy | Home |

© Copyright 2007 Chelsio Communications