A high-end core router, such as the CiscoCRS-1, consists of multiple chassis, each of which is populated with multiple line cards that in turn have multiple high-speed network connections. The router’s control plane software must configure, control, and coordinate the set of interfaces to insure that the control software remains running at all times, that faults are detected and corrected, and that forwarding remains consistent across all interfaces. A key requirement is that changes in forwarding tables propagate to all parts of the router quickly without producing transient inconsistencies (the control software must be especially careful to avoid even short-terminternal routing loops). This paper considers the application of cluster management software to a core router. Specifically, we investigate OpenMPI and the ORCM system that uses OpenMPI. After a review of basics and definition of terms, the paper considers fault tolerance and describes ORCM capabilities and limitations. It then presents measurements of latency and throughput that characterize performance and overhead. We conclude with possible extensions and future work.
Funder: Cisco Systems; National Science Foundation;
ISBN för värdpublikation: 978-1-880843-84