×

The open cloud testbed: Supporting open source cloud computing systems based on large scale high performance, dynamic network services. (English) Zbl 1188.68025

Doulamis, T. (ed.) et al., Networks for grid applications. Third international ICST conference, GridNets 2009, Athens, Greece, September 8–9, 2009. Revised selected papers. Berlin: Springer (ISBN 978-3-642-11732-9/pbk; 978-3-642-11733-6/ebook). Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering 25, 89-97 (2010).
Summary: Recently, a number of cloud platforms and services have been developed for data intensive computing, including Hadoop, Sector, CloudStore (formerly KFS), HBase, and Thrift. In order to benchmark the performance of these systems, to investigate their interoperability, and to experiment with new services based on flexible compute node and network provisioning capabilities, we have designed and implemented a large scale testbed called the Open Cloud Testbed (OCT). Currently OCT has 120 nodes in 4 data centers: Baltimore, Chicago (two locations), and San Diego. In contrast to other cloud testbeds, which are in small geographic areas and which are based on commodity Internet services, the OCT is a wide area testbed and the 4 data centers are connected with a high performance 10Gb/s network, based on a foundation of dedicated lightpaths. This testbed can address the requirements of extremely large data streams that challenge other types of distributed infrastructure. We have also developed several utilities to support the development of cloud computing systems and services, including novel node and network provisioning services, a monitoring system, and an RPC system. In this paper, we describe the OCT concepts, architecture, infrastructure, a few benchmarks that were developed for this platform, interoperability studies, and results.
For the entire collection see [Zbl 1185.68028].

MSC:

68M10 Network design and communication in computer systems
PDF BibTeX XML Cite
Full Text: DOI

References:

[1] Gu, Y., Grossman, R.: Sector and Sphere: The Design and Implementation of a High Performance Data Cloud. Theme Issue of the Philosophical Transactions of the Royal Society A: Crossing Boundaries: Computational Science, E-Science and Global E-Infrastructure 367(1897), 2429–2445 (2009)
[2] Hadoop, http://hadoop.apache.org/core/
[3] CloudStore, http://kosmosfs.sourceforge.net/
[4] Nurmi, D., Wolski, R., Grzegorczyk, C., Obertelli, G., Soman, S., Youseff, L., Zagorodnov, D.: The Eucalyptus Open-source Cloud-computing System. In: Proceedings of Cloud Computing and Its Applications, Chicago, Illinois (October 2008)
[5] Amazon EC2 and S3, http://aws.amazon.com/
[6] Thrift, http://developers.facebook.com/thrift/
[7] Ghemawat, S., Gobioff, H., Leung, S.-T.: The Google File System, pub. In: 19th ACM Symposium on Operating Systems Principles, Lake George, NY (October 2003)
[8] Dean, J., Ghemawat, S.: MapReduce: Simplified Data Processing on Large Clusters. In: OSDI 2004: Sixth Symposium on Operating System Design and Implementation, San Francisco, CA (December 2004)
[9] HP Technical Report, HP-2009-122: Open Cirrus Cloud Computing Testbed: Federated Data Centers for Open Source Systems and Services Research
[10] Provos, N., McNamee, D., Mavrommatis, P., Wang, K., Modadugu, N.: The Ghost In The Browser - Analysis of Web-based Malware. In: Proceedings of the First Workshop on Hot Topics in Understanding Botnets, HotBots (2007)
[11] OCT Monitor, http://angle.ncdm.uic.edu/simnetup/
[12] Gu, Y., Grossman, R.L.: UDT: UDP-based Data Transfer for High-Speed Wide Area Networks. Computer Networks 51(7) (May 2007) · Zbl 1114.68309
[13] Travestino, F., Mambretti, J., Karmous-Edwards, G.: GridNetworks: Enabling Grids With Advanced Communication Services. Wiley, Chichester (2006)
[14] Bennett, C., Grossman, R., Seidman, J.: Open Cloud Consortium Technical Report TR-09-01, MalStone: A Benchmark for Data Intensive Computing (April 2009)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.