Xi, Bowei; Chen, Hui; Cleveland, William S.; Telkamp, Thomas Statistical analysis and modeling of Internet VoIP traffic for network engineering. (English) Zbl 1329.62480 Electron. J. Stat. 4, 58-116 (2010). Summary: Network engineering for quality-of-service (QoS) of Internet voice communication (VoIP) can benefit substantially from simulation study of the VoIP packet traffic on a network of routers. This requires accurate statistical models for the packet arrivals to the network from a gateway. The arrival point process is the superposition, or statistical multiplexing, of the arrival processes of packets of individual calls. The packets of each call form a transient point process with on-intervals of transmission and off-intervals of silence. This article presents the development and validation of models for the multiplexed process based on statistical analyses of VoIP traffic from the Global Crossing (GBLX) international network: 48 hr of VoIP arrival times and headers of 1.315 billion packets from 332018 calls. Statistical models and methods involve point processes and their superposition; time series autocorrelations and power spectra; long-range dependence; random effects and hierarchical modeling; bootstrapping; robust estimation; modeling independence and identical distribution; and visualization methods for model building. The result is two models validated by the analyses that can generate accurate synthetic multiplexed packet traffic. One is a semi-empirical model: empirical data are a part of the model. The second is a mathematical model: the components are parametric statistical models. This is the first comprehensive modeling of VoIP traffic based on data from a service provider carrying a full range of VoIP applications. The models can be used for simulation of any IP network architecture, wireline or wireless, because the modeling is for the IP-inbound traffic to an IP network. This is achieved because the GBLX data, collected on an IP link, are very close to their properties when they entered the GBLX network. Cited in 4 Documents MSC: 62P30 Applications of statistics in engineering and industry; control charts 62-07 Data analysis (statistics) (MSC2010) 62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH) 62M15 Inference from stochastic processes and spectral analysis 90B20 Traffic problems in operations research Keywords:statistical model building; very large datasets; semi-empirical models; long-range dependence Software:RHIPE; longmemo; TCPDUMP × Cite Format Result Cite Review PDF Full Text: DOI Euclid References: [1] Agrawal, S., Narayan, P. P. S., Ramamirtham, J., Rastogi, R., Smith, M., Swanson, K., and Thottan, M. Voip service quality monitoring using active and passive probes. In, First International Conference on Communication System Software and Middleware (Comsware 2006) , pages 1-10, 2006. [2] Agrawal, S., Kanthi, C. N., Naidu, K. V. M., Ramamirtham, J., Rastogi, R., Satkin, S., and Srinivasan, A. Monitoring infrastructure for converged networks and services., Bell Labs Technical Journal , 12:63-77, 2007. [3] Andrews, D. F., Bickel, P. J., Hampel, F. R., Huber, P. J., Rogers, W. H., and Tukey, J. W., Robust Estimates of Location: Survey and Advances . Princeton University Press, 1972. · Zbl 0254.62001 [4] Anscombe, F. J. Graphs in Statistical Analysis., American Statistician , 27:17-21, 1973. [5] Arlos, P. and Fiedler, M. A comparison of measurement accuracy for DAG, tcpdump and windump. www.its.bth.se/staff/pca/, 2003. verified August, 2009. [6] Avaya ExpertNet. Avaya ExpertNet voip assessment tool., www.avaya.com. [7] Babu, T. V. J. G. and Hayes, J. F., Modeling and Analysis of Telecommunications Networks . John Wiley & Sons, 2004. ISBN 9780471348450. [8] Barford, P. and Crovella, M. Generating representative web workloads for network and server performance evaluation. In, Proceedings of the 1998 ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems , pages 151-160, 1998. [9] Baset, S. A. and Schulzrinne, H. G. An analysis of the Skype peer-to-peer Internet telephony protocol. In, Proceedings of the 25th IEEE International Conference on Computer Communications (INFOCOM 2006) , pages 1-11, 2006. [10] Belottia, P., Caponeb, A., Carellob, G., and Malucelli, F. Multi-layer mpls network design: The impact of statistical multiplexing., Computer Networks , 52 :1291-1307, 2008. · Zbl 1138.68011 · doi:10.1016/j.comnet.2008.01.005 [11] Beran, J., Statistics for Long-Memory Processes . Chapman & Hall, 1994. ISBN 0412049015. · Zbl 0869.60045 [12] Biernacki, A. VoIP source model based on the hyperexponential distribution., Proceedings of World Academy of Science, Engineering and Technology , 11:202-206, 2006. [13] Birke, R., Mellia, M., Petracca, M., and Rossi, D. Understanding VoIP from backbone measurements. In, INFOCOM 2007: The 26th IEEE International Conference on Computer Communications. , pages 2027 - 2035, 2007. [14] Bolotin, V. A. Modeling call holding time distributions for CCS network design and performance analysis., IEEE Journal on Selected Areas in Communications , 12:433-438, 1994. [15] Box, G. E. P. Science and statistics., Journal of the American Statistical Association , 71:791-799, 1976. · Zbl 0335.62002 · doi:10.1080/01621459.1976.10480949 [16] Brady, P. T. A statistical analysis of on-off patterns in 16 conversations., Bell System Technical Journal , 47:73-91, January 1968. [17] Brady, P. T. A model for generating on-off speech patterns in two-way conversation., Bell System Technical Journal , 48 :2445-2472, September 1969. [18] Brown, L., Gans, N., Mandelbaum, A., Sakov, A., Shen, H., Zeltyn, S., and Zhao, L. Statistical analysis of a telephone call center: A queueing-science perspective., Journal of the American Statistical Association , 100:36-50, 2005. · Zbl 1117.62303 · doi:10.1198/016214504000001808 [19] Cao, J., Cleveland, W. S., Lin, D., and Sun, D. X. On the nonstationarity of Internet traffic. In, Proceedings of the 2001 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems , pages 102-112, 2001. [20] Cao, J., Cleveland, W. S., Lin, D., and Sun, D. X. Internet traffic tends toward Poisson and independent as the load increases. In, Nonlinear Estimation and Classification , pages 83-109. Springer, 2002. · Zbl 1320.62228 · doi:10.1007/978-0-387-21579-2_6 [21] Casilari, E., Montes, H., and Sandoval, F. Modelling of voice traffic over IP networks. In, Third International Symposium on Communications Systems Networks and Digital Signal Processing (CSNDSP 2002) , pages 411-414, Staffordshire, UK, 2002. [22] Choe, J. and Shroff, N. B. Queueing analysis of high-speed multiplexers including long-range dependent arrival processes. In, Proceedings of the 18th IEEE International Conference on Computer Communications (INFOCOM) , pages 617-624, 1999. [23] Ciullo, D., Mellia, M., and Meo, M. Traditional IP measurements: What changes in a today multimedia IP network. In, Telecommunication Networking Workshop on QoS in Multiservice IP Networks , pages 262-267. IT-NEWS 2008, 2008. [24] Cleveland, W. S. and Devlin, S. J. Locally-weighted fitting: An approach to fitting analysis by local fitting., Journal of the American Statistical Association , 83:596-610, 1988. · Zbl 1248.62054 [25] Cox, D. R. Long-range dependence: A review. In, Statistics: An Appraisal , pages 55-74. The Iowa State University Press, 1984. [26] Cox, D. R. and Isham, V., Point Processes . Chapman & Hall, 1992. ISBN 0412219107. [27] Crovella, M. E. and Bestavros, A. Self-similarity in World Wide Web traffic: Evidence and possible causes., IEEE/ACM Transactions on Networking , 5:835-846, 1997. [28] Dang, T. D., Sonkoly, B., and Molnar, S. Fractal analysis and modeling of VoIP traffic. In, 11th International Telecommunications Network Strategy and Planning Symposium , pages 217-222, Vienna, Austria, 2004. [29] Daniel, C. and Wood, F., Fitting Equations to Data . Wiley, New York, 1971. · Zbl 0264.65011 [30] de Pereira, F. M., da Fonseca, N. L. S., and Arantes, D. S. On the performance of generalized processor sharing servers under long-range dependent traffic., Computer Networks , 40:413-431, 2002. [31] Erramilli, A., Narayan, O., and Willinger, W. Experimental queueing analysis with long-range dependent packet traffic., IEEE/ACM Transactions on Networking , 4:209-223, 1996. [32] Fraleigh, C., Moon, S., Lyles, B., Cotton, C., Khan, M., Moll, D., Rockell, R., Seely, T., and Diot, C. Packet-level traffic measurements from the Sprint IP backbone., IEEE Network , 17:6-16, 2003a. [33] Chuck Fraleigh, Fouad Tobagi, and Christophe Diot. Provisioning ip backbone networks to support latency sensitive traffic. In, Proceedings of the 22nd IEEE International Conference on Computer Communications (INFOCOM) , 2003b. [34] Gruber, J. A comparison of measured and calculated speech temporal parameters relevant to speech activity detection., IEEE Transactions on Communications , COM-30:728-738, 1982. [35] Guha, S., Hafen, R. P., Kidwell, P., and Cleveland, W. S. Visualization databases for the analysis of large complex datasets., Journal of Machine Learning Research , 5:193-200, 2009. [36] Guha, S. RHIPE: The R and Hadoop integrated programming environment., http://ml.stat.purdue.edu/rhipe. [37] Hafen, R. P. and Cleveland, W. S. The ed method for nonparametric density estimation and diagnostic checking. Technical report, Department of Statistics, Purdue University, 2009. [38] Hosking, J. R. M. Fractional differencing., Biometrika , 68:165-176, 1981. · Zbl 0464.62088 · doi:10.1093/biomet/68.1.165 [39] Jiang, W. and Schulzrinne, H. Analysis of on-off patterns in VoIP and their effect on voice traffic aggregation. In, Proceedings of the Ninth IEEE International Conference on Computer Communication and Network , pages 82-87, Las Vegas, Nevada, October 2000. [40] Karapantazis, S. and Pavlidou, F.-N. VoIP: A comprehensive survey on a promising technology., Computer Networks , 53 :2050-2090, 2009. [41] Kesidis, G., An Introduction to Communication Network Analysis . Wiley-IEEE Press, 2007. [42] Lee, H. H. and Un, C. K. A study of on-off characteristics of conversational speech., IEEE Transactions on Communications , COM-34:630-637, 1986. [43] Leland, W. E., Taqqu, M. S., Willinger, W., and Wilson, D. V. On the self-similar nature of Ethernet traffic (extended version)., IEEE/ACM Transactions on Networking , 2:1-15, 1994. [44] Markopoulou, A., Tobagi, F. A., and Karam, M. J. Assessing the quality of voice communications over Internet backbones., IEEE/ACM Transactions on Networking , 11:747-760, 2003. [45] Massoulie, L. and Simonian, A. Large buffer asymptotics for the queue with fbm input., Journal of Applied Probability , 36:894-906, 1999. · Zbl 0955.60096 · doi:10.1239/jap/1032374642 [46] Norwine, A. C. and Murphy, O. J. Characteristic time intervals in telephone conversation., Bell System Technical Journal , 17:281-291, 1938. [47] Paxson, V. and Floyd, S. Wide-area traffic: The failure of Poisson modeling., IEEE/ACM Transactions on Networking , 3:226-244, 1995. [48] Peterson, L. L. and Davie, B. S., Computer Networks: A Systems Approach . Morgan Kaufmann, 1999. · Zbl 1005.68004 [49] Riedi, R. H., Crouse, M. S., Ribeiro, V. J., and Baraniuk, R. G. A multifractal wavelet model with application to network traffic., IEEE Transactions on Information Theory , 45:992 -1018, 1999. · Zbl 0947.94003 · doi:10.1109/18.761337 [50] Rolls, D. A., Michailidis, G., and Hernández-Campos, F. Queueing analysis of network traffic: Methodology and visualization tools., Computer Networks , 48(3):447-473, 2005. [51] Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M., and Schooler, E. SIP: Session initiation protocol. IETF RFC 3261, June, 2002. [52] Schulzrinne, H., Casner, S., Frederick, R., and Jacobson, V. RTP: A transport protocol for real-time applications. IETF RFC 3550, July, 2003. [53] Shin, S. and Schulzrinne, H. Measurement and analysis of the voip capacity in ieee 802.11 wlan., IEEE Transactions on Mobile Computing , 8 :1265-1279, 2009. [54] Suh, K., Figueiredo, D. R., Kurose, J. F., and Towsley, D. F. Characterizing and detecting Skype-relayed traffic. In, Proceedings of the 25th IEEE International Conference on Computer Communications (INFOCOM 2006) , pages 1-12, 2006. [55] Tobagi, F. A., Markopoulou, A. P., and Karam, M. J. Is the internet ready for voip? In, Proceedings of Distributed Computing, Mobile and Wireless Computing 4th International Workshop (IWDC) , pages 49-57. Microsoft Press, 2002. [56] Toral-Cruz, H. and Torres-Roman, D. Traffic analysis for ip telephony. In, Proceedings of the 2nd International Conference on Electrical and Electronics Engineering , 2005. [57] Willinger, W., Taqqu, M. S., Sherman, R., and Wilson, D. V. Self-similarity through high-variability: Statistical analysis of Ethernet LAN traffic at the source level., IEEE/ACM Transactions on Networking , 5:71-86, 1997. [58] Zhang, G., Xie, G., Yang, J., Zhang, D., and Zhang, D. Self-similar characteristic of traffic in current metro area network. In, Proceedings of the 15th IEEE Workshop on Local and Metropolitan Area Networks , pages 176-181, Princeton, New Jersey, USA, 2007. This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.