da Silva, Natalia; Alvarez-Castro, Ignacio Clicks and cliques: exploring the soul of the community. (English) Zbl 1505.62116 Comput. Stat. 34, No. 4, 1537-1563 (2019). Summary: In this paper we analyze 26 communities across the United States with the objective to understand what attaches people to their community and how this attachment differs among communities. How different are attached people from unattached? What attaches people to their community? How different are the communities? What are key drivers behind emotional attachment? To address these questions, graphical, supervised and unsupervised learning tools were used and information from the Census Bureau and the Knight Foundation were combined. Using the same pre-processed variables as F. Knight [Soul of the community. Techn. Rep. (2010)] most likely will drive the results towards the same conclusions than the Knight foundation, so this paper does not use those variables. Cited in 1 Document MSC: 62-08 Computational methods for problems pertaining to statistics Keywords:community attachment; exploratory data analysis; random forests; statistical visualization; mosaic plot; product plot Software:knitr; tidyverse; randomForest; ggdendro; carData; GGally; productplots; xtable; reldist; ggplot2; R; car; Survey PDFBibTeX XMLCite \textit{N. da Silva} and \textit{I. Alvarez-Castro}, Comput. Stat. 34, No. 4, 1537--1563 (2019; Zbl 1505.62116) Full Text: DOI arXiv References: [1] Atkinson AB, Bourguignon F (2000) Handbook of income distribution, vol 1. Elsevier, Amsterdam [2] Breiman L (2001) Random forests. Mach Learn 45(1):5-32 · Zbl 1007.68152 [3] Dahl DB (2016) xtable: export tables to LaTeX or HTML. R package version 1.8-2 [4] de Vries A, Ripley BD (2016) ggdendro: create dendrograms and tree diagrams using ‘ggplot2’. R package version 0.1-20 [5] Fox J, Weisberg S (2011) An R companion to applied regression, 2nd edn. Sage, Thousand Oaks [6] Friendly M (1994) Mosaic displays for multi-way contingency tables. J Am Stat Assoc 89(425):190-200 [7] Handcock MS (2016) Relative distribution methods. Los Angeles, CA. Version 1.6-6. Project home page at http://www.stat.ucla.edu/ handcock/RelDist [8] Hartigan JA, Kleiner B (1981) Mosaics for contingency tables. In: Computer science and statistics: proceedings of the 13th symposium on the interface. Springer, pp 268-273 [9] Hofmann H, Wickham H, Cook D (2019) The 2013 data expo of the American Statistical Association. Comput Stat 25:551-554 [10] Hummel J (1996) Linked bar charts: analysing categorical data graphically. Comput Stat 11(1):23-33 · Zbl 0938.62002 [11] Inselberg A (2009) Parallel coordinates: visual multidimensional geometry and its applications. Springer, Berlin, Heidelberg · Zbl 1183.68662 [12] Izenman A (2008) Modern multivariate statistical techniques: regression, classification, and manifold learning, 1st edn. Springer. ISBN:0387781889 · Zbl 1155.62040 [13] Knight F (2010) Soul of the community. Technical report. https://knightfoundation.org/sotc/overall-findings/ [14] Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18-22 [15] Lumley T (2018) Survey: analysis of complex survey samples. R package version 3.34 [16] MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1. Oakland, CA, USA, pp 281-297 · Zbl 0214.46201 [17] R Core Team (2018) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna [18] Rushton M (2008) A note on the use and misuse of the racial diversity index. Policy Stud J 36(3):445-459 [19] Schloerke B, Crowley J, Cook D, Briatte F, Marbach M, Thoen E, Elberg A, Larmarange J (2018) GGally: extension to ’ggplot2’. R package version 1.4.0 [20] U.S. Census Bureau (2011) 2011 American community survey 5-year estimates. Tables B01001, B01003, B02001, B05005, B08202, B19001, B19301, B23006, B25026. http://factfinder2.census.gov [21] Wickham H (2017) tidyverse: easily install and load the ‘Tidyverse’. R package version 1.2.1 [22] Wickham H, Hofmann H (2011) Product plots. IEEE Trans Vis Comput Graph 17(12):2223-2230 [23] Wickham H, Hofmann H (2016) productplots: product plots for R. R package version 0.1.1 [24] Xie Y (2018) knitr: a general-purpose package for dynamic report generation in R. R package version 1.20 This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.