×

MyPYTHIA: a recommendation portal for scientific software and services. (English) Zbl 1007.68617

Summary: We outline the design of a recommendation system (MyPYTHIA) implemented as a Web portal. MyPYTHIA’s design objectives include evaluating the quality and performance of scientific software on Grid platforms, creating knowledge about which software and computational services should be selected for solving particular problems, selecting parameters of software (or of computational services) based on user-specified computational objectives, providing access to performance data and knowledge bases over the Web and enabling recommendations for targeted application domains. MyPYTHIA uses a combination of statistical analysis, pattern extraction techniques and a database of software performance to map feature-based representations of problem instances to appropriate software. MyPYTHIA’s open architecture allows the user to customize it for conducting individual case studies. We describe the architecture as well as several scientific domains of knowledge enabled by such case studies.

MSC:

68U99 Computing methodologies and applications
68N15 Theory of programming languages
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Casanova, International Journal of Supercomputer Applications and High Performance Computing 11 pp 212– (1997)
[2] Chandy, IEEE Computational Science and Engineering 3 pp 43– (1996)
[3] Foster, International Journal of Supercomputer Applications and High Performance Computing 11 pp 115– (1997)
[4] The Grid: Blueprint for a new computing infrastructure. Morgan Kaufmann: San Mateo, CA, 1998.
[5] Grimshaw, Communications of the ACM 41 pp 46– (1998)
[6] Grimshaw, Journal of Parallel and Distributed Computing 21 pp 257– (1994)
[7] Houstis, ACM Transactions on Mathematical Software 26 pp 227– (2000)
[8] Recommender systems for problem solving environments. PhD Thesis, Department of Computer Sciences, Purdue University, 1997.
[9] Ramakrishnan, ACM Transactions on Mathematical Software 26 pp 254– (2000)
[10] Knowledge discovery in scientific databases. PhD Thesis, Department of Computer Sciences, Purdue University, 1999.
[11] Mining the performance of complex systems. Proceedings IEEE International Conference on Information Intelligence and Systems. IEEE Computer Society Press, 1999; 606-612. · Zbl 0973.46017
[12] Verykios, Journal of Neural, Parallel and Scientific Computations 8 pp 115– (2000)
[13] Verykios, IEEE Transactions on Systems, Man and Cybernetics: Part A 31 pp 573– (2001)
[14] Weerawarana, ACM Transactions on Mathematical Software 22 pp 447– (1996)
[15] The linear system analyzer. Enabling Technologies for Computational Science, (eds.). Kluwer: Dordrecht, 2000; 123-134.
[16] Markus, IEEE Computational Science and Engineering 4 pp 43– (1997)
[17] Ferris, ACM Transactions on Mathematical Software 26 pp 1– (2000)
[18] Rice, Journal of the ACM 22 pp 61– (1973)
[19] Rice, Advances in Computers 15 pp 65– (1976) · Zbl 0357.68002
[20] Addison, ACM Transactions on Mathematical Software 17 pp 1– (1991)
[21] The development and implementation of a performance database server. Technical Report CS-93-195, University of Tennessee, August 1993.
[22] A data mining environment for modeling the performance of scientific software. Enabling Technologies for Computational Science: Frameworks, Middleware and Environments, ch. 21. Kluwer: Dordrecht, 2000; 261-271.
[23] Apache Webserver. http://www.apache.org/ [2001].
[24] PHP. http://www.php.net/ [2001].
[25] MySQL. http://www.mysql.com/ [2001].
[26] PhpMyAdmin. http://www.phpwizard.net/projects/phpMyAdmin/ [2001].
[27] Nonparametric Statistical Methods. John Wiley & Sons, 1973. · Zbl 0277.62030
[28] Wrappers for performance enhancement and oblivious decision graphs. PhD Thesis, Department of Computer Science, Stanford University, 1995.
[29] Bratko, Communications of the ACM 38 pp 65– (1995)
[30] Neural networks. The Computer Science and Engineering Handbook, (ed.). CRC Press: Boca Raton, FL, 1997; 536-556.
[31] Joshi, IEEE Computational Science and Engineering 3 pp 44– (1996)
[32] Quinlan, Machine Learning 1 pp 81– (1986)
[33] C4.5: Programs for Machine Learning. Morgan Kaufmann: San Mateo, CA, 1993.
[34] Aha, Machine Learning 6 pp 37– (1991)
[35] The NIST guide to available mathematical software. http://gams.nist.gov [1996].
[36] Ramakrishnan, Advances in Engineering Software 33 pp 27– (2001)
[37] Rice, Mathematics of Computation 36 pp 475– (1981)
[38] Houstis, International Journal for Numerical Methods in Engineering 18 pp 737– (1982)
[39] Adve, IEEE Transactions on Software Engineering 26 pp 1027– (2000)
[40] Koch, Transactions of the American Nuclear Society 65 pp 198– (1992)
[41] Foster, IEEE Computer 32 pp 36– (1999) · Zbl 05091577
[42] Karin, Communications of the ACM 41 pp 32– (1998)
[43] Data-intensive computing. The Grid: Blueprint for a New Computing Infrastructure, (eds.). Morgan Kaufmann: San Mateo, CA, 1998; 105-129.
[44] Moore, Communications of the ACM 41 pp 56– (1998)
[45] Ramakrishnan, Advances in Computers 55 pp 119– (2001)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.