×

A survey of commonsense knowledge acquisition. (English) Zbl 1296.68167

Summary: Collecting massive commonsense knowledge (CSK) for commonsense reasoning has been a long time standing challenge within artificial intelligence research. Numerous methods and systems for acquiring CSK have been developed to overcome the knowledge acquisition bottleneck. Although some specific commonsense reasoning tasks have been presented to allow researchers to measure and compare the performance of their CSK systems, we compare them at a higher level from the following aspects: CSK acquisition task (what CSK is acquired from where), technique used (how can CSK be acquired), and CSK evaluation methods (how to evaluate the acquired CSK). In this survey, we first present a categorization of CSK acquisition systems and the great challenges in the field. Then, we review and compare the CSK acquisition systems in detail. Finally, we conclude the current progress in this field and explore some promising future research issues.

MSC:

68T30 Knowledge representation
68-02 Research exposition (monographs, survey articles) pertaining to computer science
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Minsky M. The Emotion Machine. Simon & Schuster New York, 2006.
[2] McCarthy J (1984) Some expert systems need common sense. Annals of the New York Academy of Sciences 426:129–137 · doi:10.1111/j.1749-6632.1984.tb16516.x
[3] Lenat D, Prakash M, Shepherd M (1986) CYC: Using common sense knowledge to overcome brittleness and knowledge acquisition bottlenecks. AI Magazine 6(4):65–85
[4] Lieberman H, Liu H, Singh P, Barry B (2004) Beating common sense into interactive applications. AI Magazine 25(4):63–76
[5] Curtis J, Cabral J, Baxter D. On the application of the Cyc ontology to word sense disambiguation. In Proc. the 19th Int. Florida AI Research Society Conf., May 2006, pp.652–657.
[6] Dahlgren K, McDowell J. Using commonsense knowledge to disambiguate prepositional phrase modifiers. In Proc. the 5th National Conf. Arti\={}cial Intelligence, Aug. 1986, pp.589–593.
[7] Havasi C, Speer R, Pustejovsky J. Coarse word-sense disambiguation using common sense. In Proc. the AAAI Fall Symposium Series, Nov. 2010.
[8] Cambria E, Hussain A, Havasi C, Eckl C. Affectivespace: Blending common sense and affective knowledge to perform emotive reasoning. In Proc. the 1st WOMSA at CAEPIA 2009, Nov. 2009, pp.32–41.
[9] Cambria E, Hussain A, Havasi C, Eckl C. Sentic computing: Exploitation of common sense for the development of emotion-sensitive systems. In Lecture Notes in Computer Science 5967, Hutchison D, Kanade T, Kittler J et al. (eds.), 2010, pp.148–156.
[10] Curtis J, Matthews G, Baxter D. On the effective use of Cyc in a question answering system. In Proc. IJCAI Workshop. Knowledge and Reasoning for Answering Questions, Aug. 2005.
[11] Liu H, Singh P. Makebelieve: Using commonsense knowledge to generate stories. In Proc. the 18th National Conf. Artificial Intelligence, Jul.28-Aug.1, 2002, pp.957–958.
[12] Ong E. A commonsense knowledge base for generating children’s stories. In Proc. the AAAI Fall Symposium Series on Common Sense Knowledge, Nov. 2010, pp.82–87.
[13] Liu H, Lieberman H, Selker T. Goose: A goal-oriented search engine with commonsense. In Proc. the 2nd Int. Conf. Adaptive Hypermedia and Adaptive Web-Based Systems, Jun. 2006, pp.253–263. · Zbl 1047.68858
[14] Hsu M, Chen H. Information retrieval with commonsense knowledge. In Proc. the 29th SIGIR 2006, Aug. 2006, pp.651–652.
[15] Nilsson N. Artificial Intelligence: A New Synthesis. Morgan Kaufmann, 1998. · Zbl 1012.68605
[16] Gupta R, Kochenderfer M. Common sense data acquisition for indoor mobile robots. In Proc. the 19th National Conf. Artificial Intelligence, Jul. 2004, pp.605–610.
[17] McCarthy J. Programs with common sense. In Proc. the Teddington Conf. the Mechanization of Thought Processes, Dec. 1958.
[18] Lenat DB (1995) CYC: A large-scale investment in knowledge infrastructure. Communications of the ACM 38(11):33–38 · Zbl 01936346 · doi:10.1145/219717.219745
[19] Singh P, Lin T, Mueller E, Lim G, Perkins T, Zhu W. Open mind common sense: Knowledge acquisition from the general public. In Proc. Conf. Cooperative Information Systems, Oct.30-Nov.1 2002, pp.1223–1237. · Zbl 1018.68815
[20] Dong Z, Dong Q. HowNet and the Computation of Meaning. Singapore: World Scientific Publishing Company, 2006.
[21] Liu H, Singh P (2004) Conceptnet–A practical commonsense reasoning tool-kit. BT Technology Journal 22(4):211–226 · doi:10.1023/B:BTTJ.0000047600.45421.6d
[22] Speer R, Havasi C, Lieberman H. AnalogySpace: Reducing the dimensionality of common sense knowledge. In Proc. the 23rd AAAI, Jul. 2008, pp.548–553.
[23] Schubert L. Can we derive general world knowledge from texts? In Proc. the 2nd Int. Conf. Human Language Technology Research, Mar. 2002, pp.94–97.
[24] Torisawa K. An unsupervised learning method for common-sensical inference rules on events. In Proc. the 2nd CoLogNet-EIsNET Symposium, Dec. 2003.
[25] Torisawa K. Acquiring inference rules with temporal constraints by using Japanese coordinated sentences and nounverb co-occurrences. In Proc. the Human Language Technology Conf. the North American Chapter of the Association of Computational Linguistics (HLT/NAACL), Jun. 2006, pp.57–64.
[26] Chklovski T. Learner: A system for acquiring commonsense knowledge by analogy. In Proc. the 2nd Int. Conf. Knowledge Capture, Oct. 2003, pp.4–12.
[27] Witbrock M, Matuszek C, Brusseau A, Kahlert R, Fraser C, Lenat D. Knowledge begets knowledge: Steps towards assisted knowledge acquisition in Cyc. In Proc. the AAAI Spring Symposium on Knowledge Collection from Volunteer Contributors, Mar. 2005, pp.99–105.
[28] von Ahn L, Kedia M, Blum M. Verbosity: A game for collecting common-sense facts. In Proc. the ACM SIGCHI Conf. Human Factors in Computing Systems, Apr. 2006, pp.75–78.
[29] Lieberman H, Smith D, Teeters A. Common consensus: A web-based game for collecting commonsense goals. In Proc. Int. Conf. Intelligent User Interfaces, Jan. 2007.
[30] Speer R, Krishnamurthy J, Havasi C, Smith D, Lieberman H, Arnold K. An interface for targeted collection of common sense knowledge using a mixture model. In Proc. the 14th Int. Conf. Intelligent User Interfaces, Feb. 2009, pp.137–146.
[31] Kuo Y, Lee J, Chiang K, Wang R, Shen E, Chan C, Hsu J. Community-based game design: Experiments on social games for commonsense data collection. In Proc. the ACM SIGKDD Workshop on Human Computation, Jun. 2009, pp.15–22.
[32] Kuo Y, Hsu J. Goal-oriented knowledge collection. In Proc. the AAAI Fall Symposium Series, Nov. 2010.
[33] Banko M, Cafarella M, Soderland S, Broadhead M, Etzioni O. Open information extraction from the web. In Proc. the 20th IJCAI, Jan. 2007, pp. 2670–2676.
[34] Etzioni O, Fader A, Christensen J, Soderland S, Center M. Open information extraction: The second generation. In Proc. the 22th Int. Joint Conf. Artificial Intelligence, Jul. 2011, pp.3–10.
[35] Fader A, Soderland S, Etzioni O. Identifying relations for open information extraction. In Proc. the Conf. Empirical Methods in Natural Language Processing, Jul. 2011, pp.1535–1545.
[36] Mausam J C, Soderland S, Etzioni O. Learning arguments for open information extraction. Technical Report, University of Washington, 2011.
[37] Matuszek C, Witbrock M, Kahlert R, Cabral J, Schneider D, Shah P, Lenat D. Searching for common sense: Populating CycTM from the Web. In Proc. the National Conf. Artificial Intelligence, Jul. 2005, pp.1430–1435.
[38] Forbus K, Riesbeck C, Birnbaum L, Livingston K, Sharma A, Ureel L. Integrating natural language, knowledge representation and reasoning, and analogical processing to learn by reading. In Proc. the 22nd National Conf. Artificial Intelligence, Jul. 2007, pp.1542–1547.
[39] Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z. DBpedia: A nucleus for a web of open data. In Proc. the ISWC 2007/ASWC2007, Nov. 2007, pp.722–735.
[40] Bizer C, Lehmann J, Kobilarov G, Auer S, Becker C, Cyganiak R, Hellmann S (2009) DBpedia: A crystallization point for the web of data. Web Semantics: Science, Services and Agents on the World Wide Web 7(3):154–165 · Zbl 05738204 · doi:10.1016/j.websem.2009.07.002
[41] Suchanek F M, Kasneci G, Weikum G. YAGO: A core of semantic knowledge. In Proc. the 16th Int. Conf. World Wide Web, May 2007, pp.697–706.
[42] Hoffart J, Suchanek FM, Berberich K, Weikum G (2013) YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intelligence 194:28–61 · Zbl 1270.68303 · doi:10.1016/j.artint.2012.06.001
[43] Niu X, Sun X, Wang H, Rong S, Qi G, Yu Y. Zhishi.me: Weaving Chinese linking open data. In Proc. the 10th Int. Conf. the Semantic Web, Oct. 2011, pp.205–220.
[44] Wang Z C, Wang Z G, Li J Z, Pan J Z. Knowledge extraction from Chinese wiki encyclopedias. Journal of Zhejiang University|Science C, 2012, 13(4): 268–280.
[45] Zeng Y. Extracting, linking and analyzing the Web of structured Chinese data. Technical Report, Institute of Automation, Chinese Academy of Sciences, 2012.
[46] Singh P. The public acquisition of commonsense knowledge. In Proc. the AAAI Spring Symposium on Acquiring (and Using) Linguistic (and World) Knowledge for Information Access, Mar. 2002.
[47] Adomavicius G, Tuzhilin A (2005) Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6):734–749 · Zbl 05109798 · doi:10.1109/TKDE.2005.99
[48] Schubert L, Tong M. Extracting and evaluating general world knowledge from the Brown corpus. In Proc. the HLT-NAACL 2003 Workshop on Text Meaning, Vol.9, May 2003, pp.7–13.
[49] Zhu Y, Zang L, Wang D, Cao C. A manual experiment on commonsense knowledge acquisition from web corpora. In Proc. the Int. Conf. Machine Learning and Cybernetics, Vol.3, Jul. 2008, pp.1564–1569.
[50] Vanderwende L. Volunteers created the web. In Proc. AAAI Spring Symposium on Knowledge Collection from Volunteer Cotnributors, Mar. 2005.
[51] Eslick I. Searching for commonsense [Master Thesis]. Massachusetts Institute of Technology, 2006.
[52] Etzioni O, Cafarella M, Downey D, Kok S, Popescu A, Shaked T, Soderland S, Weld D, Yates A. Web-scale information extraction in KnowItAll: (Preliminary results). In Proc. the 13th Int. Conf. World Wide Web, May 2004, pp.100–110.
[53] Etzioni O, Cafarella M, Downey D, Popescu A, Shaked T, Soderland S, Weld D, Yates A (2005) Unsupervised named-entity extraction from the Web: An experimental study. Artificial Intelligence 165(1):91–134 · doi:10.1016/j.artint.2005.03.001
[54] Sharma A, Forbus K. Graph-based reasoning and reinforcement learning for improving Q/A performance in large knowledge-based systems. In Proc. the AAAI Fall Symposium Series, Nov. 2010.
[55] Liu H, Lieberman H, Selker T. A model of textual affect sensing using real-world knowledge. In Proc. the 8th Int. Conf. Intelligent User Interfaces, Jan. 2003, pp.125–132.
[56] Gordon A S, Kozareva Z, Roemmele M. SemEval-2012 Task7:Choice of Plausible Alternatives : An Evaluation of Commonsense Causal Reasoning. In Proc. the 1st Joint Conference on Lexical and Computational Semantics, Jun. 2012.
[57] Gennari JH, Musen MA, Fergerson RW, Grosso WE, Crubézy M, Eriksson H, Noy NF, Tu SW (2003) The evolution of Protégé: An environment for knowledge-based systems development. Int Journal of Human-Computer Studies 58(1):89–123 · Zbl 05696308 · doi:10.1016/S1071-5819(02)00127-1
[58] Panton K, Miraglia P, Salay N, Kahlert R C, Baxter D, Reagan R. Knowledge formation and dialogue using the KRAKEN toolset. In Proc. AAAI, Jul. 2002, pp.900–905.
[59] Witbrock M, Baxter D, Curtis J, Schneider D, Kahlert R, Miraglia P, Wagner P, Panton K, Matthews G. An interactive dialogue system for knowledge acquisition in Cyc. In Proc. Int. Joint Conf. Artificial Intelligence, Aug. 2003.
[60] Quinlan J, Cameron-Jones R (1995) Induction of logic programs: FOIL and related systems. New Generation Computing 13(3):287–312 · Zbl 05479870 · doi:10.1007/BF03037228
[61] Masters J, Matuszek C, Witbrock M. Ontology-based integration of knowledge from semi-structured web pages. Technical Report, Cycorp, 2006.
[62] Medelyan O, Legg C. Integrating Cyc and Wikipedia: Folksonomy meets rigorously defined common-sense. In Proc. AAAI Workshop on Wikipedia and AI, Jul. 2008.
[63] Pierce C, Booth D, Ogbuji C, Deaton C, Blackstone E, Lenat D (2012) SemanticDB: A semanticWeb infrastructure for clinical research and quality reporting. Current Bioinformatics 7(3):267–277 · doi:10.2174/157489312802460730
[64] Witbrock M, Panton K, Reed S, Schneider D, Aldag B, Reimers M, Bertolo S. Automated OWL annotation assisted by a large knowledge base. In Proc. the ISWC Workshop on Knowledge Markup and Semantic Annotation, Nov. 2004, pp.71–80.
[65] Mueller E. Natural language processing with ThoughtTreasure. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.35.9720 , 1998.
[66] Mueller E. A calendar with common sense. In Proc. the 5th Int. Conf. Intelligent User Interfaces, Jan. 2000, pp.198–201.
[67] Speer R, Havasi C, Surana H. Using verbosity: Common sense data from games with a purpose. In Proc. the 23nd Int. Florida AI Research Society Conf., May 2010.
[68] Anacleto J, Lieberman H, Tsutsumi M, Neris V, Carvalho A, Espinosa J, Godoi M, Zem-Mascarenhas S. Can common sense uncover cultural differences in computer applications? In Proc. the 19th World Computer Congress, August 2006, pp.1–10.
[69] Eckhardt N. A kid’s open mind common sense [Ph.D. Thesis]. Tilburg University, 2008.
[70] Chung H. GlobalMind – Bridging the gap between different cultures and languages with commonsense computing [Master Thesis]. Massachusetts Institute of Technology, 2006.
[71] Cambria E, Xia Y, Hussain A. Affective common sense knowledge acquisition for sentiment analysis. In Proc. the 8th Int. Conf. Language Resources and Evaluation, May 2012, pp.3580–3595.
[72] von Ahn L (2006) Games with a purpose. IEEE Computer Magazine 39(6):92–94 · Zbl 05089121 · doi:10.1109/MC.2006.196
[73] Speer R. Open mind commons: An inquisitive approach to learning common sense. In Proc. the Workshop on Common Sense and Intelligent User Interfaces, Jan. 2007.
[74] Minsky M. The Society Of Mind. Simon and Schuster, 1988.
[75] Fellbaum C. WordNet: An Electronic Lexical Database. The MIT press, 1998. · Zbl 0913.68054
[76] Havasi C, Speer R, Alonso J. ConceptNet 3: A flexible, multilingual semantic network for common sense knowledge. In Proc. the 22nd AAAI, Sept. 2007.
[77] Alonso J. CSAMOA: A common sense application model of architecture. In Proc. the Workshop on Common Sense and Intelligent User Interfaces, Jan. 2007.
[78] Pustejovsky J, Havasi C, Sauri R et al. Towards a generative lexical resource: The brandeis semantic ontology. In Proc. the Language Resources and Evaluation Conf., May 2006.
[79] Speer R, Havasi C. Representing general relational knowledge in ConceptNet 5. In Proc. the 8th Int. Conf. Language Resources and Evaluation, May 2012, pp.3679–3686.
[80] Havasi C, Speer R, Pustejovsky J, Lieberman H (2009) Digital intuition: Applying common sense using dimensionality reduction. IEEE Intelligent Systems 24(4):24–35 · doi:10.1109/MIS.2009.72
[81] von Assem M, von Ossenbruggen J. Wordnet 3.0 in RDF. http://semanticweb.cs.vu.nl/lod/wn30/ , Sep. 2011, pp.10–24.
[82] von Ahn L, Dabbish L (2008) Designing games with a purpose. Communications of the ACM 51(8):58–67
[83] Androutsopoulos I, Malakasiotis P (2010) A survey of paraphrasing and textual entailment methods. Journal of Artificial Intelligence Research 38(1):135–187 · Zbl 1210.68122
[84] Hashimoto C, Torisawa K, De Saeger S, Oh J, Kazama J. Excitatory or inhibitory: A new semantic orientation extracts contradiction and causality from the Web. In Proc. EMNLP-CoNLL 2012, Jul. 2012.
[85] Tian W, Cao C, H W. Representation, acquisition and analysis of psychological commonsense concepts. Computer Science, 2004, 31(6): 5–12. (in Chinese)
[86] Peng H, Cao C. Research on mining of associated events and discovering roles relaionship. Computer Science, 2010, 12. (in Chinese)
[87] Cao Y, Cao C, Zang L, Zhu Y, Wang S, Wang D. Acquiring commonsense knowledge about properties of concepts from text. In Proc. the 5th Int. Conf. Fuzzy Systems and Knowledge Discovery, Aug. 2008, Vol.4, pp.155–159.
[88] Cao Y, Cao C, Zang L, Wang S, Wang D. Extracting comparative commonsense from the Web. In Proc. the IFIP International Conference on Intelligent Information Processing, Oct. 2010, pp.154–162.
[89] Cao Y, Cao C, Zang L, Wang S. Web mining for causal relations between events. Information: An International Interdisci Journal, 2011, 15(1).
[90] Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J. Free-base: A collaboratively created graph database for structuring human knowledge. In Proc. SIGMOD2008, 2008, pp.1247–1250.
[91] Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes P N, Hellmann S, Morsey M, Kleef v P, Auer S, Bizer C. Dbpedia – A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web Journal, 2013, Under review.
[92] Ritter A, Mausam, Etzioni O. A latent dirichlet allocation method for selectional preferences. In Proc. the 48th Annual Meeting of the Association for Computational Linguistics, Jul. 2010, pp.424–434.
[93] Lin T, Mausam, Etzioni O. Identifying functional relations in web text. In Proc. the Conf. Empirical Methods in Natural Language Processing, Oct. 2010, pp.1266–1276.
[94] Schoenmackers S, Etzioni O, Weld D S, Davis J. Learning first-order Horn clauses from Web text. In Proc. the Conf. on Empirical Methods in Natural Language Processing, Oct. 2010, pp.1088–1098.
[95] Berant J, Dagan I, Goldberger J. Global learning of typed entailment rules. In Proc. the 49th HLT-ACL, Jun. 2011, vol.1, pp. 610–619.
[96] Soderland S, Roof B, Qin B, Xu S (2010) Mausam, Etzioni O. Adapting open information extraction to domain-speci\={}c relations. AI Magazine 31(3):93–102
[97] Klein D, Manning C. Accurate unlexicalized parsing. In Proc. the 41st Annual Meeting on Association for Computational Linguistics, Jul. 2003, Vol.1, pp.423–430.
[98] Downey D, Etzioni O, Soderland S. A probabilistic model of redundancy in information extraction. In Proc. Int. Joint Conf. Artificial Intelligence, Jul.30-Aug.5, 2005, Vol.19. · Zbl 1205.68447
[99] Banko M, Etzioni O. The tradeoffs between open and traditional relation extraction. In Proc. the 46th ACL, Jun. 2008, pp.28–36.
[100] Wu F, Weld D S. Open Information Extraction Using Wikipedia. In Proc. the 48th ACL, Jul. 2010, pp. 118–127.
[101] Wu W, Li H, Wang H, Zhu K Q. Probase: A probabilistic taxonomy for text understanding. In Proc. the ACM Int. Conf. Management of Data, Nov. 2012, pp. 481–492.
[102] Ponzetto S P, Strube M. Deriving a large scale taxonomy from Wikipedia. In Proc. AAAI2007, Jul. 2007, Volume 22. · Zbl 1182.68291
[103] Wang Y, Li H, Wang H, Zhu K Q. Towards topic search on the Web. Technical Report, Microsoft Research, 2010.
[104] Song Y, Wang H, Wang Z, Li H, Chen W. Short text conceptualization using a probabilistic knowledge base. In Proc. IJCAI2011, Jul. 2011, pp. 2330–2336.
[105] Carlson A, Betteridge J, Kisiel B, Settles B, Hruschka Jr E, Mitchell T. Toward an architecture for never-ending language learning. In Proc. the 24th AAAI, Jul. 2010, Vol.2.
[106] Carlson A, Betteridge J, Wang R et al. Coupled semi-supervised learning for information extraction. In Proc. the 3rd ACM Int. Conf. Web Search and Data Mining, Feb. 2010, pp.101–110.
[107] Fensel D, von Harmelen F, Andersson B et al. Towards LarKC: A platform for Web scale reasoning. In Proc. Int. Conf. Semantic Computing, Aug. 2008, pp.524–529.
[108] Assel M, Cheptsov A, Gallizo G et al. Large knowledge collider: A service-oriented platform for large-scale semantic reasoning. In Proc. the International Conference on Web Intelligence, Mining and Semantics, May 2011.
[109] Pasca M, Van Durme B. Weakly-supervised acquisition of open-domain classes and class attributes from web documents and query logs. In Proc. the 46th Annual Meeting of the Association for Computational Linguistics, Jun. 2008, pp.19–27.
[110] Turney P. Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In Proc. the 12th European Conf. Machine Learning, Sept. 2001, pp. 491–502. · Zbl 1007.68551
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.