GENIA corpus swMATH ID: 35527 Software Authors: Kim, J. D.; Ohta, T.; Tateisi, Y.; Tsujii, J. Description: GENIA corpus - a semantically annotated corpus for bio-textmining. Motivation: Natural language processing (NLP) methods are regarded as being useful to raise the potential of text mining from biological literature. The lack of an extensively annotated corpus of this literature, however, causes a major bottleneck for applying NLP techniques. GENIA corpus is being developed to provide reference materials to let NLP techniques work for bio-textmining. Results: GENIA corpus version 3.0 consisting of 2000 MEDLINE abstracts has been released with more than 400 000 words and almost 100 000 annotations for biological terms. Availability: GENIA corpus is freely available at http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA Homepage: https://academic.oup.com/bioinformatics/article/19/suppl_1/i180/227927 Related Software: GENETAG; OpenDMAP; ABNER; Wikipedia Miner; GATE; MALLET; DrugBank; BioGRID; IntAct; Penn Treebank; bootstrap; RelEx; iHOP; MedScan; ProMiner; LinkGrammar-WN Cited in: 2 Documents all top 5 Cited by 6 Authors 1 Li, Yanpeng 1 Liao, Stephen Shaoyi 1 Lin, Hongfei 1 Xu, Kaiquan 1 Yang, Zhihao 1 Zhao, J. Leon Cited in 2 Serials 1 INFORMS Journal on Computing 1 Computational Biology and Chemistry Cited in 3 Fields 2 Computer science (68-XX) 1 Statistics (62-XX) 1 Biology and other natural sciences (92-XX) Citations by Year