×

BERT

swMATH ID: 30756
Software Authors: Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Description: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications. BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural language processing tasks, including pushing the GLUE score to 80.5
Homepage: https://arxiv.org/abs/1810.04805
Source Code:  https://github.com/google-research/bert
Related Software: Tensor2Tensor; Adam; word2vec; RoBERTa; PyTorch; GloVe; ImageNet; Python; GitHub; Transformers; AlexNet; XLNet; GPT-3; TensorFlow; SQuAD; ALBERT; GLUE; BLEU; Scikit; MNIST
Cited in: 183 Documents
all top 5

Cited by 651 Authors

3 Mues, Christophe
2 Baskerville, Nicholas P.
2 Bravo, Cristián
2 d’Ascoli, Stéphane
2 Grabovoy, Andrey Valerievich
2 Kadioglu, Serdar
2 Keating, Jonathan Peter
2 Li, Jianxin
2 Lim, Suryani
2 Mezzadri, Francesco
2 Najnudel, Joseph
2 Prade, Henri M.
2 Richard, Gilles
2 Song, Pengfei
2 Strijov, Vadim V.
2 Volpi, Riccardo
2 Wu, Jianhong
2 Xiong, Hui
2 Yang, Zijian Győző
2 Ying, Lexing
2 Zabaras, Nicholas J.
2 Zhang, Kunli
1 Abreu, Salvador
1 Adedoyin-Olowe, Mariam
1 Adouane, Lounis
1 Afantenos, Stergos
1 Ahmetoglu, Alper
1 Aickelin, Uwe
1 Alistarh, Dan
1 Allouah, Youssef
1 Androutsopoulos, Ion
1 Angelopoulos, Anastasios N.
1 Antonyrajah, Denvar
1 Aránguiz, Ignacio
1 Ashfaq, Awais
1 Atarashi, Kyohei
1 Avetisyan, Karen L.
1 Babbar, Rohit
1 Bahri, Yasaman
1 Bakhtin, Anton
1 Baldi, Pierre
1 Baldwin, Timothy
1 Banner, Ron
1 Barbiero, Pietro
1 Barceló, Pablo
1 Baskin, Chaim
1 Bates, Stephen
1 Beise, Hans-Peter
1 Ben-Nun, Tal
1 Bernardy, Jean-Philippe
1 Bertogna, Marko
1 Beygelzimer, Alina
1 Bhogal, Jagdev
1 Biroli, Giulio
1 Blumberg, Stefano B.
1 Borchert, Philipp
1 Bošnjak, Matko
1 Bronstein, Alex M.
1 Budak, Ceren
1 Büsing, Lars
1 Bustince Sola, Humberto
1 Camacho-Collados, José
1 Cambria, Erik
1 Camburu, Oana-Maria
1 Carnevali, Julio César
1 Cassé, Alhassan
1 Chan, Raymond Hon-Fu
1 Chang, Hongyang
1 Chen, Fukai
1 Chen, Jiaheng
1 Chen, Jiaoyan
1 Chen, Jingzhou
1 Chen, Junqing
1 Chen, Kairui
1 Chen, Lihan
1 Chen, Ruiqi
1 Chen, SiZhe
1 Chen, Tao
1 Chen, Wei
1 Chen, Xinlong
1 Chen, Yunwen
1 Chen, Zitao
1 Cheng, Lei
1 Chidlovskii, Boris
1 Chmiel, Brian
1 Choksi, Bhavin
1 Chow, Joseph Y. J.
1 Christophides, Vassilis
1 Ciatto, Giovanni
1 Ciravegna, Gabriele
1 Cohen, Elazar
1 Cohen, Trevor
1 Collins, Tom
1 Correia, Adriana D.
1 Costa, Gianni
1 Santos Costa, Vítor
1 Coussement, Kristof
1 Cruz, Meenalosini Vimal
1 Csurka, Gabriela
1 Cui, Tianyu
...and 551 more Authors
all top 5

Cited in 67 Serials

17 Artificial Intelligence
13 Machine Learning
9 The Journal of Artificial Intelligence Research (JAIR)
8 Information Sciences
8 Neural Networks
7 European Journal of Operational Research
7 Journal of Machine Learning Research (JMLR)
7 Mathematical Biosciences and Engineering
5 Computer Science Review
4 Journal of Computational Physics
4 Journal of Statistical Mechanics: Theory and Experiment
3 Computer Methods in Applied Mechanics and Engineering
3 Journal of Scientific Computing
3 Automation and Remote Control
3 Data Mining and Knowledge Discovery
3 Quantum Information Processing
2 Chaos, Solitons and Fractals
2 Journal of Econometrics
2 International Journal of Approximate Reasoning
2 Neural Computation
2 INFORMS Journal on Computing
2 Lobachevskii Journal of Mathematics
2 Journal of Physics A: Mathematical and Theoretical
2 Annales Mathematicae et Informaticae
2 Foundations and Trends in Machine Learning
2 SIAM Journal on Mathematics of Data Science
1 Journal of Mathematical Biology
1 Journal of Statistical Physics
1 Psychometrika
1 Applied Mathematics and Computation
1 Computing
1 Journal of the American Statistical Association
1 Journal of Computational and Applied Mathematics
1 Programming and Computer Software
1 Insurance Mathematics & Economics
1 Physica D
1 Computers & Operations Research
1 SIAM Journal on Matrix Analysis and Applications
1 Queueing Systems
1 Annals of Operations Research
1 Cybernetics and Systems Analysis
1 Journal of Computer and Systems Sciences International
1 Journal of Inverse and Ill-Posed Problems
1 ETNA. Electronic Transactions on Numerical Analysis
1 Annals of Mathematics and Artificial Intelligence
1 Mathematical Finance
1 Soft Computing
1 Scandinavian Actuarial Journal
1 Quantitative Finance
1 Theory and Practice of Logic Programming
1 North American Actuarial Journal
1 International Journal of Wavelets, Multiresolution and Information Processing
1 Parallel Processing Letters
1 Mathematics in Industry
1 Foundations and Trends in Computer Graphics and Vision
1 Foundations and Trends in Signal Processing
1 Advances in Applied Mathematics and Mechanics
1 Science China. Mathematics
1 Synthesis Lectures on Artificial Intelligence and Machine Learning
1 Bayesian Analysis
1 Communications in Mathematics and Statistics
1 Journal of the Operations Research Society of China
1 Research in the Mathematical Sciences
1 Texts in Computer Science
1 Information Geometry
1 Prikladnaya Diskretnaya Matematika
1 Electronic Research Archive

Citations by Year