Inspired by the concept of content-addressable retrieval from cognitive science, we propose a novel fragment-based Chinese named entity recognition (NER) model augmented with a lexicon-based memory in which both character-level and word-level features are combined to generate better feature representations for possible entity names. Observing that the boundary information of entity names is particularly useful to locate and classify them into pre-defined categories, position-dependent features, such as prefix and suffix, are introduced and taken into account for NER tasks in the form of distributed representations. The lexicon-based memory is built to help generate such position-dependent features and deal with the problem of out-of-vocabulary words. Experimental results show that the proposed model, called LEMON, achieved state-of-the-art performance with an increase in the F1-score up to 3.2% over the state-of-the-art models on four different widely-used NER datasets.
Nothman J, Ringland N, Radford W, Murphy T, Curran J R. Learning multilingual named entity recognition from Wikipedia. Artificial Intelligence , 2013, 194: 151–175. DOI: 10.1016/j.artint.2012.03.006.
Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. Natural language processing (almost) from scratch. The Journal of Machine Learning Research , 2011, 12: 2493–2537.
Li J, Sun A X, Han J L, Li C L. A survey on deep learning for named entity recognition. IEEE Trans. Knowledge and Data Engineering , 2022, 34(1): 50–70. DOI: 10.1109/TKDE.2020.2981314.
Chiu J P C, Nichols E. Named entity recognition with bidirectional LSTM-CNNs. Trans. Association for Computational Linguistics , 2016, 4: 357–370. DOI: 10.1162/tacl_a_00104.
Hassabis D, Kumaran D, Summerfield C, Botvinick M. Neuroscience-inspired artificial intelligence. Neuron , 2017, 95(2): 245–258. DOI: 10.1016/j.neuron.2017.06.011.
Hopfield J J. Neural networks and physical systems with emergent collective computational abilities. Proceedings of the National Academy of Sciences of the United States of America , 1982, 79(8): 2554–2558. DOI: 10.1073/pnas.79.8.2554.
Anderson J R, Bothell D, Byrne M D, Douglass S, Lebiere C, Qin Y L. An integrated theory of the mind. Psychological Review , 2004, 111(4): 1036–1060. DOI: 10.1037/0033-295X.111.4.1036.
Elman J L. Finding structure in time. Cognitive Science , 1990, 14(2): 179–211. DOI: 10.1207/s15516709cog1402_1.
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation , 1997, 9(8): 1735–1780. DOI: 10.1162/ neco.1997.9.8.1735.
Liu W J, Zhou P, Zhao Z, Wang Z R, Ju Q, Deng H T, Wang P. K-BERT: Enabling language representation with knowledge graph. Proceedings of the AAAI Conference on Artificial Intelligence , 2020, 34(3): 2901–2908. DOI: 10.1609/aaai.v34i03.5681.