
PCP-tuning: Personalized Continuous Prompt Tuning for Few-Shot Learning

Ting LIU, Shaotian CAI, Xiaojun CHEN, Qin ZHANG
School of Computer Science and Technology, Shenzhen University, Shenzhen Guangdong 518071, China

Abstract

Pre-trained language models have achieved remarkable performance in few-shot learning with the rise of "prompt learning", where the key problem is how to construct a suitable prompt for each example; the sample and its prompt are combined into a new input to the language model (LM). A series of prompt construction methods have been proposed recently: some construct discrete prompts, while others focus on continuous prompts, but both typically apply one unified prompt to all examples. However, experimental results show that it is hard to find a single unified prompt that works for every example in a task: a given prompt may lead the LM to assign the correct class to some samples in a downstream classification task while producing wrong results for others. To this end, we propose a novel personalized continuous prompt tuning (PCP-tuning) method that learns personalized prompts tailored to each sample's semantics for few-shot learning. Two calibration techniques are proposed to control the distribution of the generated prompts and obtain better prompts. Extensive experimental results on ten benchmark tasks demonstrate the superior performance of our method.
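The core idea of the abstract, generating a continuous prompt conditioned on each sample rather than sharing one prompt across the task, can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual architecture: the MLP generator, prompt length, embedding dimension (768, BERT-style), and the normalization step standing in for the paper's calibration techniques are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PersonalizedPromptGenerator(nn.Module):
    """Maps a sample's sentence embedding to a sequence of continuous
    prompt vectors, so each sample receives its own prompt."""

    def __init__(self, embed_dim: int, prompt_length: int, hidden_dim: int = 256):
        super().__init__()
        self.prompt_length = prompt_length
        self.embed_dim = embed_dim
        self.mlp = nn.Sequential(
            nn.Linear(embed_dim, hidden_dim),
            nn.Tanh(),
            nn.Linear(hidden_dim, prompt_length * embed_dim),
        )

    def forward(self, sent_emb: torch.Tensor) -> torch.Tensor:
        # sent_emb: (batch, embed_dim) -> prompts: (batch, prompt_length, embed_dim)
        prompts = self.mlp(sent_emb).view(-1, self.prompt_length, self.embed_dim)
        # Placeholder for the paper's calibration: normalize each prompt
        # vector to keep generated prompts in a controlled region of the
        # embedding space.
        return F.normalize(prompts, dim=-1)

# Prepend the personalized prompt to the sample's token embeddings
# before feeding them to the LM (e.g. via an `inputs_embeds`-style input).
gen = PersonalizedPromptGenerator(embed_dim=768, prompt_length=5)
sent_emb = torch.randn(4, 768)       # e.g. [CLS] embeddings of 4 samples
tok_embs = torch.randn(4, 32, 768)   # token embeddings of those samples
inputs = torch.cat([gen(sent_emb), tok_embs], dim=1)  # (4, 37, 768)
```

Because the generator is conditioned on the sentence embedding, two samples with different semantics receive different prompt vectors, which is the distinction from unified-prompt methods such as prefix-tuning or prompt tuning.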

Article ID: 2096-7675(2024)01-0059-010

Journal of Xinjiang University(Natural Science Edition in Chinese and English)
Pages 59-68
Cite this article:
LIU T, CAI S, CHEN X, et al. PCP-tuning: Personalized Continuous Prompt Tuning for Few-Shot Learning. Journal of Xinjiang University(Natural Science Edition in Chinese and English), 2024, 41(1): 59-68. https://doi.org/10.13568/j.cnki.651094.651316.2023.09.17.0001