TopicLPRank: a keyphrase extraction method based on improved TopicRank

作者:Liao, Shengbin*; Yang, Zongkai; Liao, Qingzhou; Zheng, Zhangxiong
来源:Journal of Supercomputing, 2023, 79(8): 9073-9092.
DOI:10.1007/s11227-022-05022-0

摘要

We present a keyphrase extraction algorithm named TopicLPRank in this paper, which is an improved TopicRank algorithm. Different from the TopicRank which only uses the relative distance information of the text, we think that the length and absolute position of the text candidate keyphrases also have a certain influence on the results of the model for extraction keyphrases. Therefore, the proposed Topi-cLPRank incorporates these two factors on the basis of the TopicRank. The experi-mental results show that adding the location information and length information of candidate keyphrases can, respectively, increase the F-Score of the model by around 2.7% points and 1.7% points, which is equivalent to an increase of 19.6 and 12.3% compared with the TopicRank. At the same time, the fusion of the length and loca-tion information of the candidate keyphrase can increase the F-Score by around 3.5 percentage points, which is equivalent to an increase of 25.21% compared with the TopicRank in the dataset NUS.

全文