School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China
Ningbo Institute of Northwestern Polytechnical University, Ningbo 315103, China, and is also with School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China
Show Author Information
Hide Author Information
Abstract
The massive growth of online commercial data has raised the request for an automatic recommender system to benefit both users and merchants. One of the most frequently used recommendation methods is collaborative filtering, but its accuracy is limited by the sparsity of the rating dataset. Most existing collaborative filtering methods consider all features when calculating user/item similarity and ignore much local information. In collaborative filtering, selecting neighbors and determining users’ similarities are the most important parts. For the selection of better neighbors, this study proposes a novel biclustering method based on modified fuzzy adaptive resonance theory. To reflect the similarity between users, a new measure that considers the effect of the number of users’ common items is proposed. Specifically, the proposed novel biclustering method is first adopted to obtain local similarity and local prediction. Second, item-based collaborative filtering is used to generate global predictions. Finally, the two resultant predictions are fused to obtain a final one. Experiment results demonstrate that the proposed method outperforms state-of-the-art models in terms of several aspects on three benchmark datasets.
No abstract is available for this article. Click the button above to view the PDF directly.
References
[1]
J.Lu, D. S.Wu, M. S.Mao, W.Wang, and G. Q.Zhang, Recommender system application developments: A survey, Decis. Support Syst., vol. 74, pp. 12–32, 2015.
B. M.Sarwar, G.Karypis, J.Konstan, and J.Riedl, Recommender systems for large-scale E-commerce: Scalable neighborhood formation using clustering, Communications, vol. 50, no. 12, pp. 158–167, 2002.
J.Lee, S.Bengio, S.Kim, G.Lebanon, and Y.Singer, Local collaborative ranking, in Proc. 23rd Int. Conf. on World Wide Web, Seoul, Republic of Korea, 2014, pp. 85–96.
J.Lee, S.Kim, G.Lebanon, and Y.Singer, Local low- rank matrix approximation, in Proc. 30th Int. Conf. on Machine Learning, Atlanta, GA, USA, 2013, pp. 82–90.
Q. H.Huang, Y. D.Chen, L. Z.Liu, D. C.Tao, and X. L.Li, On combining biclustering mining and AdaBoost for breast tumor classification, IEEE Trans. Knowl. Data Eng., vol. 32, no. 4, pp. 728–738, 2020.
Q. H.Huang, J.Yang, X. F.Feng, A. W. C.Liew, and X. L.Li, Automated trading point forecasting based on bicluster mining and fuzzy inference, IEEE Trans. Fuzzy Syst., vol. 28, no. 2, pp. 259–272, 2020.
J. M.Cotelo, F. J.Ortega, J. A.Troyano, F.Enríquez, and F. L.Cruz, Known by who we follow: A biclustering application to community detection, IEEE Access, vol. 8, pp. 192218–192228, 2020.
G. A.Carpenter, S.Grossberg, and D. B.Rosen, Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system, Neural Netw., vol. 4, no. 6, pp. 759–771, 1991.
P.Thakkar, K.Varma, V.Ukani, S.Mankad, and S.Tanwar, Combining user-based and item-based collaborative filtering using machine learning, in Information and Communication Technology for Intelligent Systems, S.Satapathy and A.Joshi, eds.Singapore: Springer, 2019, pp. 173–180.
B.Sarwar, G.Karypis, J.Konstan, and J.Riedl, Item-based collaborative filtering recommendation algorithms, in Proc. 10th Int. Conf. on World Wide Web, Hong Kong, China, 2001, pp. 285–295.
T. L.Huang, R. J.Zhao, L. Q.Bi, D. F.Zhang, and C.Lu, Neural embedding singular value decomposition for collaborative filtering, IEEE Trans. Neural Netw. Learn. Syst., .
A.Ramlatchan, M. Y.Yang, Q.Liu, M.Li, J. X.Wang, and Y. H.Li, A survey of matrix completion methods for recommendation systems, Big Data Mining and Analytics, vol. 1, no. 4, pp. 308–323, 2018.
X.Wang, H. Y.Jin, A.Zhang, X. N.He, T.Xu, and T. S.Chua, Disentangled graph collaborative filtering, in Proc. 43rd Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, Virtual Event, China, 2020, pp. 1001–1010.
M. G.Vozalis and K. G.Margaritis, A recommender system using principal component analysis, in Proc. 11th Panhellenic Conf. in Informatics, Patra, Greece, 2007, pp. 271–283.
J. C.Fan, L. J.Ding, Y. D.Chen, and M.Udell, Factor group-sparse regularization for efficient low-rank matrix recovery, in Proc. 33rd Int. Conf. on Neural Information Processing Systems, Vancouver, Canada, 2019, pp. 5104–5114.
Z. D.Zhao and M. S.Shang, User-based collaborative-filtering recommendation algorithms on Hadoop, in Proc. 3rd Int. Conf. on Knowledge Discovery and Data Mining, Phuket, Thailand, 2010, pp. 478–481.
P. K.Singh, P. K. D.Pramanik, and P.Choudhury, Mitigating sparsity using Bhattacharyya coefficient and items’ categorical attributes: Improving the performance of collaborative filtering based recommendation systems, Appl. Intell., vol. 52, no. 5, pp. 5513–5536, 2022.
M. Y.Jiang, Z. F.Zhang, J. Q.Jiang, Q. H.Wang, and Z. L.Pei, A collaborative filtering recommendation algorithm based on information theory and bi-clustering, Neural Comput. Appl., vol. 31, no. 12, pp. 8279–8287, 2019.
M.Duma and B.Twala, Sparseness reduction in collaborative filtering using a nearest neighbour artificial immune system with genetic algorithms, Expert Syst. Appl., vol. 132, pp. 110–125, 2019.
C. G.Huang and J.Yin, Effective association clusters filtering to cold-start recommendations, in Proc. 7th Int. Conf. on Fuzzy Systems and Knowledge Discovery, Yantai, China, 2010, pp. 2461–2464.
S.Kant and T.Mahara, Merging user and item based collaborative filtering to alleviate data sparsity, Int. J. Syst. Assur. Eng. Manag., vol. 9, no. 1 pp. 173–1797, 2018.
Z. J.Yang, D. H.Xia, J.Liu, C.Zheng, Y. Z.Qu, Y. D.Chen, and C. J.Zhang, Fusion of internal similarity to improve the accuracy of recommendation algorithm, J. Internet Things, vol. 3, no. 2 pp. 65–76, 2021.
C. X.Zhang, M.Yang, J.Lv, and W. Q.Yang, An improved hybrid collaborative filtering algorithm based on tags and time factor, Big Data Mining and Analytics, vol. 1, no. 2, pp. 128–136, 2018.
Sun J, Zhang Y. Recommendation System with Biclustering. Big Data Mining and Analytics, 2022, 5(4): 282-293. https://doi.org/10.26599/BDMA.2022.9020012
The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
10.26599/BDMA.2022.9020012.F001
Example for illustrating the difference between one-way clustering and biclustering.
10.26599/BDMA.2022.9020012.F002
Workflow of the proposed method.
10.26599/BDMA.2022.9020012.F003
Example for illustrating dataset separation.
10.26599/BDMA.2022.9020012.F004
Fuzzy ART[
35
].
Biclusters searching
Biclustering can be considered as the combination of column and row clusterings. Item clustering is performed with the standard FA model [
15
]. The inputs are item vectors (), vigilance parameter , choice parameter , learning parameter , and maximal epochs . Finally, the item cluster set composed of item clusters can be obtained.
User clustering is performed with , a modified FA. The pseudocode of user clustering is displayed in Algorithm 1. The modification lies in that when an input pattern (user vector) passes the vigilance test, subsequently, the bicluster test needs to be passed before assigning to the winning neuron (user cluster). The bicluster test is that if the correlation between the input pattern and the bicluster set is not smaller than a predetermined threshold , then the input pattern is assigned to the winning neuron; otherwise, increase (the vigilance parameter, and is its increment) and rerun with the updated . Given that high vigilance threshold leads to narrow generalization and the neuron (cluster) represents fewer input patterns, if no existing committed neuron (cluster) passes the bicluster test, must be enlarged. With , user cluster set composed of user clusters can be obtained. When an input pattern is presented to the layer of , the following steps are performed:
(1) The input is complement-coded to avoid category proliferation. The complement coding is performed with , where is the complement of , and .
(2) Each neuron in calculates a value for the cluster choice function, which selects the neuron () having maximal function value, as shown in the following:
where is the weight of the -th neuron, “” is the fuzzy MIN operation defined by , is the value of the -th neuron’s -th weight. “” means taking the first-order norm, and is a very small positive real number.
(3) Calculate the match function value of as follows:
if , passes the vigilance test and becomes the potential winning neuron . If , then apparently cannot be the potential winning neuron and is shut off by the orienting subsystem. The competition among the remaining neurons is rerun until the potential winning neuron is found.
(4) Combine (the user cluster that contains ) and each item cluster in to construct biclusters. Then, calculate the correlation () between the bicluster and input pattern as follows:
where denotes the items co-rated by users and . is the cardinality of . is the local similarity measure of two users and is the modified Pearson correlation coefficient. The difference lies in that considers the number of co-rated items . The bigger becomes, the more items two users rate simultaneously, and the higher their similarity becomes. is the -th bicluster, represents the -th row of , and is the number of rows (users) in .
If at least one of these passes the bicluster test (), then set as the final winning neuron . Otherwise, increase the vigilance with and jump to the second step to rerun.
(5) Update the weight of the winning neuron for learning the information contained in the input pattern as follows:
where is the learning rate.
With the above item and user clustering, the item clusters and user clusters are generated, biclusters can be obtained by pairwise coupling and . The next step is to obtain local predictions with the mined biclusters and BBCF from .