HW-Forest: Deep Forest with Hashing Screening andWindow Screening

作者:Ma, Pengfei; Wu, Youxi*; Li, Yan; Guo, Lei; Jiang, He; Zhu, Xingquan; Wu, Xindong
来源:ACM Transactions on Knowledge Discovery from Data, 2022, 16(6): 123.
DOI:10.1145/3532193

摘要

As a novel deep learning model, gcForest has been widely used in various applications. However, current multi-grained scanning of gcForest produces many redundant feature vectors, and this increases the time cost of the model. To screen out redundant feature vectors, we introduce a hashing screening mechanism for multi-grained scanning and propose a model called HW-Forest which adopts two strategies: hashing screening andwindowscreening. HW-Forest employs perceptual hashing algorithm to calculate the similarity between feature vectors in hashing screening strategy, which is used to remove the redundant feature vectors produced by multi-grained scanning and can significantly decrease the time cost and memory consumption. Furthermore, we adopt a self-adaptive instance screening strategy called window screening to improve the performance of our approach, which can achieve higher accuracy without hyperparameter tuning on different datasets. Our experimental results show that HW-Forest has higher accuracy than other models, and the time cost is also reduced.