IRUS Total

Acceleration of ListNet for ranking using reconfigurable architecture

File Description SizeFormat 
Qiang-Li-2020-PhD-Thesis.pdfThesis4.51 MBAdobe PDFView/Open
Title: Acceleration of ListNet for ranking using reconfigurable architecture
Authors: Li, Qiang
Item Type: Thesis or dissertation
Abstract: Document ranking is used to order query results by relevance with ranking models. ListNet is a well-known ranking approach for constructing and training learning-to-rank models. Compared with traditional learning approaches, ListNet delivers better accuracy, but is computationally too expensive to learn models with large data sets due to the large number of permutations and documents involved in computing the gradients. Currently, the long training time limits the practicality of ListNet in ranking applications such as breaking news search and stock prediction, and this situation is getting worse with the increase in data-set size. In order to tackle the challenge of long training time, this thesis optimises the ListNet algorithm, and designs hardware accelerators for learning the ListNet algorithm using Field Programmable Gate Arrays (FPGAs), making the algorithm more practical for real-world application. The contributions of this thesis include: 1) A novel computation method of the ListNet algorithm for ranking. The proposed computation method exposes more fine-grained parallelism for FPGA implementation. 2) A weighted sampling method that takes into account the ranking positions, along with an effective quantisation method based on FPGA devices. The proposed design achieves a 4.42x improvement over GPU implementation speed, while still guaranteeing the accuracy. 3) A full reconfigurable architecture for the ListNet training using multiple bitstream kernels. The proposed method achieves a higher model accuracy than pure fixed point training, and a better throughput than pure floating point training. This thesis has resulted in the acceleration of the ListNet algorithm for ranking using FPGAs by applying the above techniques. Significant improvements in speed have been achieved in this work against CPU and GPU implementations.
Content Version: Open Access
Issue Date: Sep-2019
Date Awarded: Apr-2020
URI: http://hdl.handle.net/10044/1/80539
DOI: https://doi.org/10.25560/80539
Copyright Statement: Creative Commons Attribution NonCommercial Licence
Supervisor: Cheung, Peter Ying-Kay
Thomas, David Barrie
Department: Electrical and Electronic Engineering
Publisher: Imperial College London
Qualification Level: Doctoral
Qualification Name: Doctor of Philosophy (PhD)
Appears in Collections:Electrical and Electronic Engineering PhD theses