45
IRUS TotalDownloads
Altmetric
Effective deep leaning methodologies for salient object detection
File | Description | Size | Format | |
---|---|---|---|---|
Ren-GY-2023-PhD-Thesis.pdf | Thesis | 14.81 MB | Adobe PDF | View/Open |
Title: | Effective deep leaning methodologies for salient object detection |
Authors: | Ren, Guangyu |
Item Type: | Thesis or dissertation |
Abstract: | Salient object detection has achieved great improvement by using deep neural networks. Diverse challenging problems arise from this computer vision task and attract more attention from researchers. This thesis investigates the issues in specific tasks and aims at proposing effective methodologies to tackle the problems and improve detection performance. More specifically, existing network architectures may cause dilution problems in highlevel semantic information during up-sample operations in the top-down pathway of the Feature Pyramid Network (FPN). In order to overcome this limitation, we propose a novel pyramid self-attention module (PSAM) and the adoption of an independent feature-complementing strategy. In addition, a channel-wise attention module is also employed to reduce redundant features of the feature pyramid network and provide refined results. Introducing depth information in a suboptimal fusion strategy may have a negative influence on the performance of SOD. We discuss the advantages of the so-called progressive multi-scale fusion method and propose a mask-guided feature aggregation module (MGFA). We also introduce a mask-guided refinement module (MGRM) to complement the high-level semantic features and reduce the irrelevant features from multi-scale fusion, leading to an overall refinement of detection. RGB-D methods sacrifice the model size to improve the detection accuracy, which may impede the practical application of SOD problems. To tackle this dilemma, we propose a dynamic distillation method along with simple noise elimination. To this end, the final model can significantly reduce the computational burden while maintaining the validity and mitigating the impact of distorted training data caused by low-quality depth maps. Furthermore, in cosaliency detection, we propose a novel adaptive intra-group aggregation (AIGA) method to model the relationship between individual feature representation in a single image and group feature representation. This proposed AIGA can effectively improve the performance without increasing extra network parameters. |
Content Version: | Open Access |
Issue Date: | Jan-2023 |
Date Awarded: | Mar-2023 |
URI: | http://hdl.handle.net/10044/1/103997 |
DOI: | https://doi.org/10.25560/103997 |
Copyright Statement: | Creative Commons Attribution NonCommercial Licence |
Supervisor: | Stathaki, Tania |
Department: | Electrical and Electronic Engineering |
Publisher: | Imperial College London |
Qualification Level: | Doctoral |
Qualification Name: | Doctor of Philosophy (PhD) |
Appears in Collections: | Electrical and Electronic Engineering PhD theses |
This item is licensed under a Creative Commons License