Egocentric image captioning for privacy-preserved passive dietary intake monitoring
OA Location
Author(s)
Type
Journal Article
Abstract
Camera-based passive dietary intake monitoring is able to continuously capture the eating episodes of a subject, recording rich visual information, such as the type and volume of food being consumed, as well as the eating behaviors of the subject. However, there currently is no method that is able to incorporate these visual clues and provide a comprehensive context of dietary intake from passive recording (e.g., is the subject sharing food with others, what food the subject is eating, and how much food is left in the bowl). On the other hand, privacy is a major concern while egocentric wearable cameras are used for capturing. In this article, we propose a privacy-preserved secure solution (i.e., egocentric image captioning) for dietary assessment with passive monitoring, which unifies food recognition, volume estimation, and scene understanding. By converting images into rich text descriptions, nutritionists can assess individual dietary intake based on the captions instead of the original images, reducing the risk of privacy leakage from images. To this end, an egocentric dietary image captioning dataset has been built, which consists of in-the-wild images captured by head-worn and chest-worn cameras in field studies in Ghana. A novel transformer-based architecture is designed to caption egocentric dietary images. Comprehensive experiments have been conducted to evaluate the effectiveness and to justify the design of the proposed architecture for egocentric dietary image captioning. To the best of our knowledge, this is the first work that applies image captioning for dietary intake assessment in real-life settings.
Date Issued
2023-03-06
Date Acceptance
2023-01-27
Citation
IEEE Transactions on Cybernetics, 2023, PP, pp.1-14
ISSN
1083-4419
Publisher
Institute of Electrical and Electronics Engineers
Start Page
1
End Page
14
Journal / Book Title
IEEE Transactions on Cybernetics
Volume
PP
Copyright Statement
© 2023 The Author(s). This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
Sponsor
Bill and Melinda Gates Foundation
Identifier
https://www.ncbi.nlm.nih.gov/pubmed/37028043
Grant Number
OPP1171395
Subjects
0102 Applied Mathematics
0801 Artificial Intelligence and Image Processing
0906 Electrical and Electronic Engineering
Artificial Intelligence & Image Processing
Publication Status
Published online
Coverage Spatial
United States
Date Publish Online
2023-03-06