Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • About
  • Communities & Collections
  • Advanced Search
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Computing
  4. Computing
  5. Imperial College London Submission to VATEX Video Captioning Task.
 
  • Details
Imperial College London Submission to VATEX Video Captioning Task.
File(s)
1910.07482v1.pdf (211.08 KB)
Working paper
Author(s)
Caglayan, O
Wu, Z
Madhyastha, P
Wang, J
Specia, L
Type
Working Paper
Abstract
This paper describes the Imperial College London team's submission to the 2019' VATEX video captioning challenge, where we first explore two sequence-to-sequence models, namely a recurrent (GRU) model and a transformer model, which generate captions from the I3D action features. We then investigate the effect of dropping the encoder and the attention mechanism and instead conditioning the GRU decoder over two different vectorial representations: (i) a max-pooled action feature vector and (ii) the output of a multi-label classifier trained to predict visual entities from the action features. Our baselines achieved scores comparable to the official baseline. Conditioning over entity predictions performed substantially better than conditioning on the max-pooled feature vector, and only marginally worse than the GRU-based sequence-to-sequence baseline.
Date Issued
2019
Citation
2019
URI
http://hdl.handle.net/10044/1/74178
Publisher
arxiv
Copyright Statement
© 2019 The Authors.
Identifier
http://arxiv.org/abs/1910.07482
Subjects
cs.CL
cs.CL
cs.NE
cs.CL
cs.CL
cs.NE
Publication Status
Published
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback