Bi-directional ConvLSTM-based interactive query focused video summarisation Online publication date: Fri, 05-Apr-2024
by Vasudha Tiwari; Charul Bhatnagar
International Journal of Wireless and Mobile Computing (IJWMC), Vol. 26, No. 3, 2024
Abstract: Video summarisation presents the essence of a video in a compact form by extracting the most salient frames from it in a temporal sequence. Although, over time, video summarisation has gained much focus from the researchers, yet the need of personalised summaries that are based on user's intent still needs exploration. The authors propose an interactive, Query Focussed Video Summarisation (QFVS) approach which attempts to find those frames of a video that have maximum pertinence to user's text query. The proposed model consists of a Bi-directional ConvLSTM as an enhancement over ConvLSTM along with Resnet-50 for feature extraction. The input text query is matched with the predicted labels produced by the model and the frames with maximum similarity are selected for summary generation. The proposed method is evaluated on performance with previous state-of-art works and the results clearly demonstrate a significant improvement in the performance and prove the efficiency of the approach.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Wireless and Mobile Computing (IJWMC):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com