カストナー マークアウレル

博士(情報学)

Image Collection Scene Graph Summarization Enhancing Relation Predictor with External Knowledge

研究業績へ戻る

著者: Itthisak Phueaksri, Marc A. Kastner, Yasutomo Kawanishi, Takahiro Komamizu, Ichiro Ide

あらすじ:

A summarization task is widely introduced to summarize multiple documents into a short description or to find representative information. A text summarization task is a task that summarizes textual information into a short description, whereas an image collection summarization task also known as an album summarization task, aims to find the visual representative information. Since scene graph generation has the advantage of describing the visual contexts of an image and, in recent years, incorporating external knowledge into scene graph generation has shown effectiveness, we propose a novel scene graph summarization method incorporating external knowledge. The key idea of the proposed method is to enhance the relation predictor toward ambiguous relationships of an image collection. Due to the annotation limitation of the dataset for this task, we first train and evaluate the model using the Visual Genome dataset, a single image scene graph dataset. Then, we introduce an extended annotated MS-COCO dataset for this task to evaluate the model on an image collection scene graph summarization task. A preliminary experiment shows promising direction of this approach.

種類: Poster at MIRU Symposium (画像の認識・理解シンポジウム)

日付: July 2023


この研究についてコメントやご意見がある場合、ぜひ以下にコメントを投稿してくだい。メールにてご連絡も大歓迎です。
© 2013-2023 Marc A. Kastner. Powered by KirbyCMS. Some rights reserved. Privacy policy.