Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Japan
An arXiv preprint for my in-progress work on "IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining" has been made available.
[Call for papers] Our ICME 2023 Special Session on "Quality Enhancement and Assessment for Low-quality Multimedia Data Understanding" got accepted. We are looking forward to your submissions!
My paper "Towards captioning an image collection from a combined scene graph representation approach" got accepted for the 29th Intl. Conf. on MultiMedia Modeling (MMM2023).
My paper "Detection of birds in a 3D environment referring to audio-visual information" got accepted for the 18th IEEE Intl. Conf. on Advanced Video and Signal-based Surveillance (AVSS2022).
My paper "Action Semantic Alignment for Image Captioning" got accepted for IEEE Int. Conf. on Multimedia Information Processing and Retrieval (MIPR) 2022.
Computer Vision Laboratory, Kyoto University, Japan
My other websites (Outgoing links)
My Flickr profile (Photography) My personal blog in Esperanto My website on productivity