Dr. Marc A. Kastner

About me

Assistant Professor
Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Japan

Research Interests

  • Connecting Computer Vision with the Human
  • Human Perception in Vision
  • Visual sentiment



An arXiv preprint for my in-progress work on "IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining" has been made available.


[Call for papers] Our ICME 2023 Special Session on "Quality Enhancement and Assessment for Low-quality Multimedia Data Understanding" got accepted. We are looking forward to your submissions!


My paper "Towards captioning an image collection from a combined scene graph representation approach" got accepted for the 29th Intl. Conf. on MultiMedia Modeling (MMM2023).


My paper "Detection of birds in a 3D environment referring to audio-visual information" got accepted for the 18th IEEE Intl. Conf. on Advanced Video and Signal-based Surveillance (AVSS2022).


My paper "Action Semantic Alignment for Image Captioning" got accepted for IEEE Int. Conf. on Multimedia Information Processing and Retrieval (MIPR) 2022.

Older news



Computer Vision Laboratory, Kyoto University, Japan

My other websites (Outgoing links)

My Flickr profile (Photography) My personal blog in Esperanto My website on productivity

© 2013-2022 Marc A. Kastner. Powered by KirbyCMS. Some rights reserved. Privacy policy.