Dr. Marc A. Kastner


Towards detecting birds from panorama video aided by Sound Source Localization


Authors: Baidong Chu, Chihaya Matsuhira, Yasutomo Kawanishi, Marc A. Kastner, Takahiro Komamizu, Ichiro Ide, Daisuke Deguchi

Abstract:

In this report, we study a method for detecting birds in panorama video with the aid of Sound Source Localization (SSL). Birds occupy only a tiny part of each panorama frame, which makes them hard to detect directly. In the proposed method, SSL algorithms first roughly localize the birds from the audio data; the corresponding regions are then cropped from the video frames and fed to a Convolutional Neural Network (CNN) for detection. Narrowing the search area with SSL makes it possible to detect relatively tiny birds in large video frames and improves both detection precision and processing time. Finally, we applied our method to a bird dataset and confirmed its effectiveness.
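The crop step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes an equirectangular panorama in which an azimuth of 0 degrees maps to column 0 and columns grow linearly with azimuth, and it assumes that SSL yields one azimuth estimate per sound source. The function names, frame width, and crop width are all hypothetical, and the CNN detector itself is left out.

```python
def crop_start_column(azimuth_deg, frame_width, crop_width):
    """Return the left column of a crop centered on the SSL azimuth estimate.

    Assumption: azimuth 0 maps to column 0 and increases linearly to the right.
    """
    if not 0 < crop_width <= frame_width:
        raise ValueError("crop_width must be in (0, frame_width]")
    center = int((azimuth_deg % 360.0) / 360.0 * frame_width)
    # Modulo keeps the start inside the frame when the crop crosses the seam.
    return (center - crop_width // 2) % frame_width


def crop_columns(row, start, crop_width):
    """Cut crop_width pixels from one image row, wrapping around the 360-degree seam."""
    width = len(row)
    return [row[(start + i) % width] for i in range(crop_width)]


def to_frame_x(x_crop, start, frame_width):
    """Map an x coordinate found inside the crop back to the full panorama frame."""
    return (start + x_crop) % frame_width


if __name__ == "__main__":
    # Hypothetical 3840-pixel-wide frame and a 640-pixel-wide crop.
    start = crop_start_column(180.0, frame_width=3840, crop_width=640)
    print(start)
    print(crop_columns(list(range(8)), start=6, crop_width=4))
```

A detector would then run on the cropped region only, and `to_frame_x` would translate the horizontal coordinates of its bounding boxes back to panorama coordinates. The vertical axis (elevation) could be handled in the same way if the SSL stage provides it.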

Type: Talk at Meeting of the Technical Committee on Media Experience and Virtual Environment, MVE (メディアエクスペリエンス・バーチャル環境基礎研究会)

Publication date: March 2022


If you have questions or comments about this research, please leave a comment below or send me an email. I will respond quickly.