NEWS お知らせ

過去のお知らせ

  • Extending gaussian splatting to audio: optimizing audio points for novel-view acoustic synthesis

    Masaki Yoshida, Ren Togo, Takahiro Ogawa, Miki Haseyama

    2025 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (IEEE VRW)

  • Manta: Enhancing mamba for few-shot action recognition of long sub-sequence

    Wenbo Huang, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang, Fang Dong, Jiahui Jin, Takahiro Ogawa, Miki Haseyama

    The 39th AAAI Conference on Artificial Intelligence (AAAI-25)

  • Expert comment generation from sports videos using multimodal LLM

    Tatsuki Seino, Naoki Saito, Takahiro Ogawa, Huang-Chia Shih, Satoshi Asamizu, Miki Haseyama

    2025 International Workshop on Advanced Image Technology (IWAIT2025)

  • Improving robustness of CLIP by adversarial training enhanced by brain activity

    Tasuku Nakajiama, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama

    2025 International Workshop on Advanced Image Technology (IWAIT2025)

  • Balancing generalization and personalization by sharing layers in clustered federated learning

    Kenta Kubota, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2025 International Workshop on Advanced Image Technology (IWAIT2025)

  • Enhanced framework for generating counterfactual images with sophisticated caption and inversion-free image editing

    Xiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2025 International Workshop on Advanced Image Technology (IWAIT2025)

  • Learning hierarchical video-text relationship via large language model for cross-modal video retrieval

    Huaying Zhang, Ren Togo, Takahiro Ogawa, Miki Haseyama

    2025 International Workshop on Advanced Image Technology (IWAIT2025)

  • Generalizing human motion style transfer method based on metadata-independent learning

    Yuki Era, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    SIGGRAPH Asia 2024 Posters

  • An evaluation metric for single image-to-3D models based on object detection perspective

    Yuiko Uchida, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    SIGGRAPH Asia 2024 Technical Communications

  • MMT-BERT: Chord-aware symbolic music generation based on multitrack music transformer and MusicBERT

    Jinlong Zhu, Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama

    The 25th International Society for Music Information Retrieval Conference (ISMIR2024)

  • Personalized visual emotion classification via in-context learning in multimodal LLM

    Ryo Takahashi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Generative dataset distillation based on large model pool

    Longzhen Li, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Multimodal adversarial defense trained on features extracted from images and brain activity

    Tasuku Nakajima, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Improving zero-shot adversarial robustness via integrating image features of foundation models

    Koshiro Toishi, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Lung disease classification with limited training data based on weight selection technique

    Ayaka Tsutsumi, Guang Li, Ren Togo, Takahiro Ogawa, Satoshi Kondo, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Zero-shot controllable music generation from videos using facial expressions

    Shilin Liu, Kyohei Kamikawa, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Zero-shot composed video retrieval with projection module bridging modality gap

    Kenta Uesugi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • Zero-shot composed image retrieval considering query-target relationship leveraging masked image-text pairs

    Huaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama

    2024 IEEE International Conference on Image Processing (ICIP 2024)

  • Lung cancer classification using masked autoencoder pretrained on J-MID database

    Ren Tasai, Guang Li, Ren Togo, Minghui Tang, Takaaki Yoshimura, Hiroyuki Sugimori, Kenji Hirata, Takahiro Ogawa, Kohsuke Kudo, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)

  • An evaluation metric for single image-to-3D models based on a class confidence score of object detection models

    Yuiko Uchida, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

    2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)