過去のお知らせ
-
Extending gaussian splatting to audio: optimizing audio points for novel-view acoustic synthesis
Masaki Yoshida, Ren Togo, Takahiro Ogawa, Miki Haseyama
2025 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (IEEE VRW)
-
Manta: Enhancing mamba for few-shot action recognition of long sub-sequence
Wenbo Huang, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang, Fang Dong, Jiahui Jin, Takahiro Ogawa, Miki Haseyama
The 39th AAAI Conference on Artificial Intelligence (AAAI-25)
-
Expert comment generation from sports videos using multimodal LLM
Tatsuki Seino, Naoki Saito, Takahiro Ogawa, Huang-Chia Shih, Satoshi Asamizu, Miki Haseyama
2025 International Workshop on Advanced Image Technology (IWAIT2025)
-
Improving robustness of CLIP by adversarial training enhanced by brain activity
Tasuku Nakajiama, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama
2025 International Workshop on Advanced Image Technology (IWAIT2025)
-
Balancing generalization and personalization by sharing layers in clustered federated learning
Kenta Kubota, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2025 International Workshop on Advanced Image Technology (IWAIT2025)
-
Enhanced framework for generating counterfactual images with sophisticated caption and inversion-free image editing
Xiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2025 International Workshop on Advanced Image Technology (IWAIT2025)
-
Learning hierarchical video-text relationship via large language model for cross-modal video retrieval
Huaying Zhang, Ren Togo, Takahiro Ogawa, Miki Haseyama
2025 International Workshop on Advanced Image Technology (IWAIT2025)
-
Generalizing human motion style transfer method based on metadata-independent learning
Yuki Era, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
SIGGRAPH Asia 2024 Posters
-
An evaluation metric for single image-to-3D models based on object detection perspective
Yuiko Uchida, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
SIGGRAPH Asia 2024 Technical Communications
-
MMT-BERT: Chord-aware symbolic music generation based on multitrack music transformer and MusicBERT
Jinlong Zhu, Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama
The 25th International Society for Music Information Retrieval Conference (ISMIR2024)
-
Personalized visual emotion classification via in-context learning in multimodal LLM
Ryo Takahashi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Generative dataset distillation based on large model pool
Longzhen Li, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Multimodal adversarial defense trained on features extracted from images and brain activity
Tasuku Nakajima, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Improving zero-shot adversarial robustness via integrating image features of foundation models
Koshiro Toishi, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Lung disease classification with limited training data based on weight selection technique
Ayaka Tsutsumi, Guang Li, Ren Togo, Takahiro Ogawa, Satoshi Kondo, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Zero-shot controllable music generation from videos using facial expressions
Shilin Liu, Kyohei Kamikawa, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Zero-shot composed video retrieval with projection module bridging modality gap
Kenta Uesugi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
Zero-shot composed image retrieval considering query-target relationship leveraging masked image-text pairs
Huaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama
2024 IEEE International Conference on Image Processing (ICIP 2024)
-
Lung cancer classification using masked autoencoder pretrained on J-MID database
Ren Tasai, Guang Li, Ren Togo, Minghui Tang, Takaaki Yoshimura, Hiroyuki Sugimori, Kenji Hirata, Takahiro Ogawa, Kohsuke Kudo, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)
-
An evaluation metric for single image-to-3D models based on a class confidence score of object detection models
Yuiko Uchida, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE 2024)