Togo Ren(Faculty of Information Science and Technology　Media and Network Technologies　Information Media Science and Technology)

SEARCH

Search Details

Togo Ren

Faculty of Information Science and Technology　Media and Network Technologies　Information Media Science and Technology Associate Professor

2015年3月 北海道大学医学部保健学科放射線技術科学専攻 卒業．
2017年3月 北海道大学大学院情報科学研究科 修士課程 修了．
2019年3月 北海道大学大学院 情報科学研究科 博士後期課程 修了 (在学期間短縮)．
2019年4月 日本学術振興会 特別研究員 (PD)．
2020年2月 北海道大学 数理・データサイエンス教育研究センター 特任助教.
2022年2月 北海道大学 大学院情報科学研究院 特任助教.
2025年4月 北海道大学 大学院情報科学研究院 准教授.

医用画像を中心とした異分野連携に関する研究に従事．
診療放射線技師国家資格．
IEEE, ACM会員．

Researcher basic information

■ Degree

博士（情報科学）

■ URL
researchmap URL

https://researchmap.jp/r-togo?lang=en

ホームページURL

https://jglobal.jst.go.jp/detail?JGLOBAL_ID=201801008051611119

https://www-lmd.ist.hokudai.ac.jp/ren-togo/

■ Various IDs
Researcher number

60840395

J-Global ID

201801008051611119

■ Research Keywords and Fields
Research Keyword

computer vision

Retrieval

Signal processing

Image processing

Pylori

Gastric cancer

Medical image

X-ray

Machine learning

Deep learning

Research Field

Informatics, Perceptual information processing

Informatics, Intelligent informatics

■ Educational Organization

Bachelor's degree program, School of Engineering

Master's degree program, Graduate School of Information Science and Technology

Doctoral (PhD) degree program, Graduate School of Information Science and Technology

Research activity information

■ Awards

Aug. 2024, 構造工学委員会 AI・データサイエンス論文集編集小委員会, デジタルツイン・DX奨励賞

Jul. 2024, IEEE International Conference on Consumer Electronics – Taiwan (ICCE-TW), The 2024 IEEE ICCE-TW Best Paper Award
Discriminator-enhanced music generation based on multitrack music transformer

2024, The 2023 IEEE Sapporo Section Encouragement Award, The 2023 IEEE Sapporo Section Encouragement Award
Guang Li et al., ICASSP, 2022

2023, International Workshop on Advanced Image Technology Best Paper Award, Best Paper Award
Teruhisa Yamashiro

2023, 電気・情報関係学会北海道支部連合大会, The 2022 IEEE Sapporo Section Student Paper Contest Encouraging Prize
河合他

2023, IEEE Sapporo Section, The 2022 IEEE Sapporo Section Encouragement Award
Nao Nakagawa et al.

2023, IEEE Sapporo Section, The 2023 IEEE Sapporo Young Professionals Best Researcher Award

2023, IEEE GCCE, Excellent Paper Award Silver Prize
Haruka Matsuda

2023, 電気・情報関係学会北海道支部連合大会, 若手優秀論文発表賞
松田他

2023, 電気・情報関係学会北海道支部連合大会, 若手優秀論文発表賞
佐藤他

2023, 電気・情報関係学会北海道支部連合大会, 若手優秀論文発表賞
藤後太郎

2023, 映像情報メディア学会, 映像情報メディア学会優秀研究発表賞
朱赫他

2023, 構造工学委員会 AI・データサイエンス論文集編集小委員会, AI・データサイエンス奨励賞
櫻井他

2023, 構造工学委員会 AI・データサイエンス論文集編集小委員会, AI・データサイエンス奨励賞
諸戸他

2023, 電気・情報関係学会北海道支部連合大会, The 2023 IEEE Sapporo Section Student Paper Contest Encouraging Prize
Masaki Kashiwagi et al.

Nov. 2022, IEEE GCCE2021 Excellent Student Poster Award Silver Prize

Oct. 2022, Silver Prize IEEE GCCE 2021 Excellent Poster Award

Sep. 2022, 土木情報学システム開発賞

Aug. 2022, The 2021 IEEE Sapporo Section Encouragement Award

Mar. 2022, IEEE LifeTech 2022 WIE Excellent Poster Award

2022, Best Paper Award, International Workshop on Advanced Image Technology (K. Sakurai et al., IWAIT)

2021, 令和3年度電気・情報関係学会北海道支部連合大会若手優秀論文発表賞 (櫻井他, 電気・情報関係学会北海道支部連合大会)

2021, 令和3年度電気・情報関係学会北海道支部連合大会若手優秀論文発表賞 (吉田他, 電気・情報関係学会北海道支部連合大会)

2021, 2nd Prize, IEEE LifeTech 2021 Excellent Student Paper Award for Oral Presentation (S. Takada et al., IEEE Lifetech)

2020, ACM Multimedia Asia 2020 Best Paper Runner-up Award (R. Yanagi et al., ACMM Asia)

2020, Gold Prize GCCE2020 Excellent Poster Award (S. Takada et al., GCCE2020)

2020, Gold Prize GCCE2020 Excellent Student Paper Award (G. Li et al., GCCE2020)

2020, Gold Prize GCCE2020 Excellent Demo! Award (N. Nakagawa et al., GCCE2020)

2020, 映像情報メディア学会丹羽高柳賞論文賞 (T. Ogawa et al., MTA)

2019, Silver Prize IEEE GCCE 2019 Excellent Poster Award
Megumi Kotera

2019, Outstanding Prize IEEE GCCE2019 Excellent Demo! Award
Rintaro Yanagi

2017, 電子情報通信学会北海道支部学生奨励賞
藤後廉

2015, 平成27年度電気・情報関係学会北海道支部連合大会優秀論文発表賞
藤後廉

■ Papers

Leveraging Attack Non-Transferability to Boost Adversarial Robustness for Foundation Models
Koshiro Toishi; Keisuke Maeda; Ren Togo; Takahiro Ogawa; Miki Haseyama
Applied Sciences, 17 Apr. 2026
Scientific journal

Personalized Longitudinal Medical Report Generation via Temporally-Aware Federated Adaptation.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Kenji Hirata; Minghui Tang; Takaaki Yoshimura; Hiroyuki Sugimori; Noriko Nishioka; Yukie Shimizu; Kohsuke Kudo; Miki Haseyama
CoRR, abs/2602.19668, Feb. 2026
Scientific journal

L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts.
Minghao Yang; Ren Togo; Guang Li 0008; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2601.21349, Jan. 2026
Scientific journal

Foreground-Aware Dataset Distillation via Dynamic Patch Selection.
Longzhen Li; Guang Li 0008; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2601.02727, Jan. 2026
Scientific journal

Personalized federated learning for medical vision-language models via efficient fine-tuning and uncertainty-aware disentanglement.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
J. Biomed. Informatics, 178, 105014, 105014, 2026
Scientific journal

Dual-model weight selection and self-knowledge distillation for medical image classification.
Ayaka Tsutsumi; Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Satoshi Kondo; Miki Haseyama
Comput. Biol. Medicine, 204, 111510, 111510, 2026
Scientific journal

Deep Generative Replay-Based Personalization With Conditional Latent Attention for Diffusion Models
Haruka Matsuda; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
IEEE Access, 14, 10568, 10578, 2026
Scientific journal

Generalizing Stylized Motion Generation Method by Introducing Metadata-Independent Learning and Unified Multiple Motion Dataset
Yuki Era; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
IEEE Transactions on Multimedia, 28, 1884, 1893, 2026
Scientific journal

Privacy-Aware Continual Self-Supervised Learning on Multi-Window Chest Computed Tomography for Domain-Shift Robustness.
Ren Tasai; Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Kenji Hirata; Minghui Tang; Takaaki Yoshimura; Hiroyuki Sugimori; Noriko Nishioka; Yukie Shimizu; Kohsuke Kudo; Miki Haseyama
CoRR, abs/2510.27213, Oct. 2025
Scientific journal

Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning.
Minghao Yang; Ren Togo; Guang Li 0008; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2510.00570, Oct. 2025
Scientific journal

Cross-domain multi-step thinking: Zero-shot fine-grained traffic sign recognition in the wild
Yaozong Gan; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Knowledge-Based Systems, 327, 114172, 114172, Oct. 2025
Scientific journal

Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation.
Yuiko Uchida; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2509.09143, Sep. 2025
Scientific journal

GeoJapan Fusion Framework: A Large Multimodal Model for Regional Remote Sensing Recognition
Yaozong Gan; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Remote Sensing, 01 Sep. 2025
Scientific journal

Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification.
Ayaka Tsutsumi; Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Satoshi Kondo; Miki Haseyama
CoRR, abs/2508.20461, Aug. 2025
Scientific journal

Analysis of Model Merging Methods for Continual Updating of Foundation Models in Distributed Data Settings
Kenta Kubota; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Applied Sciences, 07 May 2025
Scientific journal

Enhancing Adversarial Defense via Brain Activity Integration Without Adversarial Examples
Tasuku Nakajima; Keisuke Maeda; Ren Togo; Takahiro Ogawa; Miki Haseyama
Sensors, 25 Apr. 2025
Scientific journal

Generative Dataset Distillation Based on Self-knowledge Distillation
Longzhen Li; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), abs/2501.04202, 1, 5, IEEE, 06 Apr. 2025
International conference proceedings

Continual Self-supervised Learning Considering Medical Domain Knowledge in Chest CT Images
Ren Tasai; Guang Li; Ren Togo; Minghui Tang; Takaaki Yoshimura; Hiroyuki Sugimori; Kenji Hirata; Takahiro Ogawa; Kohsuke Kudo; Miki Haseyama
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), abs/2501.04217, 1, 5, IEEE, 06 Apr. 2025
International conference proceedings

ConcVAE: Conceptual Representation Learning
Ren Togo; Nao Nakagawa; Takahiro Ogawa; Miki Haseyama
IEEE Transactions on Neural Networks and Learning Systems, 36, 4, 7529, 7541, Apr. 2025
Scientific journal

Multistage deep learning for classification of Helicobacter pylori infection status using endoscopic images
Guang Li; Ren Togo; Katsuhiro Mabe; Shunpei Nishida; Yoshihiro Tomoda; Fumiyuki Shiratani; Masashi Hirota; Takahiro Ogawa; Miki Haseyama
Journal of Gastroenterology, 60, 4, 408, 415, Springer Science and Business Media LLC, 15 Jan. 2025
Scientific journal

MLLM-guided Training-free Spherical Panorama Generation from a Single Image.
Yuki Katayama; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
SIGGRAPH Asia Posters, 54, 2, 2025
International conference proceedings

Lost in the Interface: How Structured UI Complexity Challenges Large Vision Language Models in Games.
Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
SIGGRAPH Asia Technical Communications, 8, 4, 2025
International conference proceedings

Improving Robustness of CLIP by Adversarial Training Enhanced by Brain Activity
Tasuku Nakajima; Keisuke Maeda; Ren Togo; Takahiro Ogawa; Miki Haseyama
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2025, 13510, 2025
English, International conference proceedings

Learning Hierarchical Video-Text Relationship via Large Language Model for Cross-modal Video Retrieval
Huaying Zhang; Ren Togo; Takahiro Ogawa; Miki Haseyama
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2025, 13510, 2025
English, International conference proceedings

Balancing Generalization and Personalization by Sharing Layers in Clustered Federated Learning
Kenta Kubota; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2025, 13510, 2025
English, International conference proceedings

Enhanced Framework for Generating Counterfactual Images with Sophisticated Caption and Inversion-free Image Editing
Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2025, 13510, 2025
English, International conference proceedings

Exponential Dissimilarity-Dispersion Family for Domain-Specific Representation Learning
Ren Togo; Nao Nakagawa; Takahiro Ogawa; Miki Haseyama
IEEE Transactions on Image Processing, 34, 6110, 6125, 2025, [Peer-reviewed], [Lead author]
English, Scientific journal

Extending Gaussian Splatting to Audio: Optimizing Audio Points for Novel-View Acoustic Synthesis.
Masaki Yoshida; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
VR Workshops, 1412, 1413, IEEE, 2025
International conference proceedings

Robust Adversarial Defense Based on Non-Transferability of Attack Across Foundation Models.
Koshiro Toishi; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 2024, 1, 5, IEEE, 2025
International conference proceedings

Generative Dataset Distillation Based on Self-knowledge Distillation.
Longzhen Li; Guang Li 0008; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 1, 5, IEEE, 2025
International conference proceedings

Gradient-Oriented Clustered Federated Learning With Efficient Knowledge Sharing in Non-IID Settings.
Kenta Kubota; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 1, 5, IEEE, 2025
International conference proceedings

AMDIS: Amplitude Dissimilarity Reduced Reference IQA Metric for Neural Radiance Field.
Ren Togo; Rintaro Yanagi; Masato Kawai; Takahiro Ogawa 0001; Miki Haseyama
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 108, 7, 897, 905, 2025
Scientific journal

Linear Structure Analysis of Embeddings for Bias Disparity Reduction in Collaborative Filtering
Hiroki Okamura; Keisuke Maeda; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Transactions on Services Computing, 18, 4, 2201, 2211, 2025
Scientific journal

LLM is Knowledge Graph Reasoner: LLM’s Intuition-Aware Knowledge Graph Reasoning for Cold-Start Sequential Recommendation
Keigo Sakurai; Ren Togo; Takahiro Ogawa; Miki Haseyama
ECIR (2), abs/2412.12464, 263, 278, Springer, 2025
International conference proceedings

Enhancing Classification Models With Sophisticated Counterfactual Images
Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
IEEE Open Journal of Signal Processing, 2025
Scientific journal

Lung Disease Classification with Limited Training Data Based on Weight Selection Technique
Ayaka Tsutsumi; Guang Li; Ren Togo; Takahiro Ogawa; Satoshi Kondo; Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE), 460, 461, IEEE, 29 Oct. 2024
International conference proceedings

Generative Dataset Distillation Based on Large Model Pool
Longzhen Li; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE), 458, 459, IEEE, 29 Oct. 2024
International conference proceedings

Lung Cancer Classification Using Masked Autoencoder Pretrained on J-MID Database
Ren Tasai; Guang Li; Ren Togo; Minghui Tang; Takaaki Yoshimura; Hiroyuki Sugimori; Kenji Hirata; Takahiro Ogawa; Kohsuke Kudo; Miki Haseyama
2024 IEEE 13th Global Conference on Consumer Electronics (GCCE), 456, 457, IEEE, 29 Oct. 2024
International conference proceedings

Multimodal Transformer Model Using Time-Series Data to Classify Winter Road Surface Conditions.
Yuya Moroto; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Sensors, 24, 11, 3440, 3440, Jun. 2024
Scientific journal

Algal Bed Region Segmentation Based on a ViT Adapter Using Aerial Images for Estimating CO₂ Absorption Capacity
Guang Li; Ren Togo; Keisuke Maeda; Akinori Sako; Isao Yamauchi; Tetsuya Hayakawa; Shigeyuki Nakamae; Takahiro Ogawa; Miki Haseyama
Remote Sensing, 16, 10, 1742, 1742, 14 May 2024
Scientific journal

Analysis of Continual Learning Techniques for Image Generative Models with Learned Class Information Management
Taro Togo; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Sensors, 24, 10, 3087, 3087, 13 May 2024
Scientific journal

A Novel Frame-Selection Metric for Video Inpainting to Enhance Urban Feature Extraction
Yuhu Feng; Jiahuan Zhang; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Sensors, 24, 10, 3035, 3035, 10 May 2024
Scientific journal

Parameter-efficient tuning of cross-modal retrieval for a specific database via trainable textual and visual prompts.
Huaying Zhang; Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Int. J. Multim. Inf. Retr., 13, 1, 14, 14, Mar. 2024
Scientific journal

Flexibly manipulating popularity bias for tackling trade-offs in recommendation
Hiroki Okamura; Keisuke Maeda; Ren Togo; Takahiro Ogawa; Miki Haseyama
Information Processing & Management, 61, 2, 103606, 103606, Mar. 2024
Scientific journal

Text-Guided Image Editing Based on Post Score for Gaining Attention on Social Media
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Sensors, 24, 3, 921, 921, 31 Jan. 2024
Scientific journal

LLM is Knowledge Graph Reasoner: LLM's Intuition-aware Knowledge Graph Reasoning for Cold-start Sequential Recommendation.
Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2412.12464, 2024
Scientific journal

Which Client is Reliable?: A Reliable and Personalized Prompt-based Federated Learning for Medical Image Question Answering.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2410.17484, 2024
Scientific journal

Think Twice Before Recognizing: Large Multimodal Models for General Fine-grained Traffic Sign Recognition.
Yaozong Gan; Guang Li 0008; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2409.01534, 2024
Scientific journal

MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT.
Jinlong Zhu; Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2409.00919, 2024
Scientific journal

Generative Dataset Distillation Based on Diffusion Model.
Duo Su; Junjie Hou; Guang Li 0008; Ren Togo; Rui Song 0007; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2408.08610, 2024
Scientific journal

Generalizing Human Motion Style Transfer Method Based on Metadata-independent Learning.
Yuki Era; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
SIGGRAPH Asia Posters, 78, 3, 2024
International conference proceedings

An Evaluation Metric for Single Image-to-3D Models Based on Object Detection Perspective.
Yuiko Uchida; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
SIGGRAPH Asia 2024 Technical Communications, 31, 4, ACM, 2024
International conference proceedings

DQG: Database Question Generation for Exact Text-based Image Retrieval.
Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Proceedings of the 32nd ACM International Conference on Multimedia, 7424, 7433, ACM, 2024
International conference proceedings

MMT-BERT: Chord-Aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT.
Jinlong Zhu; Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ISMIR, 470, 477, 2024
International conference proceedings

Cross-Domain Few-Shot In-Context Learning For Enhancing Traffic Sign Recognition.
Yaozong Gan; Guang Li 0008; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 2564, 2570, 2024
International conference proceedings

Zero-Shot Composed Image Retrieval Considering Query-Target Relationship Leveraging Masked Image-Text Pairs.
Huaying Zhang; Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 2431, 2437, 2024
International conference proceedings

Reinforcing Pre-Trained Models Using Counterfactual Images.
Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 486, 492, 2024
International conference proceedings

Introducing Class Replacement Technique in Class Incremental Learning in Generative Models.
Taro Togo; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
International Conference on Consumer Electronics - Taiwan(ICCE-Taiwan), 457, 458, IEEE, 2024
International conference proceedings

Discriminator-enhanced Music Generation Based on Multitrack Music Transformer.
Jinlong Zhu; Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
International Conference on Consumer Electronics - Taiwan(ICCE-Taiwan), 363, 364, IEEE, 2024
International conference proceedings

Motion-STUDiO : Motion Style Transfer Utilized for Dancing Operation by Considering Both Style and Dance Features.
Yuki Era; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
International Conference on Consumer Electronics - Taiwan(ICCE-Taiwan), 127, 128, IEEE, 2024
International conference proceedings

Multimodal Adversarial Defense Trained on Features Extracted from Images and Brain Activity.
Tasuku Nakajima; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 1183, 1184, 2024
International conference proceedings

Generative Dataset Distillation Based on Diffusion Model.
Duo Su; Junjie Hou; Guang Li 0008; Ren Togo; Rui Song 0007; Takahiro Ogawa 0001; Miki Haseyama
ECCV Workshops (19), 83, 94, 2024
International conference proceedings

MLLM-based Automatic Exploration of Editing Prompt for High Engagement Image Generation.
Kenta Kubota; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 1165, 1166, 2024
International conference proceedings

An Evaluation Metric for Single Image-to-3D Models Based on a Class Confidence Score of Object Detection Models.
Yuiko Uchida; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 2024, 1163, 1164, 2024

Improving Zero-shot Adversarial Robustness via Integrating Image Features of Foundation Models.
Koshiro Toishi; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 148, 149, 2024
International conference proceedings

Emotion-conditional Image Generation Reflecting Semantic Alignment with Text-to-Image Models.
Kaede Hayakawa; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 10, 11, 2024
International conference proceedings

Structured Polyphonic Music Generation with Diffusion Transformer.
Jinlong Zhu; Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 8, 9, 2024
International conference proceedings

AUTOMATIC RECOGNITION OF ALGAL BED AREAS BASED ON A LARGE-SCALE SEMANTIC SEGMENTATION MODEL FOR ESTIMATING CO₂ ABSORPTION BY BLUE CARBON
LI Guang; TOGO Ren; MAEDA Keisuke; SAKO Akinori; YAMAUCHI Isao; HAYAKAWA Tetsuya; NAKAMAE Shigeyuki; OGAWA Takahiro; HASEYAMA Miki
Japanese Journal of JSCE, 80, 17, n/a, Japan Society of Civil Engineers, 2024
Japanese, Measuring the CO₂ absorption of algal beds is one of the key issues for achieving carbon neutrality, but identifying the area of algal beds from UAV images requires a great deal of labor and experience. In this study, we propose a method for automatic recognition of algal beds using UAV images. The proposed method uses a model that enables semantic domain segmentation at the pixel level, and employs ViT-Adapter, one of the latest models. The advantage of this technique is that it effectively utilizes the knowledge of a trained large-scale model to recognize algal beds, and it can identify algal beds at the pixel level by adjusting the parameters of the model. In this study, we conducted learning using mask images of visually identified algal beds from aerial photographs, and further examined data expansion and other processing to adapt the learning to UAV images. The effectiveness of this method was verified through a demonstration using UAV images of the Erimo coast of Hokkaido.

Generative Dataset Distillation: Balancing Global Structure and Local Details.
Longzhen Li; Guang Li 0008; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
CVPR Workshops, 7664, 7671, IEEE, 2024
International conference proceedings

Prompt-based Personalized Federated Learning for Medical Visual Question Answering.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2402.09677, 2024
Scientific journal

Importance-Aware Adaptive Dataset Distillation.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2401.15863, 2024
Scientific journal

Multi-Object Editing in Personalized Text-To-Image Diffusion Model Via Segmentation Guidance.
Haruka Matsuda; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 8140, 8144, IEEE, 2024
International conference proceedings

Caption Unification for Multi-View Lifelogging Images Based on In-Context Learning with Heterogeneous Semantic Contents.
Masaya Sato; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 8085, 8089, IEEE, 2024
International conference proceedings

Enhancing Noisy Label Learning Via Unsupervised Contrastive Loss with Label Correction Based on Prior Knowledge.
Masaki Kashiwagi; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 6235, 6239, IEEE, 2024
International conference proceedings

Dataset Distillation Using Parameter Pruning.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 107, 6, 936, 940, 2024
Scientific journal

Accident risk estimation with depth information in construction site videos
GOKA Ryota; MAEDA Keisuke; TOGO Ren; OGAWA Takahiro; HASEYAMA Miki
Proceedings of the Annual Conference of JSAI, JSAI2024, 2C6GS701, 2C6GS701, The Japanese Society for Artificial Intelligence, 2024
Japanese, In the construction industry, reducing accident risk and improving safety is one of the high-priority tasks. Recently, several methods have been proposed to estimate the contact accident risk with heavy machinery on construction sites for enhancing safety. Conventional studies based on deep learning estimate the risk by using relations within the image space of detected workers and machinery captured in construction site videos. However, these approaches focus on the distance between detected objects in the image, leading to the problem that accident risk is overestimated even when there is distance between objects in the real world. In this study, to consider 3D spatial information in videos, we propose a method for estimating the accident risk with visual features regarding depth information. Experimental results show that the proposed method performs better than existing methods in estimating the contact accident risk.

Estimation of contact accident risk based on recurrent neural network introducing spatial-temporal attention
五箇亮太; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀
AI・データサイエンス論文集(Web), 5, 1, 2024

Zero-shot high-risk situation detection based on object detection and pose estimation using fixed camera at construction site
大羽賀駿也; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀
AI・データサイエンス論文集(Web), 5, 1, 2024

Prediction of event locations from urgent call using large language models
吉田将規; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀
AI・データサイエンス論文集(Web), 5, 1, 2024

Zero-Shot Traffic Sign Recognition Based on Midlevel Feature Matching
GAN YAOZONG; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Sensors, 04 Dec. 2023
Scientific journal

Manipulation Direction: Evaluating Text-Guided Image Manipulation Based on Similarity between Changes in Image and Text Modalities
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Sensors, 20 Nov. 2023
Scientific journal

Manipulation Direction: Evaluating Text-Guided Image Manipulation Based on Similarity between Changes in Image and Text Modalities
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
Sensors, Nov. 2023
Scientific journal

Self-supervised learning for gastritis detection with gastric X-ray images.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
International Journal of Computer Assisted Radiology and Surgery, 18, 10, 1841, 1848, Oct. 2023
Scientific journal

Off-Screen Sound Separation Based on Audio-visual Pre-training Using Binaural Audio
Masaki Yoshida; Ren Togo; Takahiro Ogawa; Miki Haseyama
SENSORS, 23, 9, May 2023
English, Scientific journal

Multimodal Natural Language Explanation Generation for Visual Question Answering Based on Multiple Reference Data
He Zhu; Ren Togo; Takahiro Ogawa; Miki Haseyama
ELECTRONICS, 12, 10, May 2023
English, Scientific journal

Boosting automatic COVID-19 detection performance with self-supervised learning and batch knowledge ensembling.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Comput. Biol. Medicine, 158, 106877, 106877, May 2023
Scientific journal

COVID-19 detection based on self-supervised transfer learning using chest X-ray images.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Int. J. Comput. Assist. Radiol. Surg., 18, 4, 715, 722, Apr. 2023
Scientific journal

Diversity Learning Based on Multi-Latent Space for Medical Image Visual Question Generation.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Sensors, 23, 3, 1057, 1057, Feb. 2023
English, Scientific journal

Video-Music Retrieval with Fine-Grained Cross-Modal Alignment.
Yuki Era; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 2005, 2009, 2023
International conference proceedings

Text-Guided Facial Image Manipulation for Wild Images via Manipulation Direction-Based Loss.
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 361, 365, 2023
International conference proceedings

Text-to-image Diffusion Model Suppressing Catastrophic Forgetting via Elastic Weight Consolidation.
Haruka Matsuda; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 831, 832, 2023
International conference proceedings

Deterioration Level Estimation for Infrastructures Considering Noisy Labels via DivideMix.
Masaki Kashiwagi; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 829, 830, 2023
International conference proceedings

Novel Feature Extraction for Classification of Auditory-visual Stimuli from fNIRS Signals.
Taro Togo; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 759, 760, 2023
International conference proceedings

A Controllable Recoloring Method for Novel Views Using Segment Anything Model.
Haoyang Wang; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 587, 588, 2023
International conference proceedings

Caption Unification for Multiple Viewpoint Lifelogging Images and Its Verification.
Masaya Sato; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 415, 416, 2023
International conference proceedings

Improving Visual Counterfactual Explanation Models for Image Classification via CLIP.
Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 390, 391, 2023
International conference proceedings

Acquisition of feature representation of record data via graph neural network to support determination of deterioration levels
山本一輝; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀
AI・データサイエンス論文集(Web), 4, 3, 2023

Distress detection using egocentric videos for increasing discovery rate of novel distress during subway tunnel inspection
櫻井慶悟; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀
AI・データサイエンス論文集(Web), 4, 3, 2023

Classification of Winter Road Surface Condition Based on Multi-modal Transformer Using Sequential Data
諸戸祐哉; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀
AI・データサイエンス論文集(Web), 4, 3, 2023

TolerantGAN: Text-guided Image Manipulation Tolerant to Real-world Image
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
IEEE Open Journal of Signal Processing, 2023
Scientific journal

Source-Data-Free Cross-Domain Knowledge Transfer for Semantic Segmentation
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Open Journal of Signal Processing, 2023
Scientific journal

Personalized Content Recommender System via Non-verbal Interaction Using Face Mesh and Facial Expression.
Yuya Moroto; Rintaro Yanagi; Naoki Ogawa; Kyohei Kamikawa; Keigo Sakurai; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ACM Multimedia, 9399, 9401, 2023
International conference proceedings

Gromov-Wasserstein Autoencoders.
Nao Nakagawa; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICLR, 2023
International conference proceedings

Parameter-efficient Tuning of a Pre-trained Model via Prompt Learning in Cross-modal Retrieval.
Huaying Zhang; Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICCE-Taiwan, 811, 812, 2023
International conference proceedings

Binaural Audio Generation with Data Augmentation from 360° Videos.
Masaki Yoshida; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICCE-Taiwan, 795, 796, 2023
International conference proceedings

Estimation of Amyloid-β Positivity Using QSM Images Considering Age Information.
Tsubasa Kunieda; Ren Togo; Noriko Nishioka; Yukie Shimizu; Shiro Watanabe; Kenji Hirata; Keisuke Maeda; Takahiro Ogawa 0001; Kohsuke Kudo; Miki Haseyama
ICCE-Taiwan, 165, 166, 2023
International conference proceedings

A Medical Domain Visual Question Generation Model via Large Language Model.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICCE-Taiwan, 163, 164, 2023
International conference proceedings

Binauralization Robust To Camera Rotation Using 360° Videos.
Masaki Yoshida; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 1, 5, 2023
International conference proceedings

Estimation of Visual Contents from Human Brain Signals via VQA Based on Brain-Specific Attention.
Ryo Shichida; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 1, 5, 2023
International conference proceedings

Improving Dropout in Graph Convolutional Networks for Recommendation via Contrastive Loss.
Hiroki Okamura; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 1, 5, 2023
International conference proceedings

Material Compound-Property Retrieval Using Electron Microscope Images for Rubber Material Development.
Rintaro Yanagi; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
IEEE Access, 11, 88258, 88264, 2023
Scientific journal

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 4, 1, 11, 2023
English, Scientific journal

Interpretable Visual Question Answering Referring to Outside Knowledge.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2303.04388, 2023
We present a novel multimodal interpretable VQA model that can answer the
question more accurately and generate diverse explanations. Although
researchers have proposed several methods that can generate human-readable and
fine-grained natural language sentences to explain a model's decision, these
methods have focused solely on the information in the image. Ideally, the model
should refer to various information inside and outside the image to correctly
generate explanations, just as we use background knowledge daily. The proposed
method incorporates information from outside knowledge and multiple image
captions to increase the diversity of information available to the model. The
contribution of this paper is to construct an interpretable visual question
answering model using multimodal inputs to improve the rationality of generated
results. Experimental results show that our model can outperform
state-of-the-art methods regarding answer accuracy and explanation rationality.

Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance.
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
IEEE Access, 11, 42534, 42545, 2023
English, Scientific journal

Cross-Modal Image Retrieval Considering Semantic Relationships With Many-to-Many Correspondence Loss.
Huaying Zhang; Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
IEEE Access, 11, 10675, 10686, 2023
English, Scientific journal

Similar interior coordination image retrieval with multi-view features
Ren Togo; Yuki Honma; Maiku Abe; Takahiro Ogawa; Miki Haseyama
International Journal of Multimedia Information Retrieval, 11, 4, 731, 740, 26 Aug. 2022
English, Scientific journal

Defect Detection of Subway Tunnels Using Advanced U-Net Network
An Wang; Ren Togo; Takahiro Ogawa; Miki Haseyama
SENSORS, 22, 6, 2330, 2330, Mar. 2022
English, Scientific journal

Boosting Automatic COVID-19 Detection Performance with Self-Supervised Learning and Batch Knowledge Ensembling.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2212.09281, 2022
Problem: Detecting COVID-19 from chest X-Ray (CXR) images has become one of
the fastest and easiest methods for detecting COVID-19. However, the existing
methods usually use supervised transfer learning from natural images as a
pretraining process. These methods do not consider the unique features of
COVID-19 and the similar features between COVID-19 and other pneumonia. Aim: In
this paper, we want to design a novel high-accuracy COVID-19 detection method
that uses CXR images, which can consider the unique features of COVID-19 and
the similar features between COVID-19 and other pneumonia. Methods: Our method
consists of two phases. One is self-supervised learning-based pertaining; the
other is batch knowledge ensembling-based fine-tuning. Self-supervised
learning-based pretraining can learn distinguished representations from CXR
images without manually annotated labels. On the other hand, batch knowledge
ensembling-based fine-tuning can utilize category knowledge of images in a
batch according to their visual feature similarities to improve detection
performance. Unlike our previous implementation, we introduce batch knowledge
ensembling into the fine-tuning phase, reducing the memory used in
self-supervised learning and improving COVID-19 detection accuracy. Results: On
two public COVID-19 CXR datasets, namely, a large dataset and an unbalanced
dataset, our method exhibited promising COVID-19 detection performance. Our
method maintains high detection accuracy even when annotated CXR training
images are reduced significantly (e.g., using only 10% of the original
dataset). In addition, our method is insensitive to changes in hyperparameters.

COVID-19 Detection Based on Self-Supervised Transfer Learning Using Chest X-Ray Images.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2212.09276, 2022
Purpose: Considering several patients screened due to COVID-19 pandemic,
computer-aided detection has strong potential in assisting clinical workflow
efficiency and reducing the incidence of infections among radiologists and
healthcare providers. Since many confirmed COVID-19 cases present radiological
findings of pneumonia, radiologic examinations can be useful for fast
detection. Therefore, chest radiography can be used to fast screen COVID-19
during the patient triage, thereby determining the priority of patient's care
to help saturated medical facilities in a pandemic situation. Methods: In this
paper, we propose a new learning scheme called self-supervised transfer
learning for detecting COVID-19 from chest X-ray (CXR) images. We compared six
self-supervised learning (SSL) methods (Cross, BYOL, SimSiam, SimCLR,
PIRL-jigsaw, and PIRL-rotation) with the proposed method. Additionally, we
compared six pretrained DCNNs (ResNet18, ResNet50, ResNet101, CheXNet,
DenseNet201, and InceptionV3) with the proposed method. We provide quantitative
evaluation on the largest open COVID-19 CXR dataset and qualitative results for
visual inspection. Results: Our method achieved a harmonic mean (HM) score of
0.985, AUC of 0.999, and four-class accuracy of 0.953. We also used the
visualization technique Grad-CAM++ to generate visual explanations of different
classes of CXR images with the proposed method to increase the
interpretability. Conclusions: Our method shows that the knowledge learned from
natural images using transfer learning is beneficial for SSL of the CXR images
and boosts the performance of representation learning for COVID-19 detection.
Our method promises to reduce the incidence of infections among radiologists
and healthcare providers.

Union-set Multi-source Model Adaptation for Semantic Segmentation.
Zongyao Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2212.02785, 2022
Scientific journal

RGMIM: Region-Guided Masked Image Modeling for COVID-19 Detection.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2211.00313, 2022
Scientific journal

Rubber Material Retrieval System using Electron Microscope Images for Rubber Material Development.
Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
MMAsia, 44, 3, ACM, 2022
International conference proceedings

Disentangled Image Attribute Editing in Latent Space via Mask-Based Retention Loss.
Shunya Ohaga; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
MMAsia, 25, 7, ACM, 2022
International conference proceedings

Assessment of Image Manipulation Using Natural Language Description: Quantification of Manipulation Direction.
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 1046, 1050, IEEE, 2022
International conference proceedings

Improving Model Adaptation for Semantic Segmentation by Learning Model-Invariant Features with Multiple Source-Domain Models.
Zongyao Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 421, 425, IEEE, 2022
International conference proceedings

A Multimodal Interpretable Visual Question Answering Model Introducing Image Caption Processor.
He Zhu; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 777, 778, IEEE, 2022
International conference proceedings

Cross-modal Image Retrieval Considering Semantic Relationships with Object Information.
Huaying Zhang; Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 775, 776, IEEE, 2022
International conference proceedings

Free-viewpoint Sports Video Generation Based on Dynamic NeRF Considering Time Series.
Masato Kawai; Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 408, 409, IEEE, 2022
International conference proceedings

Content-based Image Retrieval Using Effective Synthesized Images from Different Camera Views via pixelNeRF.
Yuki Era; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 404, 405, IEEE, 2022
International conference proceedings

Analysis of Relationships between Visual Cognitive Contents and Response of Each Brain Region via Visual Question Answering.
Ryo Shichida; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 402, 403, IEEE, 2022
International conference proceedings

GCN-based Collaborative Filtering Considering Personality Bias.
Hiroki Okamura; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 278, 279, IEEE, 2022
International conference proceedings

Prediction of Amyloid-β Positivity Using QSM Images Based on Bootstrap Your Own Latent.
Tsubasa Kunieda; Ren Togo; Noriko Nishioka; Yukie Shimizu; Shiro Watanabe; Kenji Hirata; Keisuke Maeda; Takahiro Ogawa 0001; Kohsuke Kudo; Miki Haseyama
GCCE, 137, 138, IEEE, 2022
International conference proceedings

Cross-platform Recommendation Considering Common Users' Preferences Based on Preference Propagation GraphNet.
Kazuki Yamamoto; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 135, 136, IEEE, 2022
International conference proceedings

Union-Set Multi-source Model Adaptation for Semantic Segmentation.
Zongyao Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ECCV (29), 579, 595, Springer, 2022
This paper solves a generalized version of the problem of multi-source model
adaptation for semantic segmentation. Model adaptation is proposed as a new
domain adaptation problem which requires access to a pre-trained model instead
of data for the source domain. A general multi-source setting of model
adaptation assumes strictly that each source domain shares a common label space
with the target domain. As a relaxation, we allow the label space of each
source domain to be a subset of that of the target domain and require the union
of the source-domain label spaces to be equal to the target-domain label space.
For the new setting named union-set multi-source model adaptation, we propose a
method with a novel learning strategy named model-invariant feature learning,
which takes full advantage of the diverse characteristics of the source-domain
models, thereby improving the generalization in the target domain. We conduct
extensive experiments in various adaptation settings to show the superiority of
our method. The code is available at
https://github.com/lzy7976/union-set-model-adaptation.

Trial Analysis of the Relationship between Taste and Biological Information Obtained While Eating Strawberries for Sensory Evaluation.
Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Shin-ichi Adachi; Fumiaki Yoshizawa; Miki Haseyama
Sensors, 22, 23, 9496, 9496, 2022
Scientific journal

Distress Detection in Subway Tunnel Images via Data Augmentation Based on Selective Image Cropping and Patching.
Keisuke Maeda; Saya Takada; Tomoki Haruyama; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Sensors, 22, 22, 8932, 8932, 2022
Scientific journal

Compressed gastric image generation based on soft-label dataset distillation for medical data sharing.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Comput. Methods Programs Biomed., 227, 107189, 107189, 2022
Background and objective: Sharing of medical data is required to enable the
cross-agency flow of healthcare information and construct high-accuracy
computer-aided diagnosis systems. However, the large sizes of medical datasets,
the massive amount of memory of saved deep convolutional neural network (DCNN)
models, and patients' privacy protection are problems that can lead to
inefficient medical data sharing. Therefore, this study proposes a novel
soft-label dataset distillation method for medical data sharing.
Methods: The proposed method distills valid information of medical image data
and generates several compressed images with different data distributions for
anonymous medical data sharing. Furthermore, our method can extract essential
weights of DCNN models to reduce the memory required to save trained models for
efficient medical data sharing.
Results: The proposed method can compress tens of thousands of images into
several soft-label images and reduce the size of a trained model to a few
hundredths of its original size. The compressed images obtained after
distillation have been visually anonymized; therefore, they do not contain the
private information of the patients. Furthermore, we can realize high-detection
performance with a small number of compressed images.
Conclusions: The experimental results show that the proposed method can
improve the efficiency and security of medical data sharing.

Dataset Complexity Assessment Based on Cumulative Maximum Scaled Area Under Laplacian Spectrum.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2209.14743, 2022
Scientific journal

Dataset Distillation using Parameter Pruning.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2209.14609, 2022
Scientific journal, The acquisition of advanced models relies on large datasets in many fields,
which makes storing datasets and training models expensive. As a solution,
dataset distillation can synthesize a small dataset such that models trained on
it achieve high performance on par with the original large dataset. The
recently proposed dataset distillation method by matching network parameters
has been proved effective for several datasets. However, a few parameters in
the distillation process are difficult to match, which harms the distillation
performance. Based on this observation, this paper proposes a new method to
solve the problem using parameter pruning. The proposed method can synthesize
more robust distilled datasets and improve the distillation performance by
pruning difficult-to-match parameters in the distillation process. Experimental
results on three datasets show that the proposed method outperformed other SOTA
dataset distillation methods.

Dataset Distillation for Medical Dataset Sharing.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2209.14603, 2022
Scientific journal, Sharing medical datasets between hospitals is challenging because of the
privacy-protection problem and the massive cost of transmitting and storing
many high-resolution medical images. However, dataset distillation can
synthesize a small dataset such that models trained on it achieve comparable
performance with the original large dataset, which shows potential for solving
the existing medical sharing problems. Hence, this paper proposes a novel
dataset distillation-based method for medical dataset sharing. Experimental
results on a COVID-19 chest X-ray image dataset show that our method can
achieve high detection performance even using scarce anonymized chest X-ray
images.

Gromov-Wasserstein Autoencoders.
Nao Nakagawa; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2209.07007, 2022
Scientific journal, Learning concise data representations without supervisory signals is a
fundamental challenge in machine learning. A prominent approach to this goal is
likelihood-based models such as variational autoencoders (VAE) to learn latent
representations based on a meta-prior, which is a general premise assumed
beneficial for downstream tasks (e.g., disentanglement). However, such
approaches often deviate from the original likelihood architecture to apply the
introduced meta-prior, causing undesirable changes in their training. In this
paper, we propose a novel representation learning method, Gromov-Wasserstein
Autoencoders (GWAE), which directly matches the latent and data distributions.
Instead of a likelihood-based objective, GWAE models have a trainable prior
optimized by minimizing the Gromov-Wasserstein (GW) metric. The GW metric
measures the distance structure-oriented discrepancy between distributions
supported on incomparable spaces, e.g., with different dimensionalities. By
restricting the family of the trainable prior, we can introduce meta-priors to
control latent representations for downstream tasks. The empirical comparison
with the existing VAE-based methods shows that GWAE models can learn
representations based on meta-priors by changing the prior family without
further modifying the GW objective.

TriBYOL: Triplet BYOL for Self-Supervised Representation Learning.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2206.03012, 2022
Scientific journal

Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray Images.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
CoRR, abs/2206.03009, 2022
Scientific journal

Generating Captions of Imagined content from Human Brain Activities Applying An Image Captioning Model.
Saya Takada; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
4th IEEE Global Conference on Life Sciences and Technologies(LifeTech), 614, 615, IEEE, 2022
International conference proceedings

Knowledge-Guided Sequential Recommendation with Reinforcement Learning Using Empirical Distribution Function.
Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
4th IEEE Global Conference on Life Sciences and Technologies(LifeTech), 187, 188, IEEE, 2022
International conference proceedings

Transformer Based Multimodal Scene Recognition in Soccer Videos.
Yaozong Gan; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICME Workshops, 1, 6, IEEE, 2022
International conference proceedings

Scene Retrieval in Soccer Videos by Spatial-temporal Attention with Video Vision Transformer.
Yaozong Gan; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICCE-TW, 453, 454, IEEE, 2022
International conference proceedings

Multi-scale Defect Detection from Subway Tunnel Images with Spatial Attention Mechanism.
An Wang; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICCE-TW, 305, 306, IEEE, 2022
International conference proceedings

Action Classification Based on LSTM Using First and Third Person Videos of Engineers Inspecting Bridges.
Tsuyoshi Masuda; Keisuke Maeda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICCE-TW, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 303, 304, IEEE, 2022

TriBYOL: Triplet BYOL for Self-Supervised Representation Learning.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 3458, 3462, IEEE, 2022
This paper proposes a novel self-supervised learning method for learning
better representations with small batch sizes. Many self-supervised learning
methods based on certain forms of the siamese network have emerged and received
significant attention. However, these methods need to use large batch sizes to
learn good representations and require heavy computational resources. We
present a new triplet network combined with a triple-view loss to improve the
performance of self-supervised representation learning with small batch sizes.
Experimental results show that our method can drastically outperform
state-of-the-art self-supervised learning methods on several datasets in
small-batch cases. Our method provides a feasible solution for self-supervised
learning with real-world high-resolution images that uses small batch sizes.

Divergence-Guided Feature Alignment for Cross-Domain Object Detection.
Zongyao Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 2240, 2244, IEEE, 2022
International conference proceedings

Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray Images.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 1371, 1375, IEEE, 2022
The global outbreak of the Coronavirus 2019 (COVID-19) has overloaded
worldwide healthcare systems. Computer-aided diagnosis for COVID-19 fast
detection and patient triage is becoming critical. This paper proposes a novel
self-knowledge distillation based self-supervised learning method for COVID-19
detection from chest X-ray images. Our method can use self-knowledge of images
based on similarities of their visual features for self-supervised learning.
Experimental results show that our method achieved an HM score of 0.988, an AUC
of 0.999, and an accuracy of 0.957 on the largest open COVID-19 chest X-ray
dataset.

Controllable Music Playlist Generation Based on Knowledge Graph and Reinforcement Learning.
Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Sensors, 22, 10, 3722, 3722, 2022
Scientific journal

Learning intra-domain style-invariant representation for unsupervised domain adaptation of semantic segmentation.
Zongyao Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Pattern Recognit., 132, 108911, 108911, 2022
Scientific journal

Dataset complexity assessment based on cumulative maximum scaled area under Laplacian spectrum.
Guang Li 0008; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
Multimedia Tools and Applications, 81, 22, 32287, 32303, 2022
Scientific journal, Dataset complexity assessment aims to predict classification performance on a
dataset with complexity calculation before training a classifier, which can
also be used for classifier selection and dataset reduction. The training
process of deep convolutional neural networks (DCNNs) is iterative and
time-consuming because of hyperparameter uncertainty and the domain shift
introduced by different datasets. Hence, it is meaningful to predict
classification performance by assessing the complexity of datasets effectively
before training DCNN models. This paper proposes a novel method called
cumulative maximum scaled Area Under Laplacian Spectrum (cmsAULS), which can
achieve state-of-the-art complexity assessment performance on six datasets.

Generative Adversarial Network Including Referring Image Segmentation For Text-Guided Image Manipulation.
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICASSP, 4818, 4822, 2022
English, International conference proceedings

Interactive Re-ranking via Object Entropy-Guided Question Answering for Cross-Modal Image Retrieval.
Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
ACM Trans. Multim. Comput. Commun. Appl., 18, 3, 68, 17, 2022
English, Scientific journal

Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, E104A, 6, 866, 875, Jun. 2021
English, Scientific journal

Rubber Material Property Prediction Using Electron Microscope Images of Internal Structures Taken under Multiple Conditions
Ren Togo; Naoki Saito; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama
SENSORS, 21, 6, 2088, 2088, Mar. 2021
English, Scientific journal

Database-adaptive Re-ranking for Enhancing Cross-modal Image Retrieval.
Rintaro Yanagi; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
MM '21: ACM Multimedia Conference, 3816, 3825, ACM, 2021
International conference proceedings

Interpretable Representation Learning on Natural Image Datasets via Reconstruction in Visual-Semantic Embedding Space.
Nao Nakagawa; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
2021 IEEE International Conference on Image Processing(ICIP), 2473, 2477, IEEE, 2021
International conference proceedings

Segmentation-Aware Text-Guided Image Manipulation.
Tomoki Haruyama; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
ICIP, 2433, 2437, IEEE, 2021
International conference proceedings

User Background Information-Aware Music Recommendation with Reinforcement Learning-Based Knowledge Graph Exploration.
Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
IEEE International Conference on Consumer Electronics-Taiwan(ICCE-TW), 1, 2, IEEE, 2021
International conference proceedings

Cross-view Self-supervised Learning via Momentum Statistics in Batch Normalization.
Guang Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
IEEE International Conference on Consumer Electronics-Taiwan(ICCE-TW), 1, 2, IEEE, 2021
International conference proceedings

Triplet Self-Supervised Learning for Gastritis Detection with Scarce Annotations.
Guang Li; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 787, 788, IEEE, 2021
International conference proceedings

Text-Guided Image Manipulation for Desired Region Using Referring Image Segmentation.
Yuto Watanabe; Ren Togo; Keisuke Maeda; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 661, 662, IEEE, 2021
International conference proceedings

Listener Recommendation for Artist Based on Knowledge Graph and Reinforcement Learning.
Keigo Sakurai; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 202, 203, IEEE, 2021
International conference proceedings

Detection of Off-screen Sound Based on Loss Function of Self-supervised Audio-visual Spatialization.
Masaki Yoshida; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 193, 194, IEEE, 2021
International conference proceedings

Multi-class Similar Scene Retrieval in Soccer Videos: A Scene Confusion Reduction Method Based on Combination of Long and Short Frame Sequences.
Yaozong Gan; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 117, 118, IEEE, 2021
International conference proceedings

Sports Action Detection Based on Self-Supervised Feature Learning and Object Detection.
Tsuyoshi Masuda; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 54, 55, IEEE, 2021
International conference proceedings

Estimating Imagined Images from Brain Activities via Visual Question Answering.
Saya Takada; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 35, 36, IEEE, 2021
International conference proceedings

Action Classification from Egocentric Videos Using Reinforcement Learning-based Pose Estimation.
Shunya Ohaga; Ren Togo; Takahiro Ogawa 0001; Miki Haseyama
GCCE, 9, 10, IEEE, 2021
International conference proceedings

Interest estimation method based on 2D pose features on security camera
Yuki Honma; Ren Togo; Maiku Abe; Takahiro Ogawa; Miki Haseyama
Proceedings of SPIE - The International Society for Optical Engineering, 11766, 2021
International conference proceedings

SEMANTIC-AWARE UNPAIRED IMAGE-TO-IMAGE TRANSLATION FOR URBAN SCENE IMAGES
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2150, 2154, 2021
English, International conference proceedings

A Note on Detection of Sports Action Based on Temporal Cycle Consistency Learning
Tsuyoshi Masuda; Ren Togo; Takahiro Ogawa; Miki Haseyama
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 11766, 2021
English, International conference proceedings

Disentangled Representation Learning in Real-World Image Datasets via Image Segmentation Prior
Nao Nakagawa; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 9, 110880, 110888, 2021
English, Scientific journal

Interior Coordination Image Retrieval with Object-Detection-Based and Color Features
Ren Togo; Takahiro Ogawa; Miki Haseyama
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 11766, 2021
English, International conference proceedings

IR Questioner: QA-based Interactive Retrieval System
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 611, 614, 2021
English, International conference proceedings

Text-Guided Style Transfer-Based Image Manipulation Using Multimodal Generative Models
Ren Togo; Megumi Kotera; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 9, 64860, 64870, 2021
English, Scientific journal

Self-Supervised Learning for Gastritis Detection with Gastric X-Ray Images.
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
CoRR, abs/2104.02864, 2021
Purpose: It is time-consuming and expensive for doctors to annotate gastric
X-ray images for gastritis detection manually. This paper proposes a
self-supervised learning method to solve this problem. This study aims to
verify the effectiveness of the proposed self-supervised learning method in
gastritis detection using a few annotated gastric X-ray images. Methods: In
this paper, we propose a novel method that can perform explicit self-supervised
learning and learn discriminative representations from gastric X-ray images.
Models trained with the proposed method were fine-tuned on datasets with a few
annotated gastric X-ray images. Five self-supervised learning methods, i.e.,
SimSiam, BYOL, PIRL-jigsaw, PIRL-rotation, and SimCLR, were compared with the
proposed method. Furthermore, three previous methods, one pretrained on
ImageNet, one trained from scratch, and one semi-supervised learning method,
were compared with the proposed method. Results: The proposed method$'$s
harmonic mean score of sensitivity and specificity after fine-tuning with the
annotated data of 10, 20, 30, and 40 patients were 0.875, 0.911, 0.915, and
0.931, respectively. The proposed method outperformed all comparative methods,
including the five self-supervised learning and three previous methods.
Experimental results showed the effectiveness of the proposed method in
gastritis detection with a few annotated gastric X-ray images. Conclusions: The
proposed self-supervised learning method shows potential clinical use in
gastritis detection using a few annotated gastric X-ray images.

Soft-Label Anonymous Gastric X-ray Image Distillation.
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
CoRR, abs/2104.02857, 2021
Scientific journal

Music Playlist Generation Based on Graph Exploration Using Reinforcement Learning.
Keigo Sakurai; Ren Togo; Takahiro Ogawa; Miki Haseyama
3rd IEEE Global Conference on Life Sciences and Technologies(LifeTech), 2021, 53, 54, IEEE, 2021

Question Answering from Brain Activity Data via Decoder Based on Neural Networks.
Saya Takada; Ren Togo; Takahiro Ogawa; Miki Haseyama
3rd IEEE Global Conference on Life Sciences and Technologies(LifeTech), 51, 52, IEEE, 2021
International conference proceedings

Preliminary study of AI-assisted diagnosis using FDG-PET/CT for axillary lymph node metastasis in patients with breast cancer
Zongyao Li; Kazuhiro Kitajima; Kenji Hirata; Ren Togo; Junki Takenaka; Yasuo Miyoshi; Kohsuke Kudo; Takahiro Ogawa; Miki Haseyama
EJNMMI RESEARCH, 11, 1, Jan. 2021, [Peer-reviewed]
English, Scientific journal

Deep convolutional neural network-based anomaly detection for organ classification in gastric X-ray examination
Ren Togo; Haruna Watanabe; Takahiro Ogawa; Miki Haseyama
COMPUTERS IN BIOLOGY AND MEDICINE, 123, Aug. 2020, [Peer-reviewed]
English, Scientific journal

Chronic atrophic gastritis detection with a convolutional neural network considering stomach regions
Misaki Kanai; Ren Togo; Takahiro Ogawa; Miki Haseyama
WORLD JOURNAL OF GASTROENTEROLOGY, 26, 25, 3650, 3659, Jul. 2020, [Peer-reviewed]
English, Scientific journal

Chronic gastritis classification using gastric X-ray images with a semi-supervised learning method based on tri-training
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 58, 6, 1239, 1250, Jun. 2020, [Peer-reviewed]
English, Scientific journal

Interactive re-ranking for cross-modal retrieval based on object-wise question answering.
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
MMAsia 2020: ACM Multimedia Asia(MMAsia), 37, 7, ACM, 2020
International conference proceedings

Image Retrieval with Data Augmentation of Sentence Labels Based on Paraphrasing
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020
English, International conference proceedings

Estimating Viewed Images with Natural Language Question Answering from fMRI Data.
Saya Takada; Ren Togo; Takahiro Ogawa; Miki Haseyama
2nd IEEE Global Conference on Life Sciences and Technologies(LifeTech), 99, 100, IEEE, 2020
International conference proceedings

Generation of Viewed Image Captions From Human Brain Activity Via Unsupervised Text Latent Space.
Saya Takada; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE International Conference on Image Processing(ICIP), 2521, 2525, IEEE, 2020
International conference proceedings

MULTIMODAL IMAGE-TO-IMAGE TRANSLATION FOR GENERATION OF GASTRITIS IMAGES
Ren Togo; Takahiro Ogawa; Miki Haseyama
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2466, 2470, 2020
English, International conference proceedings

IMAGE RETRIEVAL WITH LINGUAL AND VISUAL PARAPHRASING VIA GENERATIVE MODELS
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2431, 2435, 2020
English, International conference proceedings

Variational Autoencoder Based Unsupervised Domain Adaptation For Semantic Segmentation.
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE International Conference on Image Processing(ICIP), 2426, 2430, IEEE, 2020
International conference proceedings

Soft-Label Anonymous Gastric X-Ray Image Distillation.
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE International Conference on Image Processing(ICIP), 305, 309, IEEE, 2020
This paper presents a soft-label anonymous gastric X-ray image distillation
method based on a gradient descent approach. The sharing of medical data is
demanded to construct high-accuracy computer-aided diagnosis (CAD) systems.
However, the large size of the medical dataset and privacy protection are
remaining problems in medical data sharing, which hindered the research of CAD
systems. The idea of our distillation method is to extract the valid
information of the medical dataset and generate a tiny distilled dataset that
has a different data distribution. Different from model distillation, our
method aims to find the optimal distilled images, distilled labels and the
optimized learning rate. Experimental results show that the proposed method can
not only effectively compress the medical dataset but also anonymize medical
images to protect the patient's private information. The proposed approach can
improve the efficiency and security of medical data sharing.

Estimation Of Visual Contents Based On Question Answering From Human Brain Activity.
Saya Takada; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE International Conference on Image Processing(ICIP), 61, 65, IEEE, 2020
International conference proceedings

Unsupervised Domain Adaptation for Semantic Segmentation with Symmetric Adaptation Consistency.
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
2020 IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), 2263, 2267, IEEE, 2020
International conference proceedings

Music Playlist Generation Based on Reinforcement Learning Using Acoustic Feature Map.
Keigo Sakurai; Ren Togo; Takahiro Ogawa; Miki Haseyama
9th IEEE Global Conference on Consumer Electronics(GCCE), 942, 943, IEEE, 2020
International conference proceedings

Question Answering for Estimation of Seen Image Contents from Multi-subject fMRI Responses.
Saya Takada; Ren Togo; Takahiro Ogawa; Miki Haseyama
9th IEEE Global Conference on Consumer Electronics(GCCE), 712, 713, IEEE, 2020
International conference proceedings

Face Synthesis via User Manipulation of Disentangled Latent Representation.
Nao Nakagawa; Ren Togo; Takahiro Ogawa; Miki Haseyama
9th IEEE Global Conference on Consumer Electronics(GCCE), 692, 693, IEEE, 2020
International conference proceedings

Complexity Evaluation of Medical Image Data for Classification Problem Based on Spectral Clustering.
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
9th IEEE Global Conference on Consumer Electronics(GCCE), 667, 669, IEEE, 2020
International conference proceedings

Enhancing Cross-Modal Retrieval Based on Modality-Specific and Embedding Spaces
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 8, 96777, 96786, 2020, [Peer-reviewed]
English, Scientific journal

Aesthetic style transfer through text-to-image synthesis and image-to-image translation
Megumi Kotera; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Global Conference on Consumer Electronics (GCCE), 492, 493, IEEE, Oct. 2019, [Peer-reviewed]
English, International conference proceedings

Voice-input multimedia information retrieval system based on text-to-image GAN
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Global Conference on Consumer Electronics (GCCE), 943, 944, IEEE, Oct. 2019, [Peer-reviewed]
English, International conference proceedings

Estimation of drilling energy from tunnel cutting face image based on online learning
Kentaro Yamamoto; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Global Conference on Consumer Electronics (GCCE), 77, 1, 794, 795, Oct. 2019, [Peer-reviewed]
English, International conference proceedings

Detection of distress region from subway tunnel images via U-net-based deep semantic segmentation
An Wang; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Global Conference on Consumer Electronics (GCCE), 766, 767, IEEE, Oct. 2019, [Peer-reviewed]
English, International conference proceedings

Detection of gastritis by a deep convolutional neural network from double-contrast upper gastrointestinal barium X-ray radiography
Ren Togo; Nobutake Yamamichi; Katsuhiro Mabe; Yu Takahashi; Chihiro Takeuchi; Mototsugu Kato; Naoya Sakamoto; Kenta Ishihara; Takahiro Ogawa; Miki Haseyama
JOURNAL OF GASTROENTEROLOGY, 54, 4, 321, 329, Apr. 2019, [Peer-reviewed]
English, Scientific journal

Extraction of regions related to cardiac sarcoidosis in polar map images
Ren Togo; Takahiro Ogawa; Osamu Manabe; Kenji Hirata; Tohru Shiga; Miki Haseyama
2019 IEEE 1st Global Conference on Life Sciences and Technologies, LifeTech 2019, 237, 238, Mar. 2019, [Peer-reviewed]
English, International conference proceedings

Bone metastatic tumor detection based on AnoGAN using CT images
Haruna Watanabe; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE 1st Global Conference on Life Sciences and Technologies, LifeTech 2019, 235, 236, Mar. 2019, [Peer-reviewed]
English, International conference proceedings

Fine-tuning of pre-trained DCNN for gastritis detection from gastric X-ray images
Misaki Kanai; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE 1st Global Conference on Life Sciences and Technologies, LifeTech 2019, 196, 197, Mar. 2019, [Peer-reviewed]
English, International conference proceedings

Classification of subcellular protein patterns in human cells with transfer learning
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE 1st Global Conference on Life Sciences and Technologies, LifeTech 2019, 273, 274, Mar. 2019, [Peer-reviewed]
International conference proceedings

Estimating Regions of Deterioration in Electron Microscope Images of Rubber Materials via a Transfer Learning-Based Anomaly Detection Model
Ren Togo; Naoki Saito; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 7, 162395, 162404, 2019, [Peer-reviewed]
English, Scientific journal

Text-to-Image GAN-Based Scene Retrieval and Re-Ranking Considering Word Importance
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 7, 169920, 169930, 2019, [Peer-reviewed]
English, Scientific journal

Query is GAN: Scene Retrieval With Attentional Text-to-Image Generative Adversarial Network
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 7, 153183, 153193, 2019, [Peer-reviewed]
English, Scientific journal

Effectiveness Evaluation of Deep Features for Image Reconstruction from fMRI Signals.
Saya Takada; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Global Conference on Consumer Electronics (GCCE), 479, 480, IEEE, 2019, [Peer-reviewed]
English, International conference proceedings

Scene Retrieval Using Text-to-image GAN-based Visual Similarities and Image-to-text Model-based Textual Similarities.
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE Global Conference on Consumer Electronics (GCCE), 13, 14, IEEE, 2019, [Peer-reviewed]
English, International conference proceedings

SCENE RETRIEVAL FOR VIDEO SUMMARIZATION BASED ON TEXT-TO-IMAGE GAN
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 1825, 1829, 2019, [Peer-reviewed]
English, International conference proceedings

GASTRITIS DETECTION FROM GASTRIC X-RAY IMAGES VIA FINE-TUNING OF PATCH-BASED DEEP CONVOLUTIONAL NEURAL NETWORK
Misaki Kanai; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 1371, 1375, 2019, [Peer-reviewed]
English, International conference proceedings

Scene Retrieval from Multiple Resolution Generated Images Based on Text-to-image GAN
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 1, 5, 2019, [Peer-reviewed]
English, International conference proceedings

Semi-supervised Learning Based on Tri-training for Gastritis Classification Using Gastric X-ray Images
Zongyao Li; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 1, 5, 2019, [Peer-reviewed]
English, International conference proceedings

Synthetic Image Generation for Gastritis Detection Based on Auxiliary Classifier Generative Adversarial Network
Misaki Kanai; Ren Togo; Takahiro Ogawa; Miki Haseyama
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 1, 5, 2019, [Peer-reviewed]
English, International conference proceedings

Synthetic Gastritis Image Generation via Loss Function-Based Conditional PGGAN
Ren Togo; Takahiro Ogawa; Miki Haseyama
IEEE ACCESS, 7, 87448, 87457, 2019, [Peer-reviewed]
English, Scientific journal

Cardiac sarcoidosis classification with deep convolutional neural network-based features using polar maps
Ren Togo; Kenji Hirata; Osamu Manabe; Hiroshi Ohira; Ichizo Tsujino; Keiichi Magota; Takahiro Ogawa; Miki Haseyama; Tohru Shiga
COMPUTERS IN BIOLOGY AND MEDICINE, 104, 81, 86, Jan. 2019, [Peer-reviewed], [International Magazine]
English, Scientific journal

Multi-classifier Decision: Integration of Multiple Brain Activity-based Classifications
Takahiro Ogawa; Kento Sugata; Ren Togo; Miki Haseyama
ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 7, 1, 36, 44, 2019, [Peer-reviewed]
English, Scientific journal

Use of deep convolutional neural network-based features for detection of cardiac sarcoidosis from polar map
Ren Togo; Kenji Hirata; Osamu Manabe; Hiroshi Ohira; Ichizo Tsujino; Takahiro Ogawa; Miki Haseyama; Tohru Shiga
JOURNAL OF NUCLEAR MEDICINE, 59, May 2018, [Peer-reviewed]
English, International conference proceedings

Strategy to develop convolutional neural network-based classifier for diagnosis of whole-body FDG PET images
Keisuke Kawauchi; Kenji Hirata; Seiya Ichikawa; Osamu Manabe; Kentaro Kobayashi; Shiro Watanabe; Miki Haseyama; Takahiro Ogawa; Ren Togo; Tohru Shiga; Chietsugu Katoh
JOURNAL OF NUCLEAR MEDICINE, 59, May 2018, [Peer-reviewed]
English, International conference proceedings

Preliminary study of automatic gastric cancer risk classification from photofluorography
Ren Togo; Kenta Ishihara; Katsuhiro Mabe; Harufumi Oizumi; Takahiro Ogawa; Mototsugu Kato; Naoya Sakamoto; Shigemi Nakajima; Masahiro Asaka; Miki Haseyama
WORLD JOURNAL OF GASTROINTESTINAL ONCOLOGY, 10, 2, 62, 70, Feb. 2018, [Peer-reviewed]
English, Scientific journal

Download

ANONYMOUS IMAGE DATA GENERATION FROM GASTRIC X-RAY IMAGES FOR IMPROVING GASTRITIS RECOGNITION PERFORMANCE
Ren Togo; Kenta Ishihara; Takahiro Ogawa; Miki Haseyama
2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018, [Peer-reviewed]
English, International conference proceedings

ANONYMOUS GASTRITIS IMAGE GENERATION VIA ADVERSARIAL LEARNING FROM GASTRIC X-RAY IMAGES
Ren Togo; Kenta Ishihara; Takahiro Ogawa; Miki Haseyama
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2082, 2086, 2018, [Peer-reviewed]
English, International conference proceedings

Image Retrieval from Vague Description Based on AttnGAN
Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama
2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 198, 199, 2018, [Peer-reviewed]
English, International conference proceedings

Effectiveness Evaluation of Imaging Direction for Estimation of Gastritis Regions on Gastric X-ray Images
Ren Togo; Kenta Ishihara; Takahiro Ogawa; Miki Haseyama
International Technical Conference on Circuits, Systems, Computers, and Communications (ITC-CSCC), 459, 460, May 2017, [Peer-reviewed]
English, International conference proceedings

Aesthetic Quality Assessment of Images via Supervised Locality Preserving CCA
Misaki Kanai; Ren Togo; Takahiro Ogawa; Miki Haseyama
2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017-, 1, 2, 2017, [Peer-reviewed]
English, International conference proceedings

Estimation of salient regions related to chronic gastritis using gastric X-ray images
Ren Togo; Kenta Ishihara; Takahiro Ogawa; Miki Haseyama
COMPUTERS IN BIOLOGY AND MEDICINE, 77, 9, 15, Oct. 2016, [Peer-reviewed]
English, Scientific journal

■ Other Activities and Achievements

A note on motion recognition for tire inspection based on the cooperative use of object tracking models and video-LLMs
上川恭平; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 人工知能学会全国大会論文集(Web), 39th, 2025

A Note on Sensitivity Evaluation of Novel View Synthesis Metrics in 3D Scenes with Limited Conditions
WANG Haoyang; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

Damage Classification Using Road Attachment Images Based on Vision Transformer and Vision Language Model
渡部航史; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

Effectiveness Verification of Introducing Model Merging in Federated Learning-Investigation from Image Classification Tasks Targeting Multiple Domains-
久保田健太; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

Event Location Prediction from Urgent Calls based on Fine-tuning of Speech Recognition Models for Geographic Name Recognition
吉田将規; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

A Note on Supporting Interior Coordination Using Image Generation and Complementary Recommendation Techniques
櫻井慶悟; 岡村洋希; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

A Note on Personalized Anomaly Detection Based on Vision Language Model Using Image Prompt
松田遥; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

A Note on the Effectiveness of Brain Activity Information Against Adversarial Attacks-Utilization of Image Reconstruction Method from Brain Signals Using Generative Models-
中島佑; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

A Note on Consideration of Spatial Integrity in Continuous 3D Scene Generation Method
江良勇輝; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

A Note on Interpretability of Visual Language Model by Few-shot Learning Based on the Linear Representation Hypothesis
岡村洋希; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

Advanced Finding Generation AI Based on In-context Learning for Inspection Report Creation
佐藤雅也; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

A Note on Customer Interest Estimation Method Based on Multiple Transformer Models Using Real Store Video Data
山城輝久; 藤後廉; 本間勇紀; 吉田裕; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 49, 4(MMS2025 1-40/ME2025 1-40/AIT2025 1-40/SIP2025 1-40), 2025

Automatic generation of findings using generative AI to support for inspection report creation -Introduction of in-context learning based on similar image retrieval through cluster analysis-
佐藤雅也; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, AI・データサイエンス論文集(Web), 6, 1, 2025

Damage level estimation of inspection images in road infrastructures using in-context learning with data augmentation
中島佑; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, AI・データサイエンス論文集(Web), 6, 1, 2025

Dynamic analysis of factory and environmental factors affecting properties of rubber materials based on causal inference
ZHANG Huaying; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 人工知能学会全国大会論文集(Web), 39th, 2025

RGMIM: Region-Guided Masked Image Modeling for Learning Meaningful Representations from X-Ray Images
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama, ECCV Workshops (14), 148, 157, 2025
Purpose: Self-supervised learning has been gaining attention in the medical
field for its potential to improve computer-aided diagnosis. One popular method
of self-supervised learning is masked image modeling (MIM), which involves
masking a subset of input pixels and predicting the masked pixels. However,
traditional MIM methods typically use a random masking strategy, which may not
be ideal for medical images that often have a small region of interest for
disease detection. To address this issue, this work aims to improve MIM for
medical images and evaluate its effectiveness in an open X-ray image dataset.
Methods: In this paper, we present a novel method called region-guided masked
image modeling (RGMIM) for learning meaningful representation from X-ray
images. Our method adopts a new masking strategy that utilizes organ mask
information to identify valid regions for learning more meaningful
representations. The proposed method was contrasted with five self-supervised
learning techniques (MAE, SKD, Cross, BYOL, and, SimSiam). We conduct
quantitative evaluations on an open lung X-ray image dataset as well as masking
ratio hyperparameter studies. Results: When using the entire training set,
RGMIM outperformed other comparable methods, achieving a 0.962 lung disease
detection accuracy. Specifically, RGMIM significantly improved performance in
small data volumes, such as 5% and 10% of the training set (846 and 1,693
images) compared to other methods, and achieved a 0.957 detection accuracy even
when only 50% of the training set was used. Conclusions: RGMIM can mask more
valid regions, facilitating the learning of discriminative representations and
the subsequent high-accuracy lung disease detection. RGMIM outperforms other
state-of-the-art self-supervised learning methods in experiments, particularly
when limited training data is used.

Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition
Yaozong Gan; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama, CoRR, abs/2407.05814, 08 Jul. 2024
Recent multimodal large language models (MLLM) such as GPT-4o and GPT-4v have
shown great potential in autonomous driving. In this paper, we propose a
cross-domain few-shot in-context learning method based on the MLLM for
enhancing traffic sign recognition (TSR). We first construct a traffic sign
detection network based on Vision Transformer Adapter and an extraction module
to extract traffic signs from the original road images. To reduce the
dependence on training data and improve the performance stability of
cross-country TSR, we introduce a cross-domain few-shot in-context learning
method based on the MLLM. To enhance MLLM's fine-grained recognition ability of
traffic signs, the proposed method generates corresponding description texts
using template traffic signs. These description texts contain key information
about the shape, color, and composition of traffic signs, which can stimulate
the ability of MLLM to perceive fine-grained traffic sign categories. By using
the description texts, our method reduces the cross-domain differences between
template and real traffic signs. Our approach requires only simple and uniform
textual indications, without the need for large-scale traffic sign images and
labels. We perform comprehensive evaluations on the German traffic sign
recognition benchmark dataset, the Belgium traffic sign dataset, and two
real-world datasets taken from Japan. The experimental results show that our
method significantly enhances the TSR performance.

Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs
Huaying Zhang; Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama, CoRR, abs/2406.18836, 27 Jun. 2024
This paper proposes a novel zero-shot composed image retrieval (CIR) method
considering the query-target relationship by masked image-text pairs. The
objective of CIR is to retrieve the target image using a query image and a
query text. Existing methods use a textual inversion network to convert the
query image into a pseudo word to compose the image and text and use a
pre-trained visual-language model to realize the retrieval. However, they do
not consider the query-target relationship to train the textual inversion
network to acquire information for retrieval. In this paper, we propose a novel
zero-shot CIR method that is trained end-to-end using masked image-text pairs.
By exploiting the abundant image-text pairs that are convenient to obtain with
a masking strategy for learning the query-target relationship, it is expected
that accurate zero-shot CIR using a retrieval-focused textual inversion network
can be realized. Experimental results show the effectiveness of the proposed
method.

Reinforcing Pre-trained Models Using Counterfactual Images
Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama, CoRR, abs/2406.13316, 19 Jun. 2024
This paper proposes a novel framework to reinforce classification models
using language-guided generated counterfactual images. Deep learning
classification models are often trained using datasets that mirror real-world
scenarios. In this training process, because learning is based solely on
correlations with labels, there is a risk that models may learn spurious
relationships, such as an overreliance on features not central to the subject,
like background elements in images. However, due to the black-box nature of the
decision-making process in deep learning models, identifying and addressing
these vulnerabilities has been particularly challenging. We introduce a novel
framework for reinforcing the classification models, which consists of a
two-stage process. First, we identify model weaknesses by testing the model
using the counterfactual image dataset, which is generated by perturbed image
captions. Subsequently, we employ the counterfactual images as an augmented
dataset to fine-tune and reinforce the classification model. Through extensive
experiments on several classification models across various datasets, we
revealed that fine-tuning with a small set of counterfactual images effectively
strengthens the model.

Generative Dataset Distillation: Balancing Global Structure and Local Details
Longzhen Li; Guang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama, CoRR, abs/2404.17732, 26 Apr. 2024
In this paper, we propose a new dataset distillation method that considers
balancing global structure and local details when distilling the information
from a large dataset into a generative model. Dataset distillation has been
proposed to reduce the size of the required dataset when training models. The
conventional dataset distillation methods face the problem of long redeployment
time and poor cross-architecture performance. Moreover, previous methods
focused too much on the high-level semantic attributes between the synthetic
dataset and the original dataset while ignoring the local features such as
texture and shape. Based on the above understanding, we propose a new method
for distilling the original image dataset into a generative model. Our method
involves using a conditional generative adversarial network to generate the
distilled dataset. Subsequently, we ensure balancing global structure and local
details in the distillation process, continuously optimizing the generator for
more information-dense dataset generation.

Enhancing Generative Class Incremental Learning Performance with Model Forgetting Approach
Taro Togo; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama, CoRR, abs/2403.18258, 27 Mar. 2024
This study presents a novel approach to Generative Class Incremental Learning
(GCIL) by introducing the forgetting mechanism, aimed at dynamically managing
class information for better adaptation to streaming data. GCIL is one of the
hot topics in the field of computer vision, and this is considered one of the
crucial tasks in society, specifically the continual learning of generative
models. The ability to forget is a crucial brain function that facilitates
continual learning by selectively discarding less relevant information for
humans. However, in the field of machine learning models, the concept of
intentionally forgetting has not been extensively investigated. In this study
we aim to bridge this gap by incorporating the forgetting mechanisms into GCIL,
thereby examining their impact on the models' ability to learn in continual
learning. Through our experiments, we have found that integrating the
forgetting mechanisms significantly enhances the models' performance in
acquiring new knowledge, underscoring the positive role that strategic
forgetting plays in the process of continual learning.

Prompt-based Personalized Federated Learning for Medical Visual Question Answering
He Zhu; Ren Togo; Takahiro Ogawa; Miki Haseyama, ICASSP, abs/2402.09677, 1821, 1825, 15 Feb. 2024
We present a novel prompt-based personalized federated learning (pFL) method
to address data heterogeneity and privacy concerns in traditional medical
visual question answering (VQA) methods. Specifically, we regard medical
datasets from different organs as clients and use pFL to train personalized
transformer-based VQA models for each client. To address the high computational
complexity of client-to-client communication in previous pFL methods, we
propose a succinct information sharing system by introducing prompts that are
small learnable parameters. In addition, the proposed method introduces a
reliability parameter to prevent the negative effects of low performance and
irrelevant clients. Finally, extensive evaluations on various heterogeneous
medical datasets attest to the effectiveness of our proposed method., IEEE

Importance-Aware Adaptive Dataset Distillation
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama, Neural Networks, 172, 106154, 106154, 29 Jan. 2024
Herein, we propose a novel dataset distillation method for constructing small
informative datasets that preserve the information of the large original
datasets. The development of deep learning models is enabled by the
availability of large-scale datasets. Despite unprecedented success,
large-scale datasets considerably increase the storage and transmission costs,
resulting in a cumbersome model training process. Moreover, using raw data for
training raises privacy and copyright concerns. To address these issues, a new
task named dataset distillation has been introduced, aiming to synthesize a
compact dataset that retains the essential information from the large original
dataset. State-of-the-art (SOTA) dataset distillation methods have been
proposed by matching gradients or network parameters obtained during training
on real and synthetic datasets. The contribution of different network
parameters to the distillation process varies, and uniformly treating them
leads to degraded distillation performance. Based on this observation, we
propose an importance-aware adaptive dataset distillation (IADD) method that
can improve distillation performance by automatically assigning importance
weights to different network parameters during distillation, thereby
synthesizing more robust distilled datasets. IADD demonstrates superior
performance over other SOTA dataset distillation methods based on parameter
matching on multiple benchmark datasets and outperforms them in terms of
cross-architecture generalization. In addition, the analysis of self-adaptive
weights demonstrates the effectiveness of IADD. Furthermore, the effectiveness
of IADD is validated in a real-world medical application such as COVID-19
detection.

A Note on Similar Case Retrieval via Deep Metric Learning Using Sensor Data Obtained from Semiconductor Manufacturing Equipment
斉藤直輝; 藤後廉; 前田圭介; 小林累輝; 中村隆央; 岡谷基弘; 数井誠人; 松沢貴仁; 小川貴弘; 長谷山美紀, 人工知能学会全国大会論文集(Web), 38th, 2024

Investigation on estimation of factory and environmental factors affecting properties of rubber materials
柳凜太郎; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 人工知能学会全国大会論文集(Web), 38th, 2024

A note on recommendation with explainability based on knowledge graph reasoning using graph masked autoencoders
櫻井慶悟; 藤後廉; 小川貴弘; 長谷山美紀, 人工知能学会全国大会論文集(Web), 38th, 2024

Prediction of Event Locations from Urgent Call Using Speech Recognition and Generative AI
吉田将規; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 48, 6(MMS2024 1-30/ME2024 17-46/AIT2024 1-30), 2024

A Note on Domain Adaptation by Setting Features of Interest in Visual Language Models
岡村洋希; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 48, 6(MMS2024 1-30/ME2024 17-46/AIT2024 1-30), 2024

A Note on Reduced Reference Image Quality Assessment for Neural Radiance Fields
河合雅斗; 柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 48, 6(MMS2024 1-30/ME2024 17-46/AIT2024 1-30), 2024

A Note on Improvement of Multimodal Large Language Model Considering Object Attributes and Relationships
大羽賀駿也; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 48, 6(MMS2024 1-30/ME2024 17-46/AIT2024 1-30), 2024

A Note on Pass Area Estimation Considering Positional Coordinate Information of American Football
河合雅斗; 柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 48, 6(MMS2024 1-30/ME2024 17-46/AIT2024 1-30), 2024

A Note on Text to Image Generation Based on Stable Diffusion Using Brain Activity Data While Gazing on Image-Introduction of Controllable Mechanism with Brain Activity Data in Latent Space-
七田亮; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告(Web), 48, 6(MMS2024 1-30/ME2024 17-46/AIT2024 1-30), 2024

Automatic generation of findings using generative AI to support for inspection report creation -Introduction of in-context learning based on similar image retrieval using data pool compression-
佐藤雅也; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, AI・データサイエンス論文集(Web), 5, 3, 2024

A Note on Improving Robustness of CLIP by Adversarial Training Enhanced with Brain Activity
中島佑; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024

A Note on Lung Disease Classification Considering Domain Knowledge Via Continuous Self-supervised Learning Pre-trained on J-MID Database
太齊蓮; LI Guang; 藤後廉; TANG Minghui; 吉村高明; 杉森博行; 平田健司; 小川貴弘; 工藤與亮; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024

A Note on Model Generalization Based on Generated Images Using Data Selection Considering Class Information
早川楓; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024

A Note on Improvement of Non-hierarchical Clients Clustering Utilizing Model Learning Trajectories on Personalized Federated Learning
久保田健太; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024

A Note on Text-Controllable Symbolic Music Generation Based on Diffusion Models
ZHU Jinlong; 櫻井慶悟; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024

A Note on Lung Disease Classification in Small Datasets Based on Weight Selection
堤彩花; LI Guang; 藤後廉; 小川貴弘; 近藤敏志; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2024, 2024

A Note on Zero-shot Composed Image Retrieval Considering the Relationships between Images and Texts based on Textual Inversion
ZHANG Huaying; 柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2023, 2023

A Note on Improving Noisy Labels Learning via Label Correction Utilizing Pre-trained Models
柏木將希; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2023, 2023

A Note on Caption Unification for Multi-view Lifelogging Images Using In-context Learning
佐藤雅也; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2023, 2023

A Note on the Personalization for Multiple Objects in Image Generation Using a Diffusion Model
松田遥; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2023, 2023

A Note on the Estimation of Language Information from fMRI Using a Multimodal Large-Scale Language Model-Estimation of Language Information from Temporal Auditory Stimuli Based on In-context Learning-
藤後太郎; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2023, 2023

A Study on Evaluating the Role of Feature Extraction in Enhancing the Accuracy of Visual Counterfactual Machine Learning Models
LI Xiang; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2023, 2023

A Note on Text Prompt Tuning in Cross-modal Image Retrieval for a Specific Database
ZHANG Huaying; 柳凛太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 47, 6(MMS2023 1-34/ME2023 21-54/AIT2023 1-34), 2023

A Note on Improvement of Binauralization Performance Based on Multi-view Learning on 360° Videos
吉田将規; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 47, 6(MMS2023 1-34/ME2023 21-54/AIT2023 1-34), 2023

A Note on Interpretable Visual Question Answering Model with Visual Representation Based on Object Detection Model
ZHU He; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 47, 6(MMS2023 1-34/ME2023 21-54/AIT2023 1-34), 2023

Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing
Guang Li; Ren Togo; Takahiro Ogawa; Miki Haseyama, CoRR, abs/2209.14635, 29 Sep. 2022
Background and objective: Sharing of medical data is required to enable the
cross-agency flow of healthcare information and construct high-accuracy
computer-aided diagnosis systems. However, the large sizes of medical datasets,
the massive amount of memory of saved deep convolutional neural network (DCNN)
models, and patients' privacy protection are problems that can lead to
inefficient medical data sharing. Therefore, this study proposes a novel
soft-label dataset distillation method for medical data sharing. Methods: The
proposed method distills valid information of medical image data and generates
several compressed images with different data distributions for anonymous
medical data sharing. Furthermore, our method can extract essential weights of
DCNN models to reduce the memory required to save trained models for efficient
medical data sharing. Results: The proposed method can compress tens of
thousands of images into several soft-label images and reduce the size of a
trained model to a few hundredths of its original size. The compressed images
obtained after distillation have been visually anonymized; therefore, they do
not contain the private information of the patients. Furthermore, we can
realize high-detection performance with a small number of compressed images.
Conclusions: The experimental results show that the proposed method can improve
the efficiency and security of medical data sharing.

Variational Autoencoderに基づく深層生成モデルを用いた潜在表現のDisentanglementに関する検討 : Disentanglement評価指標を含む正則化損失の導入—A Note on Disentanglement Using Deep Generative Model Based on Variational Autoencoder : Introduction of Regularization Losses Based on Metrics of Disentangled Representation—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
中川真; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 97, 102, Feb. 2022
Japanese

ゴム材料開発のためのGenerative Adversarial Networkに基づく配合量および物性からの電子顕微鏡画像の生成に関する一検討—A Note on Electron Microscope Image Generation from Mix Proportion and Material Property via Generative Adversarial Network for Rubber Materials—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
柳凛太郎; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 187, 191, Feb. 2022
Japanese

セマンティックセグメンテーションに対するマルチソースモデル適応に関する検討 : 複数のソースモデルからの不変な特徴表現の学習による適応精度の向上—A note on multi-source model adaptation for semantic segmentation : Improving adaptation performance by learning model-invariant representation from multiple source models—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
李宗曜; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 37, 41, Feb. 2022
Japanese

地下鉄トンネル点検時の技術者から取得される生体信号と技術者の点検行動の関連性分析—Relevance Analysis between Bio-signals of Engineers Inspecting Subway Tunnels and Their Inspection Behaviors—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
平澤魁人; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 365, 370, Feb. 2022
Japanese

高速道路の遮音壁画像を用いた物体検出手法による変状分類の高精度化に関する検討—A note on improvement of distress classification using noise barrier images on highway via object detection method—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
梁鋆; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 359, 363, Feb. 2022
Japanese

地下鉄トンネルの維持管理支援のためのマルチスケール解析を導入した深層学習に基づく変状検出に関する検討—A Note on Distress Detection Based on Deep Learning with Hierarchical Multi-Scale Attention Mechanism for Supporting Maintenance of Subway Tunnels—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
高田紗弥; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 377, 381, Feb. 2022
Japanese

橋梁点検時の技術者の一人称および三人称視点映像を用いた点検動作の分類に関する検討—A Note on Inspection Action Classification Using First and Third Person Video of Engineers Inspecting Bridges—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
増田毅; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 177, 180, Feb. 2022
Japanese

ユーザの嗜好を考慮した強化学習と知識グラフに基づく楽曲プレイリスト生成に関する検討—A Note on User Preference-Aware Music Playlist Generation Based on Reinforcement Learning and Knowledge Graph—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
櫻井慶悟; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 46, 6, 109, 112, Feb. 2022
Japanese

WINTER ROAD SURFACE CONDITION CLASSIFICATION USING DEEP LEARNING WITH FOCAL LOSS BASED ON TEXT AND IMAGE INFORMATION
諸戸祐哉; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, AI・データサイエンス論文集(Web), 3, J2, 2022

DISTRESS DETECTION BASED ON VISION TRANSFORMER USING EGOCENTRIC VIDEOS WHILE INSPECTING IN SUBWEY TUNNELS
櫻井慶悟; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, AI・データサイエンス論文集(Web), 3, J2, 2022

CLASSIFICATION OF INSPECTION ACTION OF ENGINEERS FOR EFFICIENT VIDEO PRESENTATION TO ASSIST ASSESSMENT OF INFRASTRUCTURE FACILITY DISTRESS
上川恭平; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, AI・データサイエンス論文集(Web), 3, J2, 2022

A note on multi-source model adaptation for semantic segmentation-Improving adaptation performance by learning model-invariant representation from multiple source models-
LI Zongyao; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 2022

A Note on User Preference-Aware Music Playlist Generation Based on Reinforcement Learning and Knowledge Graph
櫻井慶悟; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 2022

A Note on Disentanglement Using Deep Generative Model Based on Variational Autoencoder-Introduction of Regularization Losses Based on Metrics of Disentangled Representation-
中川真; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 2022

A Note of the Relationship between Popularity Bias and Embedding Representations of Latent Factor Models in Collaborative Filtering
岡村洋希; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022

A Note on Robust Recommendation System for Domain-dependent Preference Based on Domain-shared Network
山本一輝; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022

A Note on Image Retrieval Robust for Changing Camera Views Using Synthesized Images by pixelNeRF
江良勇輝; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022

A Study on Non-reference Image Quality Assessment Considering Phaseless Components of Neural Radiance Fields
河合雅斗; 柳凛太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022

A Note on Estimation of Viewed Images Using Brain Activity Data While Viewing Images-A Verification of Estimation Accuracy by Regression Model Used for fMRI Decoder-
七田亮; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022

A Note on Estimation of Diabetic Retinopathy Grades Based on Unsupervised Domain Adaptation Using Fundus Images
國枝翼; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2022, 2022

A Note on Electron Microscope Image Generation from Mix Proportion and Material Property via Generative Adversarial Network for Rubber Materials
柳凜太郎; 藤後廉; 前田圭介; 前田圭介; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 2022

A Note on Distress Detection Based on Deep Learning with Hierarchical Multi-Scale Attention Mechanism for Supporting Maintenance of Subway Tunnels
高田紗弥; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 2022

Relevance Analysis between Bio-signals of Engineers Inspecting Subway Tunnels and Their Inspection Behaviors
平澤魁人; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 46, 6(MMS2022 1-37/ME2022 26-62/AIT2022 1-37), 2022

地下鉄トンネルの維持管理支援を目的とした深層学習に基づく変状検出の高精度化に関する検討 : 壁面の施工方法に注目した精度検証—A Note on Improving Performance of Deep Learning-based Distress Detection for Supporting Maintenance of Subway Tunnels : Accuracy Verification Focusing on Tunnel Wall Characteristics—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
春山知生; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 45, 4, 1, 6, Feb. 2021
映像情報メディア学会, Japanese

特別講演実店舗の防犯カメラ映像データを用いた顧客の関心推定に関する検討 : 姿勢推定モデルから得られる特徴量を用いた異常検知モデルの精度検証—A Note on Customer's Interest Estimation Method Using Security Camera Video Data from the Real Store : Validation of the accuracy of an anomaly detection model using features obtained from a posture estimation model—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
本間勇紀; 藤後廉; 阿部真育; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 45, 4, 193, 198, Feb. 2021
映像情報メディア学会, Japanese

特別講演ゴム材料開発のためのConditional StyleGANに基づく配合量からの電子顕微鏡画像の生成に関する一検討—A Note on Electron Microscope Image Generation from Mix Proportion via Conditional Style Generative Adversarial Network for Rubber Materials—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 45, 4, 171, 175, Feb. 2021
映像情報メディア学会, Japanese

TCCに基づく自己教師学習による特徴表現を用いた映像中の人物動作の検出に関する検討 : 動作検出パラメータが与える影響に対する考察—A Note on Action Detection Using Feature Representation via Self-Supervised Learning based on TCC : Effectiveness evaluation of parameters for action detection—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
増田毅; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 45, 4, 47, 51, Feb. 2021
映像情報メディア学会, Japanese

電子顕微鏡により撮像されたゴム材料からの異常検知に基づく劣化領域の推定に関する一検討 : 深層学習モデルにより抽出された特徴表現の有効性検証—A Note on Estimation of Deteriorated Regions Based on Anomaly Detection from Rubber Material Electron Microscope Images : Verification of Feature Representations Extracted from Deep Learning Models—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
松本真直; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 45, 4, 43, 46, Feb. 2021
映像情報メディア学会, Japanese

特別講演ユーザの嗜好を反映可能なインテリアコーディネート検索技術の構築 : コーディネートを表現可能な特徴の抽出と実データへの適用—A Note on Interior Coordination Retrieval Reflecting User's Preferences : Extraction of Features Representing Coordination and Application to Real Data—マルチメディアストレージメディア工学映像表現&コンピュータグラフィックス
藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 45, 4, 187, 191, Feb. 2021
映像情報メディア学会, Japanese

A Note on Off-screen Sound Detection Based on Audio-visual Spatialization-Introduction of Audio-visual Features Based on Self-supervised Learning-
吉田将規; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2021, 2021

A Note on Effect of Visual Representation Ability to Performance of World Models
大羽賀駿也; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2021, 2021

A Note on Accuracy Assessment Focusing on Manipulation Area by Text-Guided Image Manipulation
渡邉優宇人; 藤後廉; 前田圭介; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2021, 2021

The Clinical Application of Artificial Intelligence for Detecting Atrophic Gastritis by Upper Gastrointestinal Barium X-ray Radiography
竹内千尋; 藤後廉; 小川貴弘; 長谷山美紀; 山道信毅, 臨床消化器内科, 36, 8, 2021

A Note on Estimation of Semantic Content Based on a Question Answering Model Using Brain Activity Data while Viewing Images-Improvement of Estimation Performance Based on fine-tuning of the VQA model-
高田紗弥; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

A Note on Improving Performance of Deep Learning-based Distress Detection for Supporting Maintenance of Subway Tunnels-Accuracy Verification Focusing on Tunnel Wall Characteristics-
春山知生; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

A Note on Customer’s Interest Estimation Method Using Security Camera Video Data from the Real Store-Validation of the accuracy of an anomaly detection model using features obtained from a posture estimation model-
本間勇紀; 本間勇紀; 藤後廉; 阿部真育; 小川貴弘; 長谷山美紀; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

A Note on Estimating of Deteriorated Refions Based on Anomaly Detection from Rubber Material Electron Microscope Images-Verification of Feature Representations Extracted from Deep Learning Models-
松本真直; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

A Note on Action Detection Using Feature Representation via Self-Supervised Learning based on TCC-Effectiveness evaluation of parameters for action detection-
増田毅; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

A Note on Interior Coordination Retrieval Reflecting User’s Preferences-Extraction of Features Representing Coordination and Application to Real Data-
藤後廉; 小川貴弘; 長谷山美紀; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

A Note on Electron Microscope Image Generation from Mix Proportion via Conditional Style Generative Adversarial Network for Rubber Materials
柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告, 45, 4(MMS2021 1-28/ME2021 1-28/AIT2021 1-28), 2021

AIを中心とした医療デジタル技術基盤の構築へ向けた取り組み
藤後廉; 小川貴弘; 長谷山美紀, 日本消化器がん検診学会雑誌(Web), 59, Supplement 1, 2021

胃X線画像を用いたAIによるH.pylori感染識別と今後の展望
藤後廉; 小川貴弘; 間部克裕; 加藤元嗣; 長谷山美紀, 日本消化器がん検診学会雑誌, 58, 2, 127, 127, Mar. 2020
(一社)日本消化器がん検診学会, Japanese

画像内の物体に着目した画像検索に関する検討 : RetinaNetを用いた物体認識に基づく高精度化—A Note on Image Retrieval Focusing on Objects in Images : Improving Retrieval Performance Based on Object Recognition Using RetinaNet—マルチメディアストレージヒューマンインフォメーションメディア工学映像表現&コンピュータグラフィックス
柳凛太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 44, 6, 377, 381, Feb. 2020
映像情報メディア学会, Japanese

画像内の物体に着目した画像検索に関する検討 : RetinaNetを用いた物体認識に基づく高精度化—A Note on Image Retrieval Focusing on Objects in Images Improving Retrieval Performance Based on Object Recognition Using RetinaNet—ITS : Intelligent Transport Systems Technology
柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告 = IEICE technical report : 信学技報, 119, 421, 377, 381, Feb. 2020
電子情報通信学会, Japanese

弱異常検知アルゴリズムに基づくCCTV映像を用いた河川利用者の異常行動の検出に関する検討—A Note on Abnormal Motion Detection of River Users on CCTV Videos Based on Weakly Supervised Anomaly Detection Algorithm—マルチメディアストレージヒューマンインフォメーションメディア工学映像表現&コンピュータグラフィックス
渡邊はるな; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 44, 6, 367, 370, Feb. 2020
映像情報メディア学会, Japanese

ゴム材料の配合量を用いたAC-GANに基づく電子顕微鏡画像の生成に関する一検討—A Note on Generation of Electron Microscope Images via Auxiliary Classifier Generative Adversarial Network with Mix Proportions—ITS : Intelligent Transport Systems Technology
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告 = IEICE technical report : 信学技報, 119, 421, 107, 111, Feb. 2020
電子情報通信学会, Japanese

ゴム材料の配合量を用いたAC-GANに基づく電子顕微鏡画像の生成に関する一検討—A Note on Generation of Electron Microscope Images via Auxiliary Classifier Generative Adversarial Network with Mix Proportions—マルチメディアストレージヒューマンインフォメーションメディア工学映像表現&コンピュータグラフィックス
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 44, 6, 107, 111, Feb. 2020
映像情報メディア学会, Japanese

穿孔データを用いたオンライン学習に基づく岩盤の圧縮強度指数推定に関する検討—A Note on Estimation of rock compressive strength index from drilling data based on online learning—マルチメディアストレージヒューマンインフォメーションメディア工学映像表現&コンピュータグラフィックス
山本健太郎; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 44, 6, 57, 60, Feb. 2020
映像情報メディア学会, Japanese

穿孔データを用いたオンライン学習に基づく岩盤の圧縮強度指数推定に関する検討—A Note on Estimation of rock compressive strength index from drilling data based on online learning—ITS : Intelligent Transport Systems Technology
山本健太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告 = IEICE technical report : 信学技報, 119, 421, 57, 60, Feb. 2020
電子情報通信学会, Japanese

A Note on Estimation of rock compressive strength index from drilling data based on online learning
山本健太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 119, 421(ITS2019 30-56), 2020

A Note on Abnormal Motion Detection of River Users on CCTV Videos Based on Weakly Supervised Anomaly Detection Algorithm
渡邊はるな; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 119, 421(ITS2019 30-56), 2020

A Note on Generation of Electron Microscope Images via Auxiliary Classifier Generative Adversarial Network with Mix Proportions
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 119, 421(ITS2019 30-56), 2020

A Note on Image Retrieval Focusing on Objects in Images-Improving Retrieval Performance Based on Object Recognition Using RetinaNet-
柳凛太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 119, 421(ITS2019 30-56), 2020

A Note on Music Playlist Generation Based on Reinforcement Learning Using Self-Organized Map
櫻井慶悟; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2020, 2020

A Note on Detection of Human Actions Based on Temporal Cycle Consistency Learning
増田毅; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2020, 2020

A Note on Dataset Complexity Assessment Based on nAULS
LI Guang; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2020, 2020

A Note on Improvement of Distress Detection Performance in Subway Tunnel Images by Data Augmentation Based on RICAP
春山知生; 前田圭介; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2020, 2020

A Note on Disentangled Representation Learning via Pre-Trained Semantic Segmentation Model
中川真; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2020, 2020

胃X線画像を用いたAIによるH.pylori感染識別と今後の展望
藤後廉; 小川貴弘; 間部克裕; 加藤元嗣; 長谷山美紀, 日本消化器がん検診学会雑誌(Web), 58, 2, 2020

【Helicobacter Topics!-「旬のHelicobacter」を知り、驚き、理解を深め、楽しむ-】胃バリウム検査におけるAIによるHelicobacter pylori診断
藤後廉; 間部克裕; 山道信毅; 大泉晴史; 小川貴弘; 長谷山美紀; 加藤元嗣; 坂本直哉, Helicobacter Research, 23, 2, 133, 137, Nov. 2019
(株)先端医学社, Japanese

A Note on Semantic Evaluation of Images Generated by Text-to-image Generative Adversarial Networks
柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 119, 131, 21, 24, 19 Jul. 2019
電子情報通信学会, Japanese

【人工知能が医療を変える!医療分野におけるAI研究開発最前線2019】領域別・画像診断におけるAI研究開発の最前線その他核医学におけるディープラーニングを用いた画像診断、画像処理
平田健司; 藤後廉; 小川貴弘; 長谷山美紀; 志賀哲, INNERVISION, 34, 7, 60, 63, Jun. 2019
(株)インナービジョン, Japanese

A Note on Estimation of Deteriorated Regions Based on Anomaly Detection from Rubber Material Electron Microscope Images
藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 118, 449, 265, 268, 19 Feb. 2019
電子情報通信学会, Japanese

A Note on Automatic Malignant Tumor Candidate Detection Based on a 3D Deep Residual Network with FDG-PET/CT Images (ITS)
李宗曜; 藤後廉; 小川貴弘; 平田健司; 真鍋治; 志賀哲; 長谷山美紀, 電子情報通信学会技術研究報告 = IEICE technical report : 信学技報, 118, 449, 311, 314, 19 Feb. 2019
電子情報通信学会, English

A Note on Gastritis Detection from Gastric X-ray Images via Transfer Learning Approach
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 118, 449, 315, 318, 19 Feb. 2019
電子情報通信学会, Japanese

A Note on Generation of Gastritis Images Based on Progressive Growing GAN for Gastritis Classification
渡邊はるな; 藤後廉; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 118, 449, 319, 322, 19 Feb. 2019
電子情報通信学会, Japanese

A Note on Estimation of Deteriorated Regions Based on Anomaly Detection from Rubber Material Electron Microscope Images
藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 43, 5, 265, 268, Feb. 2019
映像情報メディア学会, Japanese

A Note on Automatic Malignant Tumor Candidate Detection Based on a 3D Deep Residual Network with FDG-PET/CT Images (マルチメディアストレージヒューマンインフォメーションメディア工学映像表現&コンピュータグラフィックス)
李宗曜; 藤後廉; 小川貴弘; 平田健司; 真鍋治; 志賀哲; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 43, 5, 311, 314, Feb. 2019
映像情報メディア学会, English

A Note on Gastritis Detection from Gastric X-ray Images via Transfer Learning Approach
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 43, 5, 315, 318, Feb. 2019
映像情報メディア学会, Japanese

A Note on Generation of Gastritis Images Based on Progressive Growing GAN for Gastritis Classification
渡邊はるな; 藤後廉; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 43, 5, 319, 322, Feb. 2019
映像情報メディア学会, Japanese

Text-to-imageGANに基づくスタイル変換に関する検討-Image-to-textモデル導入による高精度化-
古寺恵; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2019, 2019

Semantic Segmentationに基づく地下鉄トンネルにおける変状検出に関する検討
WANG An; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2019, 2019

脳活動データを用いた注視画像の再構成における用いる視覚野の領域に関する検討
高田紗弥; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2019, 2019

Automatic Metastatic Bone Tumor Classification with DCNN-based Features Using Treatment-planning CT Images
Haruna Watanabe; Ren Togo; Takahiro Ogawa; Miki Haseyama; Koichi Yasuda; Khin Khin Tha; Kohsuke Kudo; Hiroki Shirato, INTERNATIONAL FORUM ON MEDICAL IMAGING IN ASIA 2019, 11050, 2019, [Peer-reviewed]
English

深層学習に基づく画像特徴量を利用した放射線治療用CT画像における転移性骨腫瘍の検出に関する検討
渡邊はるな; 藤後廉; 小川貴弘; 長谷山美紀; 安田耕一; THA Khin Khin; 工藤與亮; 白土博樹, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2018, ROMBUNNO.87, 27 Oct. 2018
Japanese

AC‐GANに基づく胃炎識別のための画像生成に関する検討
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2018, ROMBUNNO.89, 27 Oct. 2018
Japanese

Tri‐trainingに基づく胃X線画像を用いた胃炎の識別に関する検討
LI Zongyao; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2018, ROMBUNNO.88, 27 Oct. 2018
Japanese

AttnGANを用いたシーン検索に関する検討―再検索の導入による高精度化―
柳凜太郎; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2018, ROMBUNNO.10, 27 Oct. 2018
Japanese

A Note on Use of Generative Adversarial Networks for Gastritis Classification from Gastric X-ray images
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 117, 431(ITS2017 61-83), 299‐303, 08 Feb. 2018
Japanese

A Note on Use of Generative Adversarial Networks for Gastritis Classification from Gastric X-ray images
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 42, 4, 299, 303, Feb. 2018
映像情報メディア学会, Japanese

敵対的学習により生成された画像が与える審美的印象に関する検討
金井美岬; 藤後廉; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2017, ROMBUNNO.115, 28 Oct. 2017
Japanese

Estimation of Regions Related to Helicobacter Pylori Infection from Gastric X-ray Images
Ren Togo; Kenta Ishihara; Takahiro Ogawa; Miki Haseyama, 7, 8, Jul. 2017
English, Summary international conference

A Note on Selection of Representative Images for Deterioration Diagnosis of Steel Tower
藤後廉; 高橋翔; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 116, 463(ITS2016 42-76), 47‐50, 13 Feb. 2017
Japanese

A Note on Selection of Representative Images for Deterioration Diagnosis of Steel Tower
藤後廉; 高橋翔; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 41, 5, 47, 50, Feb. 2017
映像情報メディア学会, Japanese

A Note on Accurate Detection of Helicobacter Pylori Infection from Gastric X-ray Images (3)
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 電子情報通信学会技術研究報告, 115, 458(ITS2015 56-83), 333‐336, 15 Feb. 2016
Japanese

A Note on Accurate Detection of Helicobacter Pylori Infection from Gastric X-ray Images(3)Estimation of Regions Causing Classification Performance Degradation
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 映像情報メディア学会技術報告 = ITE technical report, 40, 6, 333, 336, Feb. 2016
映像情報メディア学会, Japanese

胃X線画像を用いたHelicobacter Pylori感染の高精度識別に関する検討(4)
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 信号処理シンポジウム講演論文集(CD-ROM), 31st, ROMBUNNO.B4‐2, 2016
Japanese

胃X線画像を用いたHelicobacter pylori感染の高精度識別に関する検討(2)―撮像方向が識別に有効な画像領域に与える影響に対する考察―
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 電気・情報関係学会北海道支部連合大会講演論文集(CD-ROM), 2015, ROMBUNNO.123, 07 Nov. 2015
Japanese

胃X線画像を用いたHelicobacter pylori感染の高精度識別に関する検討(1)―識別精度向上に有効な画像領域の抽出手法―
藤後廉; 石原賢太; 小川貴弘; 長谷山美紀, 信号処理シンポジウム講演論文集(CD-ROM), 30th, ROMBUNNO.C4‐4, 2015
Japanese

■ Lectures, oral presentations, etc.

データサイエンス人財育成の新しいカタチ
NoMaps 2021, Sep. 2021, Public discourse

AIを中心とした医療デジタル技術基盤の構築へ向けた取り組み
第60回日本消化器がん検診学会総会附置研究会3, Jun. 2021

ゴム材料開発のためのConditional StyleGANに基づく配合量からの電子顕微鏡画像の生成に関する一検討
映像情報メディア学会技術報告, Feb. 2021

実店舗の防犯カメラ映像データを用いた顧客の関心推定に関する検討～姿勢推定モデルから得られる特徴量を用いた異常検知モデルの精度検証～
映像情報メディア学会技術報告, Feb. 2021

ユーザの嗜好を反映可能なインテリアコーディネート検索技術の構築～コーディネートを表現可能な特徴の抽出と実データへの適用～
映像情報メディア学会技術報告, Feb. 2021

■ Affiliated academic society

IEEE

■ Research Themes

生成AIとの融合により潜在的嗜好を把握可能とするユーザ中心推薦技術の構築
科学研究費助成事業
01 Apr. 2024 - 31 Mar. 2028
長谷山美紀; 斉藤直輝; 小川貴弘; 藤後廉
日本学術振興会, 基盤研究(B), 北海道大学, 24K02942

Construction of ultra-low-volume anonymous learning technology and universal learning technology to improve the versatility of medical AI
Grants-in-Aid for Scientific Research
Apr. 2023 - Mar. 2028
藤後廉
本研究では，医療AIの汎用性向上を目指す超少量匿名学習技術および汎用学習技術の構築を目指す．一般画像を対象とするAI技術の指数関数的発展と比較して，医用画像を対象とするAI技術の応用範囲は依然として限定的である．そこで本研究では，一般画像と医用画像のそれぞれが有する性質の差異に着目することで，医療AIの汎用性向上を実現する技術を構築する．具体的には，まず，データ蒸留技術に基づき，匿名性の高い汎用蒸留画像を生成する．そして，モデルベースドメイン適応技術と組み合わせることで，汎用的用途で利用可能なデータおよびモデルの構築を実現する．本研究では，複数種類の医用画像および複数施設から得られる医用画像を対象として，構築技術の有効性検証を実施する．
本年度では，まず，医用画像と対象とする前に，一般画像におけるデータ蒸留技術の構築を行った．より基礎的なデータを対象とした理論構築を出発点とすることで，構築理論の有効性を検証しつつ，医用画像データへの応用可能性について検討可能となる．本年度は，一般データを対象としたデータ蒸留における基礎理論を構築し，ニューラルネットワーク分野のトップ論文誌Neural Networksへの採択に至った．また，これらの技術の改良版を画像処理分野における世界最大の国際会議International Conference on Image Processing (ICIP) およびコンピュータビジョン分野における世界最高峰の国際会議International Conference on Computer Vision and Pattern Recognition (CVPR)のデータ蒸留ワークショップへ投稿中である．
Japan Society for the Promotion of Science, Grant-in-Aid for Scientific Research (C), Hokkaido University, 23K11141

General-purpose deep learning theory for ultra-low computational complexity and low capacity in the age of edge AI
Grants-in-Aid for Scientific Research
01 Apr. 2021 - 31 Mar. 2026
小川貴弘; 藤後廉; 前田圭介
本研究課題では、エッジAI時代の超低演算量・低容量化を実現する汎用深層学習理論の構築を目指している。研究代表者が進めてきた低演算量・低容量バイナリスパース表現技術とクロスモーダル埋め込み技術の研究を融合させ、AIの演算量と学習データ量を大幅に削減可能な新たな理論を構築する。具体的に、最先端の深層学習モデルをバイナリスパース表現により模倣し、さらに、他のモダリティからの知識転移を行うことで、深層学習の利点である高い精度を保持しつつ、演算量削減と学習データ量の小規模化を同時に実現する。本研究課題では、構築理論の汎用性を示すとともに、エッジデバイス上での評価検証を行う。尚、本研究課題は研究分担者とともに遂行し、実施項目である「① モデルクローニング技術の実現による演算量の削減」および「② クロスモーダル知識転移技術の実現による学習データ量の小規模化」については、①の研究を小川・藤後が、②の研究を小川・前田が実施する。
令和4年度は、「バイナリスパース深層学習モデルの実現」を目指し、研究を遂行した。具体的に、演算量削減と学習データ量の小規模化のそれぞれを以下のように実現した。まず、構築済みの「深層学習モデルの中間層出力」と「バイナリスパース深層学習モデルの中間出力」との相関を最大化する理論に、データの近似誤差最小化を可能にする損失関数を新たに組み込むことで、各中間層出力を低演算量のバイナリスパース表現で模倣するモデルクローニングを実現した。次に、異なる種類のモダリティの相関を最大化する理論を構築することで、学習データ量の不足をモダリティ相関に基づき補間するクロスモーダル知識転移を実現した。研究成果の対外発表についても積極的に行い、コンピュータビジョン分野のトップ国際会議ECCVへの採択や、信号処理分野のトップ国際会議ICASSPへの採択に至った。
Japan Society for the Promotion of Science, Grant-in-Aid for Scientific Research (B), Hokkaido University, 23K21676

General-purpose deep learning theory for ultra-low computational complexity and low capacity in the age of edge AI
Grants-in-Aid for Scientific Research Grant-in-Aid for Scientific Research (B)
01 Apr. 2021 - 31 Mar. 2026
小川貴弘; 前田圭介; 藤後廉
本研究課題では、エッジAI時代の超低演算量・低容量化を実現する汎用深層学習理論の構築を目指す。研究代表者が進めてきた低演算量・低容量バイナリスパース表現技術とクロスモーダル埋め込み技術の研究を融合させ、AIの演算量と学習データ量を大幅に削減可能な新たな理論を構築する。具体的に、最先端の深層学習モデルをバイナリスパース表現により模倣し、さらに、他のモダリティからの知識転移を行うことで、深層学習の利点である高い精度を保持しつつ、演算量削減と学習データ量の小規模化を同時に実現する。本研究課題では、構築した理論が汎用性を有することを示すとともに、エッジデバイス上での評価検証を行う。尚、本研究課題は研究分担者とともに遂行し、実施項目である「① モデルクローニング技術の実現による演算量の削減」および「② クロスモーダル知識転移技術の実現による学習データ量の小規模化」については、①の研究を小川・藤後が、②の研究を小川・前田が実施する。
令和3年度は、「深層学習モデルにおける中間層出力」と「バイナリスパース表現係数」との間で相関を最大化するクロスモーダル埋め込み理論を構築した。具体的に、ソースドメインに対応する実数データとバイナリスパース表現係数との間でクロスモーダル埋め込みを行い、それらの相関が最大化されるよう、バイナリスパース表現における辞書学習を可能とした。この際、バイナリスパース表現係数は0または1の疎なデータであることに注目し、観測データがバイナリスパース値である制約を設けた新たなクロスモーダル埋め込み理論を実現した。さらに、構築した理論やその応用に関する研究成果の対外発表についても積極的に行い、クロスモーダル埋め込み理論を応用した研究成果が画像処理分野における世界最高峰の国際会議ICIP等に採択されている。
Japan Society for the Promotion of Science, Grant-in-Aid for Scientific Research (B), Hokkaido University, 21H03456

Development of a Sequential Data Cleansing Technique Based on Machine Learning for Medical Imaging
Grants-in-Aid for Scientific Research
01 Apr. 2020 - 31 Mar. 2024
Togo Ren
This study aims to develop a machine learning-based data cleansing technology for stomach X-ray images. In the field of medical image analysis, supervised learning based on large-scale data is increasingly recognized for its effectiveness. However, many of the currently proposed methods focus only on model construction and evaluation, without considering the effort involved in building datasets. To apply diagnostic support technology using machine learning in real-world settings, it is necessary to consider the total performance, including the cost of labeling data. Therefore, this research focuses on the aspect of dataset construction, which is essential for the societal implementation of machine learning, and aims to develop a technology that can efficiently perform data cleansing.
Japan Society for the Promotion of Science, Grant-in-Aid for Early-Career Scientists, Hokkaido University, 20K19857

機械学習に基づくマルチモーダル画像生成手法の構築
科学研究費助成事業特別研究員奨励費
25 Apr. 2019 - 31 Mar. 2021
藤後廉
本研究では，医用画像におけるマルチモーダル画像生成手法の構築を目的として，研究を遂行してきた．具体的に本研究では，胃X線画像および血液検査抗体値を用い，胃炎初期状態の患者の胃X線画像から，胃炎が進行した場合の未知情報を画像生成により推定することを目的とする．本研究は，【研究１：胃炎の進行予測画像の生成】および【研究2：画像生成手法のマルチモーダル化】の2つのテーマから構築されている．研究計画策定時点においては，研究１では，任意の胃がんリスクに応じた予測画像の生成を行い，研究２では，血液検査結果と照らし合わせた学習を行う推定器を加えた画像生成手法のマルチモーダル化を行う予定であった．
研究実施者はまず，高解像度である胃X線画像に対する画像生成を実現するため，徐々に解像度を向上させる画像生成手法であるProgressive Growing of GANs (PGGAN) に基づき画像生成手法を構築した．さらに生成画像の胃炎分類問題への応用可能性を示した．尚，本成果は学際論文誌IEEE Accessへの採録に至っている．
次に研究実施者は，血液検査値と胃X線画像の対応付けによる画像生成の研究に着手した．血液検査による結果をドメインラベルベクトルとして扱うことで血液検査結果の入力に対応した画像生成が可能となる手法を構築した．また，本成果に関して，画像認識分野における国内最大規模の学会である「画像の認識・理解シンポジウム(MIRU) 」においてポスター発表を行った．
以上より，研究実施者は，本研究に関する成果をほぼ研究計画通りに実施しており，研究成果を対外発表や論文として社会に還元している．
日本学術振興会, 特別研究員奨励費, 北海道大学, 19J10821

■ Industrial Property Rights

画像生成装置、ゴム組成物の配合推定装置及び学習装置
Patent right, 福地将志; 山田宏明; 伊藤和加奈; 長谷山美紀; 小川貴弘; 藤後廉, 住友ゴム工業株式会社, 国立大学法人北海道大学
特願2021-019382, 09 Feb. 2021
特開2021-136024, 13 Sep. 2021
特許第7603275号
202503020259470936

情報処理システム
Patent right, 本間勇紀; 山城輝久; 長谷山美紀; 小川貴弘; 藤後廉, 株式会社ニトリホールディングス, 国立大学法人北海道大学
特願2022-206332, 23 Dec. 2022
特開2024-090422, 04 Jul. 2024
202403013564367068

情報処理システム
Patent right, 本間勇紀; 藤後廉; 阿部真育; 長谷山美紀; 小川貴弘, 株式会社ニトリホールディングス, 国立大学法人北海道大学
特願2021-202578, 14 Dec. 2021
特開2023-087988, 26 Jun. 2023
202303018862076828

画像生成装置、ゴム組成物の配合推定装置及び学習装置
Patent right, 福地将志; 山田宏明; 伊藤和加奈; 長谷山美紀; 小川貴弘; 藤後廉, 住友ゴム工業株式会社, 国立大学法人北海道大学
特願2021-019382, 09 Feb. 2021
特開2021-136024, 13 Sep. 2021
202103009110722611

■ Academic and Social Contribution Activities/Other
Media Coverage

住友ゴム工業北大との共創型研究拠点「住友ゴムイノベーションベース・札幌」説明会
18 Sep. 2025
日刊油業報知新聞

住友ゴム✕北大共創型研究拠点開発
10 Sep. 2025
自動車タイヤ新聞

住友ゴム工業北海道大学に共創型研究拠点「住友ゴム・イノベーションベース・札幌」開設
09 Sep. 2025
日刊油業報知新聞

住友ゴム工業北海道大学に共創型研究拠点を開設
08 Sep. 2025
ゴム報知新聞

AI活用による技術革新に向けた共創型研究拠点開設
08 Sep. 2025
タイヤ新報

タイヤ開発を大改革住友ゴム産学連携に活路メーカーと足並みをそろえるために
08 Sep. 2025
日刊自動車新聞

住友ゴム×北海道大学 AI活用による技術革新に向けた共創型研究拠点開設
08 Sep. 2025
週刊タイヤ新報

住友ゴムフィジカルAI実用化にアクセル研究・製造現場への導入想定
04 Sep. 2025
化学工業日報

住友ゴム研究AI開発北大拠点活用ロボが実験代行
03 Sep. 2025
北海道新聞

北大の医学研究力強化を支援文科省事業に採択「医療AI開発に力」
26 Aug. 2025
北海道新聞デジタル

住友ゴム、北海道大学に共創型研究拠点を開設
08 Aug. 2025
KYODO NEWS PRWIRE

AI活用、高機能素材開発住友ゴム工業、北大に研究拠点開設へ
08 Aug. 2025
北海道新聞デジタル

住友ゴム、北大の総合イノベーション創発機構データ駆動型融合研究創発拠点内に「住友ゴムイノベーションベース・札幌」を開設
08 Aug. 2025
日本経済新聞

住友ゴム、北海道大学に共創型研究拠点を開設～AIを活用した技術革新と、未来のモビリティを取り巻く環境に向けた新たな価値創出を加速～
08 Aug. 2025
北海道大学プレスリリース

住友ゴム、北海道大学に共創型研究拠点を開設～AIを活用した技術革新と、未来のモビリティを取り巻く環境に向けた新たな価値創出を加速～
08 Aug. 2025
住友ゴム工業株式会社プレスリリース

“現場DX”を実現するAI×デジタルツイン熟練者の技能をモデル化などの最新論文【土木×AI第27回】
11 Sep. 2024
BUILT - ITmedia

土木におけるAI活用の現状と将来
17 Jul. 2024
けんせつPlaza

えりも町沿岸の天然コンブ漁場におけるブルーカーボンによるCO₂吸収量の調査に関するAI技術を用いた研究成果が、ジャーナル「Remote Sensing」に掲載されました
21 May 2024
北海道大学 X SDGs

胃X線画像からピロリ感染を判断するAIも開発進む
10 Mar. 2018
日経メディカル
R. Togo et.al., WJGO, 2018に関して, [Paper]

一个项目帮你了解数据集蒸馏Datas
机器之心 (中国)

Most Popular AI Research Aug 2022

知識グラフと強化学習を組み合わせた説明可能な新しいアーティスト推薦システム
Other than myself
AI-SCHOLAR
[Internet]

内閣官房教育未来創造会議にて北海道大学ニトリみらい社会デザイン講座令和２年度研究公募の取組が紹介
[Others]

タイヤ構造見える化
日刊工業新聞
[Paper]

ネイチャーインデックスに掲載住友ゴムのAI技術研究成果
ゴムタイムス
[Paper]

AI技術の共同研究成果 Nature Indexに掲載
Other than myself
ゴム化学新聞
[Paper]

道新プラス道新受験情報 2022高校入試合格データ特集「AIってなんだろう? vol. 7」
北海道新聞
R. Togo et al., IEEE Access 2019に関して, [Paper]

Artificial intelligence gets to grip with tyres
Nature Index
R. Togo et.al, IEEE Access, 2019に関して, [Internet]

住友ゴムと北大、AI技術の共同研究成果が研究成果のデータベース「Nature Index」の材料科学特集として掲載
日本経済新聞
R. Togo et.al, IEEE Access, 2019に関して, [Paper]

北海道大学長谷山美紀教授と住友ゴム工業株式会社との共同研究成果を「Nature Index」に掲載
北海道大学プレスリリース
と住友ゴム工業株式会社との共同研究成果を「Nature Index」に掲載, 北海道大学プレスリリース, [Others]

住友ゴムと北海道大学とのAI技術の共同研究成果を「Nature Index」に掲載
住友ゴム工業株式会社プレスリリース
R. Togo et.al, IEEE Access, 2019に関して, [Others]

北海道大情報科学研究院 AI活用でインフラ点検
日経新聞
Guang Li, et al., ICIP, 2020に関して, [Paper]

道新プラス道新受験情報 2022大学・短大特集「AIってなんだろう? vol. 6」
北海道新聞
Zongyao Li et al., ICASSP 2020に関して, [Paper]

How to Make Artificial Intelligence More Democratic
SCIENTIFIC AMERICAN
Guang Li, Ren Togo, et al., ICIP, 2020に関して, [Internet]

道新プラス道新受験情報 2021大学・短大特集「AIってなんだろう? vol. 5」
北海道新聞
李宗曜, 他, 映像情報メディア学会技術報告, 2019に関して, [Paper]

Tyre Leap AI Analysis Technology: An Overview
TYRE TRENDS
R. Togo et.al, IEEE Access, 2019に関して, [Paper]

道新プラス道新受験情報 2020大学・短大特集「AIってなんだろう? vol. 3」
北海道新聞
R. Togo et.al, CBM, 2016に関して, [Paper]

山岳トンネル切羽の地質を自動判定－圧縮強度、風化度、割れ目状態を定量評価－
Other than myself
安藤間プレスリリース
Kentaro Yamamoto, et al., GCCE2019に関して, [Others]

SEARCH

Togo Ren

Faculty of Information Science and Technology Media and Network Technologies Information Media Science and TechnologyAssociate Professor

Researcher basic information

Research activity information

Faculty of Information Science and Technology　Media and Network Technologies　Information Media Science and Technology Associate Professor