Publications

|

Optimizing Movie Selections: A Multi-Task, Multi-Modal Framework with Strategies for Missing Modality Challenges

Authors: Subham Raj, Pawan Agrawal, Sriparna Saha (IIT Patna), Brijraj Singh, Niranjan Pedanekar
ACM Symposium on Applied Computing (SAC) | April 2024

|

Open-set Object Detection By Aligning Known Class Representations

Authors: Hiran Sarkar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth Balasubramanian (IIT Hyderabad)
Winter Conference on Applications of Computer Vision (WACV) | January 2024

|

Efficient infusion of self-supervised representations in Automatic Speech Recognition

Authors: Darshan Prabhu, Saiganesh Mirishkar, Pankaj Wasnik
Poster presentation at the Neural Information Processing Systems (NeurIPS) 3rd Workshop | December 2023

|

Enhancing Social Recommendation with Multi-View BERT Network

Authors: Tushar Prakash, Raksha Jalan, Naoyuki Onoe
IEEE International Conference on Data Mining (IEEE ICDM) | December 2023

|

Fiducial Focus Augmentation for Facial Landmark Detection

Authors: Purbayan Kar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth Balasubramanian
British Machine Vision Conference (BMVC) | November 2023

|

Impulsion of Movie's Content-Based Factors in Multi-Modal Movie Recommendation System

Authors: Prabir Mondal, Pulkit Kapoor, Siddharth Singh, Sriparna Saha, Naoyuki Onoe, Brijraj Singh
International Conference on Neural Information Processing (ICONIP) | November 2023

|

LLM Based Generation of Item-Description for Recommendation System

Authors: Arkadeep Acharya, Brijraj Singh and Naoyuki Onoe
ACM Conference on Recommender Systems (RecSys) | September 2023

|

CR-SoRec: BERT driven Consistency Regularization for Social Recommendation

Authors: Tushar Prakash, Raksha Jalan, Brijraj Singh and Naoyuki Onoe
ACM Conference on Recommender Systems (RecSys) | September 2023

|

Iteratively Improving Speech Recognition and Voice Conversion

Authors: Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe
INTERSPEECH | August 2023

|

Cd-HRNN: Content-Driven HRNN to Improve Session-Based Recommendation System

Authors: Sonal Dabral, Brijraj Singh and Naoyuki Onoe
International Joint Conference on Neural Networks (IJCNN Main Conference) | April 2023

|

A Multi-Modal Multi-Task Based Approach for Movie Recommendation

Authors: Sriparna Saha (IIT Patna) and Naoyuki Onoe
International Joint Conference on Neural Networks (IJCNN Main Conference) | April 2023

|

A Meta-Learning Based Generative Model with Graph Attention Network for Multi-Modal Recommender Systems

Authors: Sriparna Saha (IIT Patna) and Naoyuki Onoe
INNS DLIA Workshop / IJCNN | April 2023

|

Task-Specific and Graph Convolutional Network Based Multi-Modal Movie Recommendation System in Indian Setting

Authors: Sriparna Saha (IIT Patna) and Naoyuki Onoe
INNS DLIA Workshop / IJCNN | April 2023

|

Revisiting Class Imbalance for End-to-end Semi-Supervised Object Detection

Authors: Purbayan Kar, Vishal Chudasama, Pankaj Wasnik and Naoyuki Onoe
Efficient Deep Learning for Computer Vision (ECV) Workshop in CVPR | April 2023

|

Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing

Authors: Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) | February 2023

|

Hierarchical disentangled representation learning for singing voice conversion

Authors: Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) | February 2023

|

Graph Network based Approaches for Multi-modal Movie Recommendation System

Authors: Daipayan Chakder**, Prabir Mondal**, Subham Raj**, Sriparna Saha**, Angshuman Ghosh, Naoyuki Onoe
IEEE International Conference on Systems, Man, and Cybernetics (SMC) | November 2022

|

Semi-supervised Acoustic and Language Modeling for Hindi ASR

Authors: Tarun Sai Bandarupalli*, Shakti Rath*, Nirmesh Shah, Naoyuki Onoe, Sriram Ganapathy*
INTERSPEECH | September 2022

|

Towards Developing a Multi-Modal Video Recommendation System

Authors: Sriram Pingali**, Prabir Mondal**, Daipayan Chakder**, Sriparna Saha**, Angshuman Ghosh
International Joint Conference on Neural Networks (IJCNN) | September 2022

|

Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer

Authors: Shrutina Agarwal*, Sriram Ganapathy*, Naoya Takahashi
INTERSPEECH | September 2022

|

M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation

Authors: Vishal Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe
Conference on Computer Vision and Pattern Recognition (CVPR) | June 2022

|

A Unified Model for Fingerprint Authentication and Presentation Attack Detection

Authors: Additya Popli***, Saraansh Tandon***, Joshua J. Engelsma#, Naoyuki Onoe, Atsushi Okubo, Anoop Namboodiri***
International Joint Conference on Biometrics (IJCB) | April 2021

|

End-to-end Lyrics Recognition with Voice to Singing Style Transfer

Authors: Sakya Basak*, Shrutina Agarwal*, Sriram Ganapathy*, Naoya Takahashi
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) | February 2021

*** International Institute of Information Technology Hyderabad
** Indian Institute of Technology Patna
* Indian Institute of Science, Bangalore
# Michigan State University
