Publications

[ 11 PAPERS ]
01

Multimodal Graph Representation Learning over Arbitrary Sets of Modalities

Patapati, Santosh and Srinivasan, Trisanth

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Visionpp. 7104–71152026
02
Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving

Patapati, Santosh and Srinivasan, Trisanth

2025 IEEE 102nd Vehicular Technology Conference (VTC2025-Fall)pp. 1–52026
03

WebNav: An Intelligent Agent for Voice-Controlled Web Navigation

Srinivasan, Trisanth and Patapati, Santosh

arXiv preprint arXiv:2503.138432025
04

CLIP-MG: Guiding Semantic Attention with Skeletal Pose Features and RGB Data for Micro-Gesture Recognition on the iMiGUE Dataset

Patapati, Santosh and Srinivasan, Trisanth and Adiraju, Amith

MiGa Workshop at IJCAI 20252025
05

DURA-CPS: A Multi-Role Orchestrator for Dependability Assurance in LLM-Enabled Cyber-Physical Systems

Srinivasan, Trisanth and Patapati, Santosh and Musku, Himani and Gode, Idhant and Arora, Aditya and Bhattacharya, Samvit and Nazriev, Abubakr and Hirave, Sanika and Kanjiani, Zaryab and Ghose, Srinjoy

2025 55th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)pp. 63–702025
06

A Framework for ECA-Based Psychotherapy

Patapati, Santosh and Srinivasan, Trisanth and Musku, Himani and Adiraju, Amith

Proceedings of the 33rd ACM International Conference on Multimedia2025
07

PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications

Srinivasan, Trisanth and Patapati, Santosh

Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Workshopspp. 6575–65832025
08

Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities

Srinivasan, Trisanth and Patapati, Santosh

arXiv preprint arXiv:2508.195622025
09

GenECA: A General-Purpose Framework for Real-Time Adaptive Multimodal Embodied Conversational Agents

Patapati, Santosh and Tatineni, Aashrith and Srinivasan, Trisanth

Interspeech 2025pp. 3541–35422025
10

Most DAIC-WOZ Depression Classifiers Are Invalid, They Don't Learn Task-Specific Features: Preliminary Findings From a Large-Scale Reproducibility Study

Patapati, Santosh Varma and Pendyala, Ishan and Ambati, Murari and Kunadharaju, Pranav and Kokati, Pranav and Adiraju, Amit and Srinivasan, Trisanth

Companion Proceedings of the 27th International Conference on Multimodal Interactionpp. 17–212025
11

Vision-Language Cross-Attention for Real-Time Autonomous Driving

Patapati, Santosh and Srinivasan, Trisanth and Ambati, Murari

arXiv preprint arXiv:2507.230642025